The taming of an impossible child: a standardized all-in approach to the phylogeny of Hymenoptera using public database sequences
Open Access
- 18 August 2011
- journal article
- research article
- Published by Springer Nature in BMC Biology
- Vol. 9 (1) , 55
- https://doi.org/10.1186/1741-7007-9-55
Abstract
Enormous molecular sequence data have been accumulated over the past several years and are still exponentially growing with the use of faster and cheaper sequencing techniques. There is high and widespread interest in using these data for phylogenetic analyses. However, the amount of data that one can retrieve from public sequence repositories is virtually impossible to tame without dedicated software that automates processes. Here we present a novel bioinformatics pipeline for downloading, formatting, filtering and analyzing public sequence data deposited in GenBank. It combines some well-established programs with numerous newly developed software tools (available at http://software.zfmk.de/).This publication has 57 references indexed in Scilit:
- A large-scale phylogeny of Amphibia including over 2800 species, and a revised classification of extant frogs, salamanders, and caeciliansMolecular Phylogenetics and Evolution, 2011
- A Rapid Bootstrap Algorithm for the RAxML Web ServersSystematic Biology, 2008
- The PhyLoTA Browser: Processing GenBank for Molecular Phylogenetics ResearchSystematic Biology, 2008
- Recent developments in the MAFFT multiple sequence alignment programBriefings in Bioinformatics, 2008
- Improvement of Phylogenies after Removing Divergent and Ambiguously Aligned Blocks from Protein Sequence AlignmentsSystematic Biology, 2007
- Insights into social insects from the genome of the honeybee Apis melliferaNature, 2006
- RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed modelsBioinformatics, 2006
- ProtTest: selection of best-fit models of protein evolutionBioinformatics, 2005
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004