The taming of an impossible child: a standardized all-in approach to the phylogeny of Hymenoptera using public database sequences

Open Access

18 August 2011

journal article
research article
Published by Springer Nature in BMC Biology

Vol. 9 (1) , 55
https://doi.org/10.1186/1741-7007-9-55

Abstract

Enormous molecular sequence data have been accumulated over the past several years and are still exponentially growing with the use of faster and cheaper sequencing techniques. There is high and widespread interest in using these data for phylogenetic analyses. However, the amount of data that one can retrieve from public sequence repositories is virtually impossible to tame without dedicated software that automates processes. Here we present a novel bioinformatics pipeline for downloading, formatting, filtering and analyzing public sequence data deposited in GenBank. It combines some well-established programs with numerous newly developed software tools (available at http://software.zfmk.de/).

This publication has 57 references indexed in Scilit:

A large-scale phylogeny of Amphibia including over 2800 species, and a revised classification of extant frogs, salamanders, and caecilians
Molecular Phylogenetics and Evolution, 2011
A Rapid Bootstrap Algorithm for the RAxML Web Servers
Systematic Biology, 2008
The PhyLoTA Browser: Processing GenBank for Molecular Phylogenetics Research
Systematic Biology, 2008
Recent developments in the MAFFT multiple sequence alignment program
Briefings in Bioinformatics, 2008
Improvement of Phylogenies after Removing Divergent and Ambiguously Aligned Blocks from Protein Sequence Alignments
Systematic Biology, 2007
Insights into social insects from the genome of the honeybee Apis mellifera
Nature, 2006
RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models
Bioinformatics, 2006
ProtTest: selection of best-fit models of protein evolution
Bioinformatics, 2005
MUSCLE: multiple sequence alignment with high accuracy and high throughput
Nucleic Acids Research, 2004