EDGAR: A software framework for the comparative analysis of prokaryotic genomes
Top Cited Papers
Open Access
- 20 May 2009
- journal article
- software
- Published by Springer Nature in BMC Bioinformatics
- Vol. 10 (1) , 1-14
- https://doi.org/10.1186/1471-2105-10-154
Abstract
The introduction of next generation sequencing approaches has caused a rapid increase in the number of completely sequenced genomes. As one result of this development, it is now feasible to analyze large groups of related genomes in a comparative approach. A main task in comparative genomics is the identification of orthologous genes in different genomes and the classification of genes as core genes or singletons. To support these studies EDGAR – "Efficient Database framework for comparative Genome Analyses using BLAST score Ratios" – was developed. EDGAR is designed to automatically perform genome comparisons in a high throughput approach. Comparative analyses for 582 genomes across 75 genus groups taken from the NCBI genomes database were conducted with the software and the results were integrated into an underlying database. To demonstrate a specific application case, we analyzed ten genomes of the bacterial genus Xanthomonas, for which phylogenetic studies were awkward due to divergent taxonomic systems. The resultant phylogeny EDGAR provided was consistent with outcomes from traditional approaches performed recently and moreover, it was possible to root each strain with unprecedented accuracy. EDGAR provides novel analysis features and significantly simplifies the comparative analysis of related genomes. The software supports a quick survey of evolutionary relationships and simplifies the process of obtaining new biological insights into the differential gene content of kindred genomes. Visualization features, like synteny plots or Venn diagrams, are offered to the scientific community through a web-based and therefore platform independent user interface http://edgar.cebitec.uni-bielefeld.de , where the precomputed data sets can be browsed.Keywords
This publication has 48 references indexed in Scilit:
- Whole-genome comparison of disease and carriage strains provides insights into virulence evolution in Neisseria meningitidisProceedings of the National Academy of Sciences, 2008
- xBASE2: a comprehensive resource for comparative bacterial genomicsNucleic Acids Research, 2007
- eggNOG: automated construction and annotation of orthologous groups of genesNucleic Acids Research, 2007
- Improvement of Phylogenies after Removing Divergent and Ambiguously Aligned Blocks from Protein Sequence AlignmentsSystematic Biology, 2007
- Identifying bacterial genes and endosymbiont DNA with GlimmerBioinformatics, 2007
- Database resources of the National Center for Biotechnology InformationNucleic Acids Research, 2006
- Ensembl 2007Nucleic Acids Research, 2006
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- OrthoMCL: Identification of Ortholog Groups for Eukaryotic GenomesGenome Research, 2003
- Comparison of the genomes of two Xanthomonas pathogens with differing host specificitiesNature, 2002