Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly
Open Access
- 19 June 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (21) , 2878-2881
- https://doi.org/10.1093/bioinformatics/btp377
Abstract
Motivation: Most microbial species cannot be cultured in the laboratory. Metagenomic sequencing may still yield a complete genome if the sequenced community is enriched and the sequencing coverage is high. However, the complexity of a natural population may cause the enrichment culture to contain multiple related strains. This diversity can confound existing strict assembly programs and lead to a fragmented assembly, which is unnecessary if a related reference genome is available to serve as a scaffold.
Results: Here, we map short metagenomic sequencing reads from a population of strains to a related reference genome, and compose a genome that captures the consensus of the population's sequences. We show that by iterating the mapping and assembly procedure, the coverage increases while the similarity with the reference genome decreases. This indicates that the assembly becomes less dependent on the reference genome and approaches the consensus genome of the multi-strain population.
Contact: dutilh@cmbi.ru.nl
Supplementary Information: Supplementary data are available at Bioinformatics online.
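The iteration described in the Results can be illustrated with a minimal toy sketch. This is not the authors' pipeline: the brute-force ungapped mapper, the mismatch threshold, the majority-vote consensus, the rule that uncovered positions keep the current reference base, and all names (`best_offset`, `consensus_round`, `iterate`) and sequences are assumptions made for illustration only.

```python
from collections import Counter


def best_offset(read, ref, max_mismatches):
    """Return the ungapped offset with the fewest mismatches, or None if
    every placement exceeds max_mismatches (hypothetical toy mapper)."""
    best, best_mm = None, max_mismatches + 1
    for off in range(len(ref) - len(read) + 1):
        mm = sum(a != b for a, b in zip(read, ref[off:off + len(read)]))
        if mm < best_mm:  # strict '<' keeps the leftmost best placement
            best, best_mm = off, mm
    return best


def consensus_round(ref, reads, max_mismatches=1):
    """Map all reads to ref, then call a majority-vote consensus.
    Assumption: positions without mapped reads keep the current ref base."""
    piles = [Counter() for _ in ref]
    for read in reads:
        off = best_offset(read, ref, max_mismatches)
        if off is None:
            continue  # read is too divergent from the current reference
        for i, base in enumerate(read):
            piles[off + i][base] += 1
    new_ref = "".join(
        pile.most_common(1)[0][0] if pile else base
        for pile, base in zip(piles, ref)
    )
    coverage = sum(1 for pile in piles if pile) / len(ref)
    return new_ref, coverage


def iterate(reference, reads, rounds=3, max_mismatches=1):
    """Repeat mapping and consensus calling, using each consensus as the
    next reference. Records (coverage, identity to the original reference)."""
    current, history = reference, []
    for _ in range(rounds):
        current, cov = consensus_round(current, reads, max_mismatches)
        ident = sum(a == b for a, b in zip(current, reference)) / len(reference)
        history.append((cov, ident))
    return current, history


# Made-up sequences: the population differs from the reference at 3 sites.
population = "ACGTTGCAAGCTTAGGCATCGATC"
reference = "ACGTTGCAAGATGCGGCATCGATC"
reads = [population[s:s + 8] for s in (0, 3, 7, 13, 16)]

final, history = iterate(reference, reads)
for round_no, (cov, ident) in enumerate(history, start=1):
    print(f"round {round_no}: coverage={cov:.3f} identity_to_reference={ident:.3f}")
```

In this toy run, the read that spans all three divergent sites cannot be placed in the first round. Once the neighbouring reads have corrected two of those sites, it maps in the second round, so coverage rises while identity to the original reference falls.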