Characterization of large‐insert DNA libraries from soil for environmental genomic studies of Archaea
- 11 August 2004
- journal article
- research article
- Published by Wiley in Environmental Microbiology
- Vol. 6 (9) , 970-980
- https://doi.org/10.1111/j.1462-2920.2004.00663.x
Abstract
Complex genomic libraries are increasingly being used to retrieve complete genes, operons or large genomic fragments directly from environmental samples, without the need to cultivate the respective microorganisms. We report on the construction of three large-insert fosmid libraries in total covering 3 Gbp of community DNA from two different soil samples, a sandy ecosystem and a mixed forest soil. In a fosmid end sequencing approach including 5376 sequence tags of approximately 700 bp length, we show that mostly bacterial and, to a much lesser extent, archaeal and eukaryotic genome fragments (approximately 1% each) have been captured in our libraries. The diversity of putative protein-encoding genes, as reflected by their distribution into different COG clusters, was comparable to that encoded in complete genomes of cultivated microorganisms. A huge variety of genomic fragments has been captured in our libraries, as seen by comparison with sequences in the public databases and by the large variation in G+C contents. We dissect differences between the libraries, which relate to the different ecosystems analysed and to biases introduced by different DNA preparations. Furthermore, a range of taxonomic marker genes (other than 16S rRNA) has been identified that allows the assignment of genome fragments to specific lineages. The complete sequences of two genome fragments identified as being affiliated with Archaea, based on a gene encoding a CDC48 homologue and a thermosome subunit, respectively, are presented and discussed. We thereby extend the genomic information of uncultivated crenarchaeota from soil and offer hints to specific metabolic traits present in this group.Keywords
This publication has 70 references indexed in Scilit:
- Environmental Genome Shotgun Sequencing of the Sargasso SeaScience, 2004
- Community structure and metabolism through reconstruction of microbial genomes from the environmentNature, 2004
- Metagenomic Profiling: Microarray Analysis of an Environmental Genomic LibraryApplied and Environmental Microbiology, 2003
- Unsuspected diversity among marine aerobic anoxygenic phototrophsNature, 2002
- Grassland Management Regimens Reduce Small-Scale Heterogeneity and Species Diversity of β-Proteobacterial Ammonia Oxidizer PopulationsApplied and Environmental Microbiology, 2002
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- Everything in moderation: Archaea as ‘non-extremophiles’Current Opinion in Genetics & Development, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Fully automated genome analysis that reflects user needs and preferences. A detailed introduction to the MAGPIE system architectureBiochimie, 1996
- Acidophilic and thermophilicBacillusstrains from geothermally heated antarctic soilFEMS Microbiology Letters, 1989