Accurately quantifying low-abundant targets amid similar sequences by revealing hidden correlations in oligonucleotide microarray data
- 12 September 2006
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 103 (37) , 13629-13634
- https://doi.org/10.1073/pnas.0601476103
Abstract
Microarrays have enabled the determination of how thousands of genes are expressed to coordinate function within single organisms. Yet applications to natural or engineered communities where different organisms interact to produce complex properties are hampered by theoretical and technological limitations. Here we describe a general method to accurately identify low-abundant targets in systems containing complex mixtures of homologous targets. We combined an analytical predictor of nonspecific probe–target interactions (cross-hybridization) with an optimization algorithm that iteratively deconvolutes true probe–target signal from raw signal affected by spurious contributions (cross-hybridization, noise, background, and unequal specific hybridization response). The method was capable of quantifying, with unprecedented specificity and accuracy, ribosomal RNA (rRNA) sequences in artificial and natural communities. Controlled experiments with spiked rRNA into artificial and natural communities demonstrated the accuracy of identification and quantitative behavior over different concentration ranges. Finally, we illustrated the power of this methodology for accurate detection of low-abundant targets in natural communities. We accurately identified Vibrio taxa in coastal marine samples at their natural concentrations (<0.05% of total bacteria), despite the high potential for cross-hybridization by hundreds of different coexisting rRNAs, suggesting this methodology should be expandable to any microarray platform and system requiring accurate identification of low-abundant targets amid pools of similar sequences.Keywords
This publication has 47 references indexed in Scilit:
- Base Pair Interactions and Hybridization Isotherms of Matched and Mismatched Oligonucleotide Probes on MicroarraysLangmuir, 2005
- GenXHC: a probabilistic generative model for cross-hybridization compensation in high-density genome-wide microarray dataBioinformatics, 2005
- Fine-scale phylogenetic architecture of a complex bacterial communityNature, 2004
- Environmental Genome Shotgun Sequencing of the Sargasso SeaScience, 2004
- Community structure and metabolism through reconstruction of microbial genomes from the environmentNature, 2004
- ChipCheckA Program Predicting Total Hybridization Equilibria for DNA Binding to Small Oligonucleotide MicroarraysJournal of Chemical Information and Computer Sciences, 2003
- Exploration, normalization, and summaries of high density oligonucleotide array probe level dataBiostatistics, 2003
- Quantitative Detection of Microbial Genes by Using DNA MicroarraysApplied and Environmental Microbiology, 2002
- The human body as microbial observatoryNature Genetics, 2002
- Molecular interactions on microarraysNature Genetics, 1999