Integration of Microbial Ecology and Statistics: a Test To Compare Gene Libraries
Open Access
- 1 September 2004
- journal article
- research article
- Published by American Society for Microbiology in Applied and Environmental Microbiology
- Vol. 70 (9) , 5485-5492
- https://doi.org/10.1128/aem.70.9.5485-5492.2004
Abstract
Libraries of 16S rRNA genes provide insight into the membership of microbial communities. Statistical methods help to determine whether differences in library composition are artifacts of sampling or are due to underlying differences in the communities from which they are derived. To contribute to a growing statistical framework for comparing 16S rRNA libraries, we present a computer program, ∫-LIBSHUFF, which calculates the integral form of the Cramér-von Mises statistic. This implementation builds upon the LIBSHUFF program, which uses an approximation of the statistic and makes a number of modifications that improve precision and accuracy. Once ∫-LIBSHUFF calculates the P values, when pairwise comparisons are tested at the 0.05 level, the probability of falsely identifying a significant P value is 0.098 for a study with two libraries, 0.265 for three libraries, and 0.460 for four libraries. The potential negative effects of making the multiple pairwise comparisons necessitate correcting for the increased likelihood that differences between treatments are due to chance and do not reflect biological differences. Using ∫-LIBSHUFF, we found that previously published 16S rRNA gene libraries constructed from Scottish and Wisconsin soils contained different bacterial lineages. We also analyzed the published libraries constructed for the zebrafish gut microflora and found statistically significant changes in the community during development of the host. These analyses illustrate the power of ∫-LIBSHUFF to detect differences between communities, providing the basis for ecological inference about the association of soil productivity or host gene expression and microbial community composition.Keywords
This publication has 34 references indexed in Scilit:
- Identification of uncultured bacteria tightly associated with the intestine of the earthworm Lumbricus rubellus (Lumbricidae; Oligochaeta)Soil Biology and Biochemistry, 2003
- Statistical Approaches for Estimating Actinobacterial Diversity in Marine SedimentsApplied and Environmental Microbiology, 2003
- Metagenomic Profiling: Microarray Analysis of an Environmental Genomic LibraryApplied and Environmental Microbiology, 2003
- Molecular analysis of bacterial microbiota in the gut of the termite Reticulitermes speratus (Isoptera; Rhinotermitidae)FEMS Microbiology Ecology, 2003
- A Census of rRNA Genes and Linked Genomic Sequences within a Soil Metagenomic LibraryApplied and Environmental Microbiology, 2003
- Depth Distribution of Microbial Diversity in Mono Lake, a Meromictic Soda Lake in CaliforniaApplied and Environmental Microbiology, 2003
- Phylogenetic characterization of the bacterial assemblage associated with mucous secretions of the hydrothermal vent polychaete Paralvinella palmiformisFEMS Microbiology Ecology, 2002
- Phylogenetic Approaches for Describing and Comparing the Diversity of Microbial CommunitiesApplied and Environmental Microbiology, 2002
- Empirical and Theoretical Bacterial Diversity in Four Arizona SoilsApplied and Environmental Microbiology, 2002
- THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERSBiometrika, 1953