The empirical Bayes approach as a tool to identify non-random species associations
- 15 October 2009
- journal article
- research article
- Published by Springer Nature in Oecologia
- Vol. 162 (2) , 463-477
- https://doi.org/10.1007/s00442-009-1474-y
Abstract
A statistical challenge in community ecology is to identify segregated and aggregated pairs of species from a binary presence–absence matrix, which often contains hundreds or thousands of such potential pairs. A similar challenge is found in genomics and proteomics, where the expression of thousands of genes in microarrays must be statistically analyzed. Here we adapt the empirical Bayes method to identify statistically significant species pairs in a binary presence–absence matrix. We evaluated the performance of a simple confidence interval, a sequential Bonferroni test, and two tests based on the mean and the confidence interval of an empirical Bayes method. Observed patterns were compared to patterns generated from null model randomizations that preserved matrix row and column totals. We evaluated these four methods with random matrices and also with random matrices that had been seeded with an additional segregated or aggregated species pair. The Bayes methods and Bonferroni corrections reduced the frequency of false-positive tests (type I error) in random matrices, but did not always correctly identify the non-random pair in a seeded matrix (type II error). All of the methods were vulnerable to identifying spurious secondary associations in the seeded matrices. When applied to a set of 272 published presence–absence matrices, even the most conservative tests indicated a fourfold increase in the frequency of perfectly segregated “checkerboard” species pairs compared to the null expectation, and a greater predominance of segregated versus aggregated species pairs. The tests did not reveal a large number of significant species pairs in the Vanuatu bird matrix, but in the much smaller Galapagos bird matrix they correctly identified a concentration of segregated species pairs in the genus Geospiza. The Bayesian methods provide for increased selectivity in identifying non-random species pairs, but the analyses will be most powerful if investigators can use a priori biological criteria to identify potential sets of interacting species.Keywords
This publication has 43 references indexed in Scilit:
- Patterns in the assembly of an island plant communityJournal of Biogeography, 2006
- The control of the false discovery rate in multiple testing under dependencyThe Annals of Statistics, 2001
- Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple TestingJournal of the Royal Statistical Society Series B: Statistical Methodology, 1995
- Land snail faunas of the Napier and Oscar Ranges, Western Australia; diversity, distribution and speciationBiological Journal of the Linnean Society, 1992
- 20. A Null Model for Null Models in BiogeographyPublished by Walter de Gruyter GmbH ,1984
- Examination of the ?null? model of connor and simberloff for species co-occurrences on IslandsOecologia, 1982
- Changes in Species Composition of Floras on Islets Near Perth, Western AustraliaJournal of Biogeography, 1980
- The Assembly of Species Communities: Chance or Competition?Ecology, 1979
- Lots of Weeds: Insular Phytogeography of Vacant Urban LotsJournal of Biogeography, 1979
- Origin of the New Hebridean AvifaunaEmu - Austral Ornithology, 1976