STATISTICAL TESTS FOR TAXONOMIC DISTINCTIVENESS FROM OBSERVATIONS OF MONOPHYLY
Open Access
- 6 February 2007
- Vol. 61 (2) , 317-323
- https://doi.org/10.1111/j.1558-5646.2007.00023.x
Abstract
The observation of monophyly for a specified set of genealogical lineages is often used to place the lineages into a distinctive taxonomic entity. However, it is sometimes possible that monophyly of the lineages can occur by chance as an outcome of the random branching of lineages within a single taxon. Thus, especially for small samples, an observation of monophyly for a set of lineages—even if strongly supported statistically—does not necessarily indicate that the lineages are from a distinctive group. Here I develop a test of the null hypothesis that monophyly is a chance outcome of random branching. I also compute the sample size required so that the probability of chance occurrence of monophyly of a specified set of lineages lies below a prescribed tolerance. Under the null model of random branching, the probability that monophyly of the lineages in an index group occurs by chance is substantial if the sample is highly asymmetric, that is, if only a few of the sampled lineages are from the index group, or if only a few lineages are external to the group. If sample sizes are similar inside and outside the group of interest, however, chance occurrence of monophyly can be rejected at stringent significance levels (P < 10−5) even for quite small samples (≈ 20 total lineages). For a fixed total sample size, rejection of the null hypothesis of random branching in a single taxon occurs at the most stringent level if samples of nearly equal size inside and outside the index group—with a slightly greater size within the index group—are used. Similar results apply, with smaller sample sizes needed, when reciprocal monophyly of two groups, rather than monophyly of a single group, is of interest. The results suggest minimal sample sizes required for inferences to be made about taxonomic distinctiveness from observations of monophyly.Keywords
This publication has 21 references indexed in Scilit:
- DNA Barcoding Will Often Fail to Discover New Animal Species over Broad Parameter SpaceSystematic Biology, 2006
- Prioritizing species conservation: does the Cape Verde kite exist?Proceedings Of The Royal Society B-Biological Sciences, 2005
- Biological identifications through DNA barcodesProceedings Of The Royal Society B-Biological Sciences, 2003
- MATHEMATICAL CONSEQUENCES OF THE GENEALOGICAL SPECIES CONCEPTEvolution, 2002
- Stochastic models and descriptive statistics for phylogenetic trees, from Yule to todayStatistical Science, 2001
- Neandertal DNA Sequences and the Origin of Modern HumansCell, 1997
- A Likelihood-Ratio Test of MonophylySystematic Biology, 1996
- Probabilities of Evolutionary TreesSystematic Biology, 1994
- Null Models for the Number of Evolutionary Steps in a Character on a Phylogenetic TreeEvolution, 1991
- The probabilities of rooted tree-shapes generated by random bifurcationAdvances in Applied Probability, 1971