Neighbour joining trees, dominant markers and population genetic structure
- 17 March 2004
- journal article
- research article
- Published by Springer Nature in Heredity
- Vol. 92 (6) , 490-498
- https://doi.org/10.1038/sj.hdy.6800445
Abstract
Population genetic theory for ‘traditional’ codominant loci showing low levels of allelic diversity (eg allozymes) has been well characterised and evaluated. In contrast, appropriate methods for the analysis of data from more recently developed marker systems are still being refined. For multilocus dominant markers such as amplified fragment length polymorphisms (AFLPs) and randomly amplified polymorphic DNA (RAPDs), the methods of data analysis can be split into two main categories. In population-based approaches, population allele frequencies are compared to obtain some measure of the partitioning of genetic diversity into within- and between-population components. In contrast, individual-based approaches use individual multilocus genotypes as the unit of analysis. Inferences on population processes such as gene flow are based on inter-relationships among individual samples as visualised on phenetic diagrams such as neighbour joining trees. Using a simulation approach coupled with neighbour joining analyses, we show that while the underlying population genetic structure is an important determinant of tree shape in the analysis of dominant data, the number of loci examined also affects the topology. At low levels of population differentiation (eg FST=0.07), mutually exclusive clustering of individuals into their respective populations can occur when sufficiently large numbers of loci are scored (eg 250 loci, typical of many AFLP studies). In contrast, unresolved star-shaped topologies can be recovered at higher levels of population differentiation (FST=>0.15) when lower numbers of loci are employed (eg 50 loci, typical of many RAPD studies). Thus, the relationship between tree topology and the extent of genetic structuring of populations is contingent upon the number of dominant loci scored. The consequences of these findings for the biological interpretation of individual-based analysis of dominant data sets are discussed.Keywords
This publication has 31 references indexed in Scilit:
- Genetic diversity in Arabidopsis thaliana L. Heynh. investigated by cleaved amplified polymorphic sequence (CAPS) and inter‐simple sequence repeat (ISSR) markersMolecular Ecology, 2002
- The estimation of population differentiation with microsatellite markersMolecular Ecology, 2002
- Demographic and random amplified polymorphic DNA analyses reveal high levels of genetic diversity in a clonal violetMolecular Ecology, 2001
- Historical biogeography in a linear system: genetic variation of Sea Rocket (Cakile maritima) and Sea Holly (Eryngium maritimum) along European coastsMolecular Ecology, 2000
- MICROSATELLITES CAN BE MISLEADING: AN EMPIRICAL AND SIMULATION STUDYEvolution, 2000
- Comparison of genetic diversity of the invasive weed Rubus alceifolius Poir. (Rosaceae) in its native range and in areas of introduction, using amplified fragment length polymorphism (AFLP) markersMolecular Ecology, 2000
- Random amplified polymorphic DNA (RAPD) and quantitative trait analyses across a major phylogeographical break in the Mediterranean ragwort Senecio gallicus Vill. (Asteraceae)Molecular Ecology, 2000
- Genetic variation in an apomictic grass, Heteropogon contortus, in the Hawaiian IslandsMolecular Ecology, 1999
- Molecular genetic analysis of black poplar (Populus nigra L.) along Dutch riversMolecular Ecology, 1998
- The clonal and population structure of a rare endemic plant, Wyethia reticulata (Asteraceae): allozyme and RAPD analysisMolecular Ecology, 1997