Significance Tests and Weighted Values for AFLP Similarities, Based on Arabidopsis in Silico AFLP Fragment Length Distributions
- 1 August 2004
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 167 (4), 1915-1928
- https://doi.org/10.1534/genetics.103.015693
Abstract
Many AFLP studies include relatively unrelated genotypes that contribute noise to data sets instead of signal. We developed: (1) estimates of expected AFLP similarities between unrelated genotypes, (2) significance tests for AFLP similarities, enabling the detection of unrelated genotypes, and (3) weighted similarity coefficients, including band position information. Detection of unrelated genotypes and use of weighted similarity coefficients will make the analysis of AFLP data sets more informative and more reliable. Test statistics and weighted coefficients were developed for total numbers of shared bands and for Dice, Jaccard, Nei and Li, and simple matching (dis)similarity coefficients. Theoretical and in silico AFLP fragment length distributions (FLDs) were examined as a basis for the tests. The in silico AFLP FLD based on the Arabidopsis thaliana genome sequence was the most appropriate for angiosperms. The G + C content of the selective nucleotides in the in silico AFLP procedure significantly influenced the FLD. Therefore, separate test statistics were calculated for AFLP procedures with high, average, and low G + C contents in the selective nucleotides. The test statistics are generally applicable for angiosperms with a G + C content of ∼35–40%, but represent conservative estimates for genotypes with higher G + C contents. For the latter, test statistics based on a rice genome sequence are more appropriate.
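The similarity coefficients named in the abstract can be illustrated with a minimal sketch. It computes the standard, unweighted Dice, Jaccard, and simple matching coefficients for two binary AFLP band profiles (1 = band present, 0 = band absent). The function names and the example profiles are hypothetical, chosen only for illustration. The weighted coefficients and significance tests developed in the article are not reproduced here, since the abstract does not give their formulas.

```python
from typing import Dict, Sequence, Tuple


def band_counts(x: Sequence[int], y: Sequence[int]) -> Tuple[int, int, int, int]:
    """Count band positions by presence/absence in two profiles.

    Returns (a, b, c, d):
      a = bands present in both profiles (shared bands)
      b = bands present only in x
      c = bands present only in y
      d = bands absent from both
    """
    if len(x) != len(y):
        raise ValueError("profiles must score the same band positions")
    a = sum(1 for p, q in zip(x, y) if p and q)
    b = sum(1 for p, q in zip(x, y) if p and not q)
    c = sum(1 for p, q in zip(x, y) if not p and q)
    d = sum(1 for p, q in zip(x, y) if not p and not q)
    return a, b, c, d


def similarity_coefficients(x: Sequence[int], y: Sequence[int]) -> Dict[str, float]:
    """Standard (unweighted) similarity coefficients for binary band data."""
    a, b, c, d = band_counts(x, y)
    n = a + b + c + d
    nan = float("nan")
    return {
        "shared_bands": float(a),
        # Dice: shared bands counted twice, shared absences ignored.
        "dice": 2 * a / (2 * a + b + c) if (2 * a + b + c) else nan,
        # Jaccard: shared bands over all bands seen in either profile.
        "jaccard": a / (a + b + c) if (a + b + c) else nan,
        # Simple matching: shared presences and shared absences both count.
        "simple_matching": (a + d) / n if n else nan,
    }


# Hypothetical band profiles for two genotypes scored at eight positions.
genotype_1 = [1, 1, 0, 1, 0, 0, 1, 0]
genotype_2 = [1, 0, 0, 1, 1, 0, 1, 0]
result = similarity_coefficients(genotype_1, genotype_2)
print(result)
```

For these profiles there are 3 shared bands, 1 band unique to each genotype, and 3 shared absences, giving Dice = 0.75, Jaccard = 0.6, and simple matching = 0.75. Simple matching counts shared absences as agreement, while Dice and Jaccard do not, which is why the choice of coefficient matters when many band positions are empty in both profiles.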