Automated binning of microsatellite alleles: problems and solutions
Top Cited Papers
- 9 November 2006
- journal article
- Published by Wiley in Molecular Ecology Notes
- Vol. 7 (1) , 10-14
- https://doi.org/10.1111/j.1471-8286.2006.01560.x
Abstract
As genotyping methods move ever closer to full automation, care must be taken to ensure that there is no equivalent rise in allele‐calling error rates. One clear source of error lies with how raw allele lengths are converted into allele classes, a process referred to as binning. Standard automated approaches usually assume collinearity between expected and measured fragment length. Unfortunately, such collinearity is often only approximate, with the consequence that alleles do not conform to a perfect 2‐, 3‐ or 4‐base‐pair periodicity. To account for these problems, we introduce a method that allows repeat units to be fractionally shorter or longer than their theoretical value. Tested on a large human data set, our algorithm performs well over a wide range of dinucleotide repeat loci. The size of the problem caused by sticking to whole numbers of bases is indicated by the fact that the effective repeat length was within 5% of the assumed length only 68.3% of the time.Keywords
This publication has 19 references indexed in Scilit:
- Genotyping errors: causes, consequences and solutionsNature Reviews Genetics, 2005
- Microsatellite genotyping errors: detection approaches, common sources and consequences for paternal exclusionMolecular Ecology, 2004
- How to track and assess genotyping errors in population genetics studiesMolecular Ecology, 2004
- HOW MUCH OF THE VARIATION IN ADAPTIVE DIVERGENCE CAN BE EXPLAINED BY GENE FLOW? AN EVALUATION USING LAKE-STREAM STICKLEBACK PAIRSEvolution, 2004
- The evolution of molecular markers — just a matter of fashion?Nature Reviews Genetics, 2004
- Laboratory temperature variation is a previously unrecognized source of genotyping error during capillary electrophoresisMolecular Ecology Notes, 2003
- A Tale of Two Genotypes: Consistency between Two High-Throughput Genotyping CentersGenome Research, 2002
- Identification and Analysis of Error Types in High-Throughput GenotypingAmerican Journal of Human Genetics, 2000
- Biases associated with population estimation using molecular taggingAnimal Conservation, 2000
- Methods for precise sizing, automated binning of alleles, and reduction of error rates in large-scale genotyping using fluorescently labeled dinucleotide markers. FUSION (Finland-U.S. Investigation of NIDDM Genetics) Study Group.Genome Research, 1997