A nearest-neighboring-end algorithm for genetic mapping
Open Access
- 25 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (8) , 1579-1591
- https://doi.org/10.1093/bioinformatics/bti164
Abstract
Motivation: High-throughput methods are beginning to make possible the genotyping of thousands of loci in thousands of individuals, which could be useful for tightly associating phenotypes to candidate loci. Current mapping algorithms cannot handle so many data without building hierarchies of framework maps. Results: A version of Kruskal’s minimum spanning tree algorithm can solve any genetic mapping problem that can be stated as marker deletion from a set of linkage groups. These include backcross, recombinant inbred, haploid and double-cross recombinational populations, in addition to conventional deletion and radiation hybrid populations. The algorithm progressively joins linkage groups at increasing recombination fractions between terminal markers, and attempts to recognize and correct erroneous joins at peaks in recombination fraction. The algorithm is O (mn3) for m individuals and n markers, but the mean run time scales close to mn2. It is amenable to parallel processing and has recovered true map order in simulations of large backcross, recombinant inbred and deletion populations with up to 37 005 markers. Simulations were used to investigate map accuracy in response to population size, allelic dominance, segregation distortion, missing data and random typing errors. It produced accurate maps when marker distribution was sufficiently uniform, although segregation distortion could induce translocated marker orders. The algorithm was also used to map 1003 loci in the F7 ITMI population of bread wheat, Triticum aestivum L. emend Thell., where it shortened an existing standard map by 16%, but it failed to associate blocks of markers properly across gaps within linkage groups. This was because it depends upon the rankings of recombination fractions at individual markers, and is susceptible to sampling error, typing error and joint selection involving the terminal markers of nearly finished linkage groups. Therefore, the current form of the algorithm is useful mainly to improve local marker ordering in linkage groups obtained in other ways. Availability: The source code and supplemental data are http://www.iubio.bio.indiana.edu/soft/molbio/qtl/flipper/ Contact:ccrane@purdue.eduKeywords
This publication has 18 references indexed in Scilit:
- Rapid Mapping of Zebrafish Mutations With SNPs and Oligonucleotide MicroarraysGenome Research, 2002
- Gene Expression Analysis Using Oligonucleotide Arrays Produced by Maskless PhotolithographyGenome Research, 2002
- Large-scale selection of lines with deletions in chromosome 1B in wheat and applications for fine deletion mappingGenome, 2001
- RHO—Radiation Hybrid OrderingGenome Research, 2000
- On Constructing Radiation Hybrid MapsJournal of Computational Biology, 1997
- Molecular genetic maps of the group 6 chromosomes of hexaploid wheat (Triticum aestivumL. em. Thell.)Genome, 1996
- Molecular mapping of wheat. Homoeologous group 3Genome, 1995
- Molecular mapping of wheat. Homoeologous group 2Genome, 1995
- Molecular-genetic maps for group 1 chromosomes of Triticeae species and their relation to chromosomes in rice and oatGenome, 1995
- On the shortest spanning subtree of a graph and the traveling salesman problemProceedings of the American Mathematical Society, 1956