Modified SIMPSON O(n3) algorithm for the full sibship reconstruction problem
Open Access
- 23 August 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (20) , 3912-3917
- https://doi.org/10.1093/bioinformatics/bti642
Abstract
Motivation: The problem of reconstructing full sibling groups from DNA marker data remains a significant challenge for computational biology. A recently published heuristic algorithm based on Mendelian exclusion rules and the Simpson index was successfully applied to the full sibship reconstruction (FSR) problem. However, the so-called SIMPSON algorithm has an unknown complexity measure, questioning its applicability range. Results: We present a modified version of the SIMPSON (MS) algorithm that behaves as O(n3) and achieves the same or better accuracy when compared with the original algorithm. Performance of the MS algorithm was tested on a variety of simulated diploid population samples to verify its complexity measure and the significant improvement in efficiency (e.g. 100 times faster than SIMPSON in some cases). It has been shown that, in theory, the SIMPSON algorithm runs in non-polynomial time, significantly limiting its usefulness. It has been also verified via simulation experiments that SIMPSON could run in O(na), where a > 3. Availability: Computer code written in Java is available upon request from the first author. Contact:Dmitry.Konovalov@jcu.edu.auKeywords
This publication has 11 references indexed in Scilit:
- Partition-distance via the assignment problemBioinformatics, 2005
- kingroup: a program for pedigree relationship reconstruction and kin group assignments using genetic markersMolecular Ecology Notes, 2004
- Accuracy, efficiency and robustness of four algorithms allowing full sibship reconstruction from DNA marker dataMolecular Ecology, 2004
- DNA-based methods for pedigree reconstruction and kinship analysis in natural populationsTrends in Ecology & Evolution, 2003
- A graph‐theoretic approach to the partition of individuals into full‐sib familiesMolecular Ecology, 2003
- Sibship reconstruction in hierarchical population structures using Markov chain Monte Carlo techniquesGenetics Research, 2002
- Partition-distance: A problem and class of perfect graphs arising in clusteringInformation Processing Letters, 2002
- A Bootstrap Assessment of Variability in Pedigree Reconstruction Based on Genetic MarkersBiometrics, 2001
- Use of microsatellite loci to classify individuals by relatednessMolecular Ecology, 1996
- Exponential NumbersThe American Mathematical Monthly, 1934