Comparison of classification methods for detecting associations between SNPs and chick mortality
Open Access
- 23 January 2009
- journal article
- research article
- Published by Springer Nature in Genetics Selection Evolution
- Vol. 41 (1) , 18
- https://doi.org/10.1186/1297-9686-41-18
Abstract
Multi-category classification methods were used to detect SNP-mortality associations in broilers. The objective was to select a subset of whole genome SNPs associated with chick mortality. This was done by categorizing mortality rates and using a filter-wrapper feature selection procedure in each of the classification methods evaluated. Different numbers of categories (2, 3, 4, 5 and 10) and three classification algorithms (naïve Bayes classifiers, Bayesian networks and neural networks) were compared, using early and late chick mortality rates in low and high hygiene environments. Evaluation of SNPs selected by each classification method was done by predicted residual sum of squares and a significance test-related metric. A naïve Bayes classifier, coupled with discretization into two or three categories generated the SNP subset with greatest predictive ability. Further, an alternative categorization scheme, which used only two extreme portions of the empirical distribution of mortality rates, was considered. This scheme selected SNPs with greater predictive ability than those chosen by the methods described previously. Use of extreme samples seems to enhance the ability of feature selection procedures to select influential SNPs in genetic association studies.Keywords
This publication has 37 references indexed in Scilit:
- Marker-assisted assessment of genotype by environment interaction: A case study of single nucleotide polymorphism-mortality association in broilers in two hygiene environments1Journal of Animal Science, 2008
- Conditional variable importance for random forestsBMC Bioinformatics, 2008
- Reproducing Kernel Hilbert Spaces Regression Methods for Genomic Assisted Prediction of Quantitative TraitsGenetics, 2008
- Machine learning classification procedure for selecting SNPs in genomic selection: application to early mortality in broilersJournal of Animal Breeding and Genetics, 2007
- Looking for a bit of co-action?Thorax, 2007
- Genomic-Assisted Prediction of Genetic Value With Semiparametric ProceduresGenetics, 2006
- Association of IL13 with total IgE: Evidence against an inverse association of atopy and diabetesJournal of Allergy and Clinical Immunology, 2006
- Using Bayesian Networks to Analyze Expression DataJournal of Computational Biology, 2000
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978