Discovery of significant rules for classifying cancer diagnosis data
Open Access
- 27 September 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 19 (suppl_2) , ii93-ii102
- https://doi.org/10.1093/bioinformatics/btg1066
Abstract
Methods and Results: We introduce a new method to discover many diversified and significant rules from high dimensional profiling data. We also propose to aggregate the discriminating power of these rules for reliable predictions. The discovered rules are found to contain low-ranked features; these features are found to be sometimes necessary for classifiers to achieve perfect accuracy. The use of low-ranked but essential features in our method is in constrast to the prevailing use of an ad-hoc number of only top-ranked features. On a wide range of data sets, our method displayed highly competitive accuracy compared to the best performance of other kinds of classification models. In addition to accuracy, our method also provides comprehensible rules to help elucidate the translation between raw data and useful knowledge. Supplementary information: http://sdmc.i2r.a-star.edu.sg/GEDatasets/supplementaldata/eccb2003/ECCB2003.html. Contact: jinyan@i2r.a-star.edu.sg *To whom correspondence should be addressed.Keywords
This publication has 0 references indexed in Scilit: