Balancing false positives and false negatives for the detection of differential expression in malignancies
Open Access
- 31 August 2004
- journal article
- research article
- Published by Springer Nature in British Journal of Cancer
- Vol. 91 (6), 1160-1165
- https://doi.org/10.1038/sj.bjc.6602140
Abstract
A basic problem of microarray data analysis is to identify genes whose expression is affected by the distinction between malignancies with different properties. These genes are said to be differentially expressed. Differential expression can be detected by selecting the genes with P-values (derived using an appropriate hypothesis test) below a certain rejection level. This selection, however, is not possible without accepting some false positives and negatives since the two sets of P-values, associated with the genes whose expression is and is not affected by the distinction between the different malignancies, overlap. We describe a procedure for the study of differential expression in microarray data based on receiver-operating characteristic curves. This approach can be useful to select a rejection level that balances the number of false positives and negatives and to assess the degree of overlap between the two sets of P-values. Since this degree of overlap characterises the balance that can be reached between the number of false positives and negatives, this quantity can be seen as a quality measure of microarray data with respect to the detection of differential expression. As an example, we apply our method to data sets studying acute leukaemia.
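The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes the two sets of P-values are already labelled (which is not the case for real microarray data), it simulates them from hypothetical distributions, and it reads "balancing" as minimising the absolute difference between the counts of false positives and false negatives. The function names `roc_points`, `auc` and `balanced_level` are made up for this illustration.

```python
import random


def roc_points(p_affected, p_unaffected, levels):
    """For each rejection level, return (level, false positive rate, true positive rate).

    A gene is selected when its P-value is below the rejection level.
    """
    points = []
    for level in levels:
        tpr = sum(p < level for p in p_affected) / len(p_affected)
        fpr = sum(p < level for p in p_unaffected) / len(p_unaffected)
        points.append((level, fpr, tpr))
    return points


def auc(points):
    """Trapezoidal area under the ROC curve (levels must be in ascending order).

    An area near 1 means little overlap between the two sets of P-values;
    an area near 0.5 means they overlap almost completely.
    """
    curve = [(0.0, 0.0)] + [(fpr, tpr) for _, fpr, tpr in points] + [(1.0, 1.0)]
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(curve, curve[1:]))


def balanced_level(p_affected, p_unaffected, levels):
    """Return the level with the smallest |false positives - false negatives|.

    Assumption: this is one possible reading of "balance", chosen for illustration.
    """
    def imbalance(level):
        false_pos = sum(p < level for p in p_unaffected)
        false_neg = sum(p >= level for p in p_affected)
        return abs(false_pos - false_neg)
    return min(levels, key=imbalance)


# Simulated, hypothetical data: unaffected genes give uniform P-values,
# affected genes give P-values skewed towards zero.
rng = random.Random(0)
p_unaffected = [rng.random() for _ in range(800)]
p_affected = [rng.random() ** 4 for _ in range(200)]

levels = [i / 100 for i in range(101)]
area = auc(roc_points(p_affected, p_unaffected, levels))
level = balanced_level(p_affected, p_unaffected, levels)

print("area under ROC curve:", round(area, 3))
print("balanced rejection level:", level)
```

In this sketch the area under the curve plays the role of the overlap measure: the closer it is to 1, the better the balance between false positives and false negatives that any rejection level can reach.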