Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data
Open Access
- 31 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (16) , 2063-2072
- https://doi.org/10.1093/bioinformatics/btm289
Abstract
Motivation: A major challenge in current biomedical research is the identification of cellular processes deregulated in a given pathology through the analysis of gene expression profiles. To this end, predefined lists of genes, coding specific functions, are compared with a list of genes ordered according to their values of differential expression measured by suitable univariate statistics. Results: We propose a statistically well-founded method for measuring the relevance of predefined lists of genes and for assessing their statistical significance starting from their raw expression levels as recorded on the microarray. We use prediction accuracy as a measure of relevance of the list. The rationale is that a functional category, coded through a list of genes, is perturbed in a given pathology if it is possible to correctly predict the occurrence of the disease in new subjects on the basis of the expression levels of the genes belonging to the list only. The accuracy is estimated with multiple random validation strategy and its statistical significance is assessed against a couple of null hypothesis, by using two independent permutation tests. The utility of the proposed methodology is illustrated by analyzing the relevance of Gene Ontology terms belonging to biological process category in colon and prostate cancer, by using three different microarray data sets and by comparing it with current approaches. Availability: Source code for the algorithms is available from author upon request. Contact:ancona@ba.issia.cnr.it Supplementary information: Colon cancer data set and a complete description of experimental results are available at: ftp://bioftp:76bioftpxxx@marx.ba.issia.cnr.it/supp-info.htmKeywords
This publication has 37 references indexed in Scilit:
- 15-Hydroxyprostaglandin dehydrogenase is an in vivo suppressor of colon tumorigenesisProceedings of the National Academy of Sciences, 2006
- Non-steroidal anti-inflammatory drugs for cancer prevention: promise, perils and pharmacogeneticsNature Reviews Cancer, 2006
- Commentary: Targeting Colorectal Cancer Through Molecular BiologySeminars in Oncology, 2005
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- Ontological analysis of gene expression data: current tools, limitations, and open problemsBioinformatics, 2005
- 15-Hydroxyprostaglandin dehydrogenase, a COX-2 oncogene antagonist, is a TGF-β-induced suppressor of human gastrointestinal cancersProceedings of the National Academy of Sciences, 2004
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- Sterol Regulatory Element-Binding Protein-1 Participates in the Regulation of Fatty Acid Synthase Expression in Colorectal NeoplasiaExperimental Cell Research, 2000
- Mechanisms Linking Diet and Colorectal Cancer: The Possible Role of Insulin ResistanceNutrition and Cancer, 2000
- Transcriptional activation functions in BRCA2Nature, 1997