Predictive response-relevant clustering of expression data provides insights into disease processes
Open Access
- 22 June 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 38 (20) , 6831-6840
- https://doi.org/10.1093/nar/gkq550
Abstract
This article describes and illustrates a novel method of microarray data analysis that couples model-based clustering and binary classification to form clusters of `response-relevant' genes; that is, genes that are informative when discriminating between the different values of the response. Predictions are subsequently made using an appropriate statistical summary of each gene cluster, which we call the `meta-covariate' representation of the cluster, in a probit regression model. We first illustrate this method by analysing a leukaemia expression dataset, before focusing closely on the meta-covariate analysis of a renal gene expression dataset in a rat model of salt-sensitive hypertension. We explore the biological insights provided by our analysis of these data. In particular, we identify a highly influential cluster of 13 genes—including three transcription factors (Arntl, Bhlhe41 and Npas2)—that is implicated as being protective against hypertension in response to increased dietary sodium. Functional and canonical pathway analysis of this cluster using Ingenuity Pathway Analysis implicated transcriptional activation and circadian rhythm signalling, respectively. Although we illustrate our method using only expression data, the method is applicable to any high-dimensional datasets. Expression data are available at ArrayExpress (accession number E-MEXP-2514) and code is available at http://www.dcs.gla.ac.uk/inference/metacovariateanalysis/.Keywords
This publication has 34 references indexed in Scilit:
- Gene expression profiling: Decoding breast cancerSurgical Oncology, 2009
- Molecular clock is involved in predictive circadian adjustment of renal functionProceedings of the National Academy of Sciences, 2009
- CD74: A New Candidate Target for the Immunotherapy of B-Cell NeoplasmsClinical Cancer Research, 2007
- Aryl hydrocarbon receptor nuclear translocator-like (BMAL1) is associated with susceptibility to hypertension and type 2 diabetesProceedings of the National Academy of Sciences, 2007
- The MIF-173G/C polymorphism does not contribute to prednisone poor response in vivo in childhood acute lymphoblastic leukemiaLeukemia, 2005
- Global burden of hypertension: analysis of worldwide dataThe Lancet, 2005
- Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experimentsPublished by Wiley ,2004
- Alterations of Circadian Expressions of Clock Genes in Dahl Salt-Sensitive Rats Fed a High-Salt DietHypertension, 2003
- Quantitative Trait Loci in Genetically Hypertensive RatsHypertension, 1996
- Comparing partitionsJournal of Classification, 1985