A novel approach to detect hot-spots in large-scale multivariate data
Open Access
- 11 September 2007
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 8 (1), 331
- https://doi.org/10.1186/1471-2105-8-331
Abstract
Background: Progressive advances in the measurement of complex multifactorial components of biological processes involving both spatial and temporal domains have made it difficult to identify, within large data sets and using conventional statistical approaches, the variables (genes, proteins, neurons, etc.) whose activities change significantly in response to a stimulus. The set of all changed variables is termed "hot-spots". The detection of such hot-spots is considered to be an NP-hard problem, but by first establishing its theoretical foundation we have been able to develop an algorithm that provides a solution.
Results: Our results show that a first-order phase transition is observable whose critical point separates the hot-spot set from the remaining variables. The algorithm is also found to be more successful than existing approaches in identifying statistically significant hot-spots, both in simulated data sets and in real large-scale multivariate data sets from gene arrays, electrophysiological recordings, and functional magnetic resonance imaging experiments.
Conclusion: In summary, this new statistical algorithm should provide a powerful new analytical tool to extract the maximum information from complex biological multivariate data.