Data integration in plant biology: the O2PLS method for combined modeling of transcript and metabolite data
- 11 October 2007
- journal article
- Published by Wiley in The Plant Journal
- Vol. 52 (6) , 1181-1191
- https://doi.org/10.1111/j.1365-313x.2007.03293.x
Abstract
Summary: The technological advances in the instrumentation employed in life sciences have enabled the collection of a virtually unlimited quantity of data from multiple sources. By gathering data from several analytical platforms, with the aim of parallel monitoring of, e.g. transcriptomic, metabolomic or proteomic events, one hopes to answer and understand biological questions and observations. This ‘systems biology’ approach typically involves advanced statistics to facilitate the interpretation of the data. In the present study, we demonstrate that the O2PLS multivariate regression method can be used for combining ‘omics’ types of data. With this methodology, systematic variation that overlaps across analytical platforms can be separated from platform‐specific systematic variation. A study of Populus tremula × Populus tremuloides, investigating short‐day‐induced effects at transcript and metabolite levels, is employed to demonstrate the benefits of the methodology. We show how the models can be validated and interpreted to identify biologically relevant events, and discuss the results in relation to a pairwise univariate correlation approach and principal component analysis.
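The separation of overlapping from platform-specific variation described in the summary can be illustrated with a small simulation. The sketch below is not the O2PLS algorithm and does not use the study's data: it is a simplified stand-in that takes the leading singular vectors of the cross-covariance matrix X^T Y to find the shared direction and omits the orthogonal filtering steps of the full method. The block sizes, the idealized orthogonal latent variables, the noise level, and all variable names are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_transcripts, n_metabolites = 200, 30, 20  # hypothetical sizes


def orthonormal_columns(rows, cols):
    """Return a rows x cols matrix with centered, orthonormal columns."""
    m = rng.standard_normal((rows, cols))
    m -= m.mean(axis=0)
    q, _ = np.linalg.qr(m)
    return q


# Idealized latent variables (assumption): one shared by both blocks and one
# specific to each block, mutually orthogonal and scaled to roughly unit variance.
scores = orthonormal_columns(n_samples, 3) * np.sqrt(n_samples)
t_joint, t_x_only, t_y_only = scores.T

# Hypothetical loadings for the "transcript" (X) and "metabolite" (Y) blocks.
P = orthonormal_columns(n_transcripts, 2)
Q = orthonormal_columns(n_metabolites, 2)

# Each block = shared variation + stronger block-specific variation + noise.
X = (np.outer(t_joint, P[:, 0]) + 2.0 * np.outer(t_x_only, P[:, 1])
     + 0.05 * rng.standard_normal((n_samples, n_transcripts)))
Y = (np.outer(t_joint, Q[:, 0]) + 2.0 * np.outer(t_y_only, Q[:, 1])
     + 0.05 * rng.standard_normal((n_samples, n_metabolites)))
X -= X.mean(axis=0)
Y -= Y.mean(axis=0)

# Shared direction: leading singular vectors of the cross-covariance X^T Y.
U, _, Vt = np.linalg.svd(X.T @ Y, full_matrices=False)
joint_score_x = X @ U[:, 0]
joint_score_y = Y @ Vt[0]

# Single-block PCA of X for comparison: first principal component scores.
_, _, Vt_pca = np.linalg.svd(X, full_matrices=False)
pca_score_x = X @ Vt_pca[0]


def abs_corr(a, b):
    """Absolute Pearson correlation between two score vectors."""
    return abs(np.corrcoef(a, b)[0, 1])


print("joint score (X) vs shared latent:", abs_corr(joint_score_x, t_joint))
print("joint score (Y) vs shared latent:", abs_corr(joint_score_y, t_joint))
print("PCA score (X)   vs shared latent:", abs_corr(pca_score_x, t_joint))
print("PCA score (X)   vs X-specific:   ", abs_corr(pca_score_x, t_x_only))
```

In this constructed setting the cross-covariance scores track the shared latent variable in both blocks, whereas the first principal component of X alone follows the larger X-specific variation. That contrast is the motivation for modeling the two platforms jointly; real data would not satisfy the orthogonality assumed here.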