Multiple Imputations Applied to the DREAM3 Phosphoproteomics Challenge: A Winning Strategy
Open Access
- 18 January 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 5 (1) , e8012
- https://doi.org/10.1371/journal.pone.0008012
Abstract
DREAM is an initiative that allows researchers to assess how well their methods or approaches can describe and predict networks of interacting molecules [1]. Each year, recently acquired datasets are released to predictors ahead of publication. Researchers typically have about three months to predict the masked data or network of interactions, using any predictive method. Predictions are assessed prior to an annual conference where the best predictions are unveiled and discussed. Here we present the strategy we used to make a winning prediction for the DREAM3 phosphoproteomics challenge. To predict the 476 out of 4624 measurements that had been masked for the challenge, we used Amelia II, a multiple imputation software package developed by Gary King, James Honaker and Matthew Blackwell [2] in the context of social sciences. To choose the best possible multiple imputation parameters for the challenge, we evaluated how transforming the data and varying the imputation parameters affected the ability to predict additionally masked data. We discuss the accuracy of our findings and show that multiple imputation applied to this dataset is a powerful method to accurately estimate the missing data. We postulate that multiple imputation methods might become an integral part of experimental design as a means to achieve cost savings or to increase the number of samples that can be handled for a given cost.
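The parameter-selection idea described in the abstract (hide some additional known measurements, impute them, and compare the estimates against the true values) can be illustrated with a minimal Python sketch. This is not the authors' pipeline and does not use Amelia II: the imputer below is a deliberately crude stand-in that draws from per-column normal distributions, and the function names (`mask_extra`, `impute_multiple`, `score_setting`), the candidate settings, and the toy data are all assumptions made for illustration.

```python
import numpy as np

# Candidate data transformations: (forward, inverse) pairs. Hypothetical choices.
TRANSFORMS = {
    "identity": (lambda x: x, lambda x: x),
    "log": (np.log, np.exp),  # assumes strictly positive measurements
}

def mask_extra(data, frac, rng):
    """Hide a random fraction of the observed entries.

    Returns the masked copy and the (row, col) positions that were hidden.
    """
    observed = np.argwhere(~np.isnan(data))
    n_hide = max(1, int(round(frac * len(observed))))
    picked = observed[rng.choice(len(observed), size=n_hide, replace=False)]
    masked = data.copy()
    masked[picked[:, 0], picked[:, 1]] = np.nan
    return masked, picked

def impute_multiple(data, m, rng):
    """Stand-in imputer (NOT Amelia II): build m completed datasets by drawing
    each missing cell from a normal distribution fitted to its column, then
    average the m datasets. Assumes every column has observed values."""
    missing = np.isnan(data)
    col_mean = np.nanmean(data, axis=0)
    col_sd = np.nanstd(data, axis=0)
    draws = []
    for _ in range(m):
        noise = rng.normal(size=data.shape)
        draws.append(np.where(missing, col_mean + col_sd * noise, data))
    return np.mean(draws, axis=0)

def score_setting(data, transform, m, frac=0.1, seed=0):
    """RMSE on additionally masked entries for one (transform, m) setting."""
    rng = np.random.default_rng(seed)
    forward, inverse = TRANSFORMS[transform]
    masked, hidden = mask_extra(data, frac, rng)
    completed = inverse(impute_multiple(forward(masked), m, rng))
    rows, cols = hidden[:, 0], hidden[:, 1]
    err = completed[rows, cols] - data[rows, cols]
    return float(np.sqrt(np.mean(err ** 2)))

# Toy data standing in for the challenge matrix; about 10% is already missing.
rng = np.random.default_rng(42)
toy = rng.lognormal(mean=1.0, sigma=0.5, size=(40, 8))
toy[rng.random(toy.shape) < 0.1] = np.nan

# Score every candidate setting and keep the one with the lowest error.
scores = {(t, m): score_setting(toy, t, m) for t in TRANSFORMS for m in (5, 20)}
best = min(scores, key=scores.get)
print(best, scores[best])
```

Because every setting is scored on the same additionally masked entries (same seed), the errors are directly comparable; in practice one would repeat the masking several times and average the scores before choosing a setting.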
This publication has 16 references indexed in Scilit:
- Lessons from the DREAM2 Challenges. Annals of the New York Academy of Sciences, 2009
- Modelling and analysis of gene regulatory networks. Nature Reviews Molecular Cell Biology, 2008
- ENFIN a Network to Enhance Integrative Systems Biology. Annals of the New York Academy of Sciences, 2007
- Critical assessment of methods of protein structure prediction—Round VII. Proteins-Structure Function and Bioinformatics, 2007
- Combinations of biomarkers predictive of later life mortality. Proceedings of the National Academy of Sciences, 2006
- HaCaT keratinocyte migration is dependent on epidermal growth factor receptor signaling and glycogen synthase kinase-3α. Published by Elsevier, 2006
- Critical assessment of methods of protein structure prediction (CASP)—Round 6. Proteins-Structure Function and Bioinformatics, 2005
- Up-regulation of IL-1 receptor through PI3K/Akt is essential for the induction of iNOS gene expression in hepatocytes. Journal of Hepatology, 2004
- Unraveling protein interaction networks with near-optimal efficiency. Nature Biotechnology, 2003
- Regulation of Hypoxia-inducible Factor-1α Protein Level during Hypoxic Conditions by the Phosphatidylinositol 3-Kinase/Akt/Glycogen Synthase Kinase 3β Pathway in HepG2 Cells. Journal of Biological Chemistry, 2003