Model criticism based on likelihood-free inference, with an application to protein network evolution
Open Access
- 30 June 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 106 (26) , 10576-10581
- https://doi.org/10.1073/pnas.0807882106
Abstract
Mathematical models are an important tool to explain and comprehend complex phenomena, and unparalleled computational advances enable us to easily explore them without any or little understanding of their global properties. In fact, the likelihood of the data under complex stochastic models is often analytically or numerically intractable in many areas of sciences. This makes it even more important to simultaneously investigate the adequacy of these models—in absolute terms, against the data, rather than relative to the performance of other models—but no such procedure has been formally discussed when the likelihood is intractable. We provide a statistical interpretation to current developments in likelihood-free Bayesian inference that explicitly accounts for discrepancies between the model and the data, termed Approximate Bayesian Computation under model uncertainty (ABCμ). We augment the likelihood of the data with unknown error terms that correspond to freely chosen checking functions, and provide Monte Carlo strategies for sampling from the associated joint posterior distribution without the need of evaluating the likelihood. We discuss the benefit of incorporating model diagnostics within an ABC framework, and demonstrate how this method diagnoses model mismatch and guides model refinement by contrasting three qualitative models of protein network evolution to the protein interaction datasets of Helicobacter pylori and Treponema pallidum. Our results make a number of model deficiencies explicit, and suggest that the T. pallidum network topology is inconsistent with evolution dominated by link turnover or lateral gene transfer alone.Keywords
This publication has 34 references indexed in Scilit:
- Modular networks and cumulative impact of lateral transfer in prokaryote genome evolutionProceedings of the National Academy of Sciences, 2008
- The Binary Protein Interactome of Treponema pallidum – The Syphilis SpirochetePLOS ONE, 2008
- Reconstruction of ancestral protein interaction networks for the bZIP transcription factorsProceedings of the National Academy of Sciences, 2007
- Using Likelihood-Free Inference to Compare Evolutionary Dynamics of the Protein Networks of H. pylori and P. falciparumPLoS Computational Biology, 2007
- Statistical evaluation of alternative models of human evolutionProceedings of the National Academy of Sciences, 2007
- A new approach to estimate parameters of speciation models with application to apesGenome Research, 2007
- Specificity and Evolvability in Eukaryotic Protein Interaction NetworksPLoS Computational Biology, 2007
- Sequential Monte Carlo without likelihoodsProceedings of the National Academy of Sciences, 2007
- Science and StatisticsJournal of the American Statistical Association, 1976
- Bayesian Analysis of Regression Error TermsJournal of the American Statistical Association, 1975