NovoHMM: A Hidden Markov Model for de Novo Peptide Sequencing
- 11 October 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 77 (22) , 7265-7273
- https://doi.org/10.1021/ac0508853
Abstract
De novo sequencing of peptides poses one of the most challenging tasks in data analysis for proteome research. In this paper, a generative hidden Markov model (HMM) of mass spectra for de novo peptide sequencing which constitutes a novel view on how to solve this problem in a Bayesian framework is proposed. Further extensions of the model structure to a graphical model and a factorial HMM to substantially improve the peptide identification results are demonstrated. Inference with the graphical model for de novo peptide sequencing estimates posterior probabilities for amino acids rather than scores for single symbols in the sequence. Our model outperforms state-of-the-art methods for de novo peptide sequencing on a large test set of spectra.Keywords
This publication has 9 references indexed in Scilit:
- PepNovo: De Novo Peptide Sequencing via Probabilistic Network ModelingAnalytical Chemistry, 2005
- Large-scale database searching using tandem mass spectra: Looking up the answer in the back of the bookNature Methods, 2004
- Automatic Quality Assessment of Peptide Tandem Mass SpectraBioinformatics, 2004
- The need for a public proteomics repositoryNature Biotechnology, 2004
- Experimental Protein Mixture for Validating Tandem Mass Spectral AnalysisOMICS: A Journal of Integrative Biology, 2002
- SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide databaseBioinformatics, 2001
- De NovoPeptide Sequencing via Tandem Mass SpectrometryJournal of Computational Biology, 1999
- Sequence database searches viade novo peptide sequencing by tandem mass spectrometryRapid Communications in Mass Spectrometry, 1997
- Factorial Hidden Markov ModelsMachine Learning, 1997