Processing and classification of protein mass spectra
- 3 February 2006
- journal article
- review article
- Published by Wiley in Mass Spectrometry Reviews
- Vol. 25 (3) , 409-449
- https://doi.org/10.1002/mas.20072
Abstract
Among the many applications of mass spectrometry, biomarker pattern discovery from protein mass spectra has aroused considerable interest in the past few years. While research efforts have raised hopes of early and less invasive diagnosis, they have also brought to light the many issues to be tackled before mass‐spectra‐based proteomic patterns become routine clinical tools. Known issues cover the entire pipeline leading from sample collection through mass spectrometry analytics to biomarker pattern extraction, validation, and interpretation. This study focuses on the data‐analytical phase, which takes as input mass spectra of biological specimens and discovers patterns of peak masses and intensities that discriminate between different pathological states. We survey current work and investigate computational issues concerning the different stages of the knowledge discovery process: exploratory analysis, quality control, and diverse transforms of mass spectra, followed by further dimensionality reduction, classification, and model evaluation. We conclude after a brief discussion of the critical biomedical task of analyzing discovered discriminatory patterns to identify their component proteins as well as interpret and validate their biological implications. © 2006 Wiley Periodicals, Inc., Mass Spec Rev 25:409–449, 2006Keywords
This publication has 166 references indexed in Scilit:
- Automatic Quality Assessment of Peptide Tandem Mass SpectraBioinformatics, 2004
- Iterative data analysis is the key for exhaustive analysis of peptide mass fingerprints from proteins separated by two-dimensional electrophoresisJournal of the American Society for Mass Spectrometry, 2003
- Improved peptide charge state assignmentProteomics, 2003
- Diagnostic Potential of Serum Proteomic Patterns in Prostate CancerJournal of Urology, 2003
- Peak alignment of NMR signals by means of a genetic algorithmAnalytica Chimica Acta, 2003
- Mass spectrometry-based proteomicsNature, 2003
- Molecular scanner experiment with human plasma: Improving protein identification by using intensity distributions of matching peptide massesProteomics, 2002
- Comparing three methods for variance estimation with duplicated high density oligonucleotide arraysFunctional & Integrative Genomics, 2002
- Survey and critique of techniques for extracting rules from trained artificial neural networksPublished by Elsevier ,2000
- A Decision-Theoretic Generalization of On-Line Learning and an Application to BoostingJournal of Computer and System Sciences, 1997