Improving feature detection and analysis of surface-enhanced laser desorption/ionization-time of flight mass spectra
- 1 July 2005
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 5 (11) , 2778-2788
- https://doi.org/10.1002/pmic.200401184
Abstract
Discovering valid biological information from surface-enhanced laser desorption/ionization-time of flight mass spectrometry (SELDI-TOF MS) depends on clear experimental design, meticulous sample handling, and sophisticated data processing. Most published literature deals with the biological aspects of these experiments, or with computer-learning algorithms to locate sets of classifying biomarkers. The process of locating and measuring proteins across spectra has received less attention. This process should be tunable between sensitivity and false-discovery, and should guarantee that features are biologically meaningful in that they represent chemical species that can be identified and investigated. Existing feature detection in SELDI-TOF MS is not optimal for acquiring biologically relevant data. Most methods have so many user-defined settings that reproducibility and comparability among studies suffer considerably. To address these issues, we have developed an approach, called simultaneous spectrum analysis (SSA), which (i) locates proteins across spectra, (ii) measures their abundance, (iii) subtracts baseline, (iv) excludes irreproducible measurements, and (v) computes normalization factors for comparing spectra. SSA uses only two key parameters for feature detection and one parameter each for quality thresholds on spectra and peaks. The effectiveness of SSA is demonstrated by identifying proteins differentially expressed in SELDI-TOF spectra from plasma of wild-type and knockout mice for plasma glutathione peroxidase. Comparing analyses by SSA and CiphergenExpress Data Manager 2.1 finds similar results for large signal peaks, but SSA improves the number and quality of differences between groups among lower signal peaks. SSA is also less likely to introduce systematic bias when normalizing spectra.
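The abstract names the processing steps but gives no algorithmic detail, so the following is only a generic sketch of three of them (baseline subtraction, peak location, and normalization factors), not the SSA method itself. Every choice here is an assumption made for illustration: the rolling-minimum baseline, the median-absolute-deviation noise estimate, the total-ion-current normalization, and the hypothetical parameters `half_window` and `min_snr`, which are not the two SSA parameters mentioned above.

```python
from statistics import median


def subtract_baseline(intensities, half_window=5):
    """Subtract a rolling-minimum baseline (assumed method, not SSA's)."""
    n = len(intensities)
    corrected = []
    for i in range(n):
        lo = max(0, i - half_window)
        hi = min(n, i + half_window + 1)
        corrected.append(intensities[i] - min(intensities[lo:hi]))
    return corrected


def estimate_noise(intensities):
    """Robust noise level: median absolute deviation from the median."""
    center = median(intensities)
    return median([abs(x - center) for x in intensities])


def find_peaks(intensities, min_snr=3.0):
    """Return indices of local maxima whose signal-to-noise ratio >= min_snr."""
    noise = estimate_noise(intensities) or 1e-12  # guard against zero noise
    peaks = []
    for i in range(1, len(intensities) - 1):
        y = intensities[i]
        is_local_max = y > intensities[i - 1] and y >= intensities[i + 1]
        if is_local_max and y / noise >= min_snr:
            peaks.append(i)
    return peaks


def tic_normalization_factors(spectra):
    """Scale factors that bring each spectrum's total intensity to the mean total."""
    totals = [sum(s) for s in spectra]
    mean_total = sum(totals) / len(totals)
    return [mean_total / t for t in totals]


# Toy spectrum: a slightly noisy flat baseline near 10 with one peak at index 10.
raw = [10.0 + (i % 2) for i in range(21)]
raw[9], raw[10], raw[11] = 30.0, 60.0, 30.0

corrected = subtract_baseline(raw)
peak_indices = find_peaks(corrected)
factors = tic_normalization_factors([[1.0, 1.0], [2.0, 2.0]])
```

A single global scale factor per spectrum, as in `tic_normalization_factors`, is the kind of normalization that can introduce systematic bias when a few large peaks dominate the total, which is the concern the abstract raises; how SSA avoids it is not stated here.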