Automatic Deconvolution of Isotope-Resolved Mass Spectra Using Variable Selection and Quantized Peptide Mass Distribution
- 11 April 2006
- journal article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 78 (10) , 3385-3392
- https://doi.org/10.1021/ac052212q
Abstract
We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.Keywords
This publication has 8 references indexed in Scilit:
- Modular, scriptable and automated analysis tools for high-throughput peptide mass fingerprintingBioinformatics, 2004
- Least angle regressionThe Annals of Statistics, 2004
- Use of 13C to monitor soil organic matter transformations caused by a simulated forest fireRapid Communications in Mass Spectrometry, 2004
- Automated deconvolution and deisotoping of electrospray mass spectraJournal of Mass Spectrometry, 2002
- Isotopic Deconvolution of Matrix-Assisted Laser Desorption/Ionization Mass Spectra for Substance-Class Specific Analysis of Complex SamplesEuropean Journal of Mass Spectrometry, 2001
- The Variable Selection ProblemJournal of the American Statistical Association, 2000
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- A universal algorithm for fast and automated charge state deconvolution of electrospray mass-to-charge ratio spectraJournal of the American Society for Mass Spectrometry, 1998