Second-Order Peak Detection for Multicomponent High-Resolution LC/MS Data
- 14 January 2006
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 78 (4) , 975-983
- https://doi.org/10.1021/ac050980b
Abstract
The first step when analyzing multicomponent LC/MS data from complex samples such as biofluid metabolic profiles is to separate the data into information and noise via, for example, peak detection. Due to the complex nature of this type of data, with problems such as alternating backgrounds and differing peak shapes, this can be a very complex task. This paper presents and evaluates a two-dimensional peak detection algorithm based on raw vector-represented LC/MS data. The algorithm exploits the fact that in high-resolution centroid data chromatographic peaks emerge flanked with data voids in the corresponding mass axis. According to the proposed method, only 4‰ of the total amount of data from a urine sample is defined as chromatographic peaks; however, 94% of the raw data variance is captured within these peaks. Compared to bucketed data, results show that essentially the same features that an experienced analyst would define as peaks can automatically be extracted with a minimum of noise and background. The method is simple and requires a priori knowledge of only the minimum chromatographic peak widtha system-dependent parameter that is easily assessed. Additional meta parameters are estimated from the data themselves. The result is well-defined chromatographic peaks that are consistently arranged in a matrix at their corresponding m/z values. In the context of automated analysis, the method thus provides an alternative to the traditional approach of bucketing the data followed by denoising and/or one-dimensional peak detection. The software implementation of the proposed algorithm is available at http://www.anchem.su.se/peakd as compiled code for Matlab.Keywords
This publication has 10 references indexed in Scilit:
- HPLC-MS-based methods for the study of metabonomicsJournal of Chromatography B, 2004
- Multivariate approaches for efficient detection of potential metabolites from liquid chromatography/mass spectrometry dataRapid Communications in Mass Spectrometry, 2004
- A Universal Denoising and Peak Picking Algorithm for LC−MS Based on Matched Filtration in the Chromatographic Time DomainAnalytical Chemistry, 2003
- Preprocessing of tandem mass spectrometric data to support automatic protein identificationProteomics, 2003
- Matched filtering with background suppression for improved quality of base peak chromatograms and mass spectra in liquid chromatography–mass spectrometryAnalytica Chimica Acta, 2001
- Windowed mass selection method: a new data processing algorithm for liquid chromatography–mass spectrometry dataJournal of Chromatography A, 1999
- Application of sequential paired covariance to liquid chromatography-mass spectrometry data enhancements in both the signal-to-noise ratio and the resolution of analyte peaks in the chromatogramJournal of Chromatography A, 1997
- A Noise and Background Reduction Method for Component Detection in Liquid Chromatography/Mass SpectrometryAnalytical Chemistry, 1996
- 750 MHz 1H and 1H-13C NMR Spectroscopy of Human Blood PlasmaAnalytical Chemistry, 1995
- Smoothing and Differentiation of Data by Simplified Least Squares Procedures.Analytical Chemistry, 1964