TOPP—the OpenMS proteomics pipeline

Open Access

15 January 2007

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 23 (2) , e191-e197
https://doi.org/10.1093/bioinformatics/btl299

Abstract

Motivation: Experimental techniques in proteomics have seen rapid development over the last few years. Volume and complexity of the data have both been growing at a similar rate. Accordingly, data management and analysis are one of the major challenges in proteomics. Flexible algorithms are required to handle changing experimental setups and to assist in developing and validating new methods. In order to facilitate these studies, it would be desirable to have a flexible ‘toolbox’ of versatile and user-friendly applications allowing for rapid construction of computational workflows in proteomics. Results: We describe a set of tools for proteomics data analysis—TOPP, The OpenMS Proteomics Pipeline. TOPP provides a set of computational tools which can be easily combined into analysis pipelines even by non-experts and can be used in proteomics workflows. These applications range from useful utilities (file format conversion, peak picking) over wrapper applications for known applications (e.g. Mascot) to completely new algorithmic techniques for data reduction and data analysis. We anticipate that TOPP will greatly facilitate rapid prototyping of proteomics data evaluation pipelines. As such, we describe the basic concepts and the current abilities of TOPP and illustrate these concepts in the context of two example applications: the identification of peptides from a raw dataset through database search and the complex analysis of a standard addition experiment for the absolute quantitation of biomarkers. The latter example demonstrates TOPP's ability to construct flexible analysis pipelines in support of complex experimental setups. Availability: The TOPP components are available as open-source software under the lesser GNU public license (LGPL). Source code is available from the project website at Author Webpage Contact:oliver.kohlbacher@uni-tuebingen.de

Keywords

This publication has 24 references indexed in Scilit:

Autumn 2005 Workshop of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) Geneva, September, 4–6, 2005
Proteomics, 2006
MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data
Bioinformatics, 2006
A Software Suite for the Generation and Comparison of Peptide Arrays from Sets of Data Collected by Liquid Chromatography-Mass Spectrometry
Molecular & Cellular Proteomics, 2005
MSight: An image analysis software for liquid chromatography‐mass spectrometry
Proteomics, 2005
Taverna: a tool for the composition and enactment of bioinformatics workflows
Bioinformatics, 2004
A Tool To Visualize and Evaluate Data Obtained by Liquid Chromatography-Electrospray Ionization-Mass Spectrometry
Analytical Chemistry, 2004
Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database Search
Analytical Chemistry, 2002
Emboss opens up sequence analysis
Briefings in Bioinformatics, 2002
Probability-based protein identification by searching sequence databases using mass spectrometry data
Electrophoresis, 1999
Geometric hashing: an overview
IEEE Computational Science and Engineering, 1997