ProteomeCommons.org IO Framework: reading and writing multiple proteomics data formats
Open Access
- 22 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (2), 262-263
- https://doi.org/10.1093/bioinformatics/btl573
Abstract
Motivation: Effective use of proteomics data, specifically mass spectrometry data, relies on the ability to read and write the many mass spectrometer file formats. Even with vendor-specific libraries and vendor-neutral file formats such as mzXML and mzData, it can be difficult to extract raw data files in a form suitable for batch processing and basic research. Introduced here is the ProteomeCommons.org Input and Output Framework, abbreviated to IO Framework, which is designed to abstractly represent mass spectrometry data. This project is a public, open-source, free-to-use framework that supports most mass spectrometry data formats, including current formats, legacy formats and proprietary formats that require a vendor-specific library in order to operate. The IO Framework includes an online tool for non-programmers and a set of libraries that developers may use to convert between various proteomics file formats.
Availability: The current source code and documentation for the ProteomeCommons.org IO Framework are freely available at
Contact: jfalkner@umich.edu
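The idea of representing spectra abstractly and converting between formats can be illustrated with a minimal sketch. It is written in Python for brevity and is not the IO Framework's API: the names `PeakList`, `read_mgf` and `write_dta` are hypothetical, the choice of MGF as input and DTA as output is an assumption made for illustration, and the parser handles only the basic `PEPMASS`, `CHARGE` and peak lines.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

PROTON_MASS = 1.007276  # approximate proton mass in Da


@dataclass
class PeakList:
    """Hypothetical format-neutral spectrum: precursor plus (m/z, intensity) peaks."""
    precursor_mz: float
    charge: int
    peaks: List[Tuple[float, float]] = field(default_factory=list)


def read_mgf(text: str) -> List[PeakList]:
    """Parse MGF-style text into format-neutral PeakList objects."""
    spectra: List[PeakList] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line == "BEGIN IONS":
            current = PeakList(precursor_mz=0.0, charge=1)
        elif line == "END IONS":
            if current is not None:
                spectra.append(current)
            current = None
        elif current is None or not line:
            continue  # skip blank lines and anything outside a spectrum block
        elif line.startswith("PEPMASS="):
            # PEPMASS may carry an optional intensity after the m/z value
            current.precursor_mz = float(line.split("=", 1)[1].split()[0])
        elif line.startswith("CHARGE="):
            # "2+" becomes 2; polarity is ignored in this sketch
            current.charge = int(line.split("=", 1)[1].rstrip("+-"))
        elif "=" in line:
            continue  # other headers such as TITLE are ignored here
        else:
            mz, intensity = line.split()[:2]
            current.peaks.append((float(mz), float(intensity)))
    return spectra


def write_dta(spectrum: PeakList) -> str:
    """Serialize one PeakList as DTA-style text (first line: MH+ mass and charge)."""
    mh = spectrum.precursor_mz * spectrum.charge - (spectrum.charge - 1) * PROTON_MASS
    lines = [f"{mh:.4f} {spectrum.charge}"]
    lines += [f"{mz:.4f} {intensity:.1f}" for mz, intensity in spectrum.peaks]
    return "\n".join(lines) + "\n"
```

Because every reader produces the same intermediate object and every writer consumes it, supporting an additional format in a design like this would mean adding one reader or one writer rather than a converter for each pair of formats.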