Automated reprocessing pipeline for searching heterogeneous mass spectrometric data of the HUPO Brain Proteome Project pilot phase
- 11 September 2006
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 6 (18) , 5015-5029
- https://doi.org/10.1002/pmic.200600294
Abstract
The newly available techniques for sensitive proteome analysis and the resulting amount of data require a new bioinformatics focus on automatic methods for spectrum reprocessing and peptide/protein validation. Manual validation of results in such studies is not feasible and objective enough for quality relevant interpretation. The necessity for tools enabling an automatic quality control is, therefore, important to produce reliable and comparable data in such big consortia as the Human Proteome Organization Brain Proteome Project. Standards and well‐defined processing pipelines are important for these consortia. We show a way for choosing the right database model, through collecting data, processing these with a decoy database and end up with a quality controlled protein list merged from several search engines, including a known false‐positive rate.Keywords
This publication has 19 references indexed in Scilit:
- Randomized Sequence Databases for Tandem Mass Spectrometry Peptide and Protein IdentificationOMICS: A Journal of Integrative Biology, 2005
- 5th HUPO BPP Bioinformatics Meeting at the European Bioinformatics Institute in Hinxton, UK - Setting the Analysis FrameProteomics, 2005
- Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigationsNature Methods, 2005
- DBToolkit: processing protein databases for peptide-centric proteomicsBioinformatics, 2005
- HUPO Brain Proteome Project Pilot Studies: Bioinformatics at WorkProteomics, 2005
- Towards data management of the HUPO Human Brain Proteome Project pilot phaseProteomics, 2004
- The International Protein Index: An integrated database for proteomics experimentsProteomics, 2004
- Interpretation of mass spectrometry data for high-throughput proteomicsAnalytical and Bioanalytical Chemistry, 2003
- Initial sequencing and analysis of the human genomeNature, 2001
- Method to Correlate Tandem Mass Spectra of Modified Peptides to Amino Acid Sequences in the Protein DatabaseAnalytical Chemistry, 1995