Efficiency improvement of peptide identification for an organism without complete genome sequence, using expressed sequence tag database and tandem mass spectral data
- 8 December 2003
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 3 (12), 2305-2309
- https://doi.org/10.1002/pmic.200300620
Abstract
We compared peptide identification by database (DB) search methods with de novo sequencing results for proteomics studies in an organism without genome sequence information. DB searches were run against either the Expressed Sequence Tag (EST) DB of the sample organism or the NCBI nonredundant (nr) protein DB of green plants, using either the MASCOT or SEQUEST software program, and were confirmed to be as accurate as de novo sequencing. Twice as many peptides were identified from the EST DB as from the nr protein DB, even though the EST DB contains fewer entries (26 222 ESTs) than the NCBI nr protein DB (224 238). This study demonstrates that an EST DB combined with tandem mass spectra can be used reliably for high‐throughput proteomics studies in an organism without genome information.