Increased Identification of Peptides by Enhanced Data Processing of High-Resolution MALDI TOF/TOF Mass Spectra Prior to Database Searching
- 14 September 2004
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 76 (20), 6017-6028
- https://doi.org/10.1021/ac049247v
Abstract
This paper presents application of sequential enhanced data processing procedures to high-resolution tandem mass spectra for identification of peptides using the Mascot database search algorithm. A strategy for (1) selection of fragment ion peaks from MS/MS spectra, (2) utilization of improved mass accuracy of the precursor ions, and (3) wavelet denoising of the mass spectra prior to fragment ion selection have been developed. The number of peptide identifications obtained using the enhanced processing was then compared with that obtained using software provided by the instrument manufacturer. Approximately 9000 MS/MS spectra acquired by the Applied Biosystems 4700 TOF/TOF MS instrument were used as a model data set. After application of the new processing, an increase of 33% unique peptides and 22% protein identifications with at least two unique peptides were found. The influence of the processing on the percentage of false positives, estimated by searching against a randomized database, was estimated to increase false positive identifications from 2.7 to 3.9%, which was still below the 5% error rate specified in the Mascot search. These data processing approaches increase the amount of information that can be extracted from LC-MS analysis without the necessity of additional experiments.Keywords
This publication has 25 references indexed in Scilit:
- Evaluation of algorithms for protein identification from sequence databases using mass spectrometry dataProteomics, 2004
- Result‐driven strategies for protein identification and quantitation – a way to optimize experimental design and derive reliable resultsProteomics, 2004
- A Universal Denoising and Peak Picking Algorithm for LC−MS Based on Matched Filtration in the Chromatographic Time DomainAnalytical Chemistry, 2003
- Preprocessing of tandem mass spectrometric data to support automatic protein identificationProteomics, 2003
- Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database SearchAnalytical Chemistry, 2002
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- Proteome and proteomics: New technologies, new concepts, and new wordsElectrophoresis, 1998
- Direct Analysis and Identification of Proteins in Mixtures by LC/MS/MS and Database Searching at the Low-Femtomole LevelAnalytical Chemistry, 1997
- Femtomole sequencing of proteins from polyacrylamide gels by nano-electrospray mass spectrometryNature, 1996
- Spin‐coated samples for high resolution matrix‐assisted laser desorption/ionization time‐of‐flight mass spectrometry of large proteinsRapid Communications in Mass Spectrometry, 1995