Efficiency of Database Search for Identification of Mutated and Modified Proteins via Mass Spectrometry

Open Access

1 February 2001

journal article
research article
Published by Cold Spring Harbor Laboratory in Genome Research

Vol. 11 (2), 290-299
https://doi.org/10.1101/gr.154101

Abstract

Although protein identification by matching tandem mass spectra (MS/MS) against protein databases is a widespread tool in mass spectrometry, the question about reliability of such searches remains open. Absence of rigorous significance scores in MS/MS database search makes it difficult to discard random database hits and may lead to erroneous protein identification, particularly in the case of mutated or post-translationally modified peptides. This problem is especially important for high-throughput MS/MS projects when the possibility of expert analysis is limited. Thus, algorithms that sort out reliable database hits from unreliable ones and identify mutated and modified peptides are sought. Most MS/MS database search algorithms rely on variations of the Shared Peaks Count approach that scores pairs of spectra by the peaks (masses) they have in common. Although this approach proved to be useful, it has a high error rate in identification of mutated and modified peptides. We describe new MS/MS database search tools, MS-CONVOLUTION andMS-ALIGNMENT, which implement the spectral convolution and spectral alignment approaches to peptide identification. We further analyze these approaches to identification of modified peptides and demonstrate their advantages over the Shared Peaks Count. We also use the spectral alignment approach as a filter in a new database search algorithm that reliably identifies peptides differing by up to two mutations/modifications from a peptide in a database.

Keywords

This publication has 17 references indexed in Scilit:

Modeling Amino Acid Replacement
Journal of Computational Biology, 2000
Mutation-Tolerant Protein Identification by Mass Spectrometry
Journal of Computational Biology, 2000
Sequence and structure-based prediction of eukaryotic protein phosphorylation sites
Journal of Molecular Biology, 1999
De NovoPeptide Sequencing via Tandem Mass Spectrometry
Journal of Computational Biology, 1999
Protein indentification using mass spectrometric information
Electrophoresis, 1998
The Importance of Protein Co- and Post-Translational Modifications in Proteome Projects
Published by Springer Nature ,1997
Mining Genomes: Correlating Tandem Mass Spectra of Modified and Unmodified Peptides to Sequences in Nucleotide Databases
Analytical Chemistry, 1995
Method to Correlate Tandem Mass Spectra of Modified Peptides to Amino Acid Sequences in the Protein Database
Analytical Chemistry, 1995
Error-Tolerant Identification of Peptides in Sequence Databases by Peptide Sequence Tags
Analytical Chemistry, 1994
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database
Journal of the American Society for Mass Spectrometry, 1994

Cited by 106 articles