Probability-based protein identification by searching sequence databases using mass spectrometry data

1 December 1999

journal article
Published by Wiley in Electrophoresis

Vol. 20 (18), 3551-3567
https://doi.org/10.1002/(sici)1522-2683(19991201)20:18<3551::aid-elps3551>3.0.co;2-2

Abstract

Several algorithms have been described in the literature for protein identification by searching a sequence database using mass spectrometry data. In some approaches, the experimental data are peptide molecular weights from the digestion of a protein by an enzyme. Other approaches use tandem mass spectrometry (MS/MS) data from one or more peptides. Still others combine mass data with amino acid sequence data. We present results from a new computer program, Mascot, which integrates all three types of search. The scoring algorithm is probability based, which has a number of advantages: (i) A simple rule can be used to judge whether a result is significant or not. This is particularly useful in guarding against false positives. (ii) Scores can be com pared with those from other types of search, such as sequence homology. (iii) Search parameters can be readily optimised by iteration. The strengths and limitations of probability‐based scoring are discussed, particularly in the context of high throughput, fully automated protein identification.

Keywords

This publication has 28 references indexed in Scilit:

Proteomics: quantitative and physical mapping of cellular proteins
Trends in Biotechnology, 1999
Database searching using mass spectrometry data
Electrophoresis, 1998
Mass Spectrometric Sequencing of Proteins from Silver-Stained Polyacrylamide Gels
Analytical Chemistry, 1996
Method to Correlate Tandem Mass Spectra of Modified Peptides to Amino Acid Sequences in the Protein Database
Analytical Chemistry, 1995
Error-Tolerant Identification of Peptides in Sequence Databases by Peptide Sequence Tags
Analytical Chemistry, 1994
Peptide Mass Maps: A Highly Informative Approach to Protein Identification
Analytical Biochemistry, 1993
Protein Identification by Mass Profile Fingerprinting
Biochemical and Biophysical Research Communications, 1993
Rapid identification of proteins by peptide-mass fingerprinting
Current Biology, 1993
Identifying proteins from two-dimensional gels by molecular mass searching of peptide fragments in protein sequence databases.
Proceedings of the National Academy of Sciences, 1993
Use of mass spectrometric molecular weight information to identify proteins in sequence databases
Journal of Mass Spectrometry, 1993

Cited by 7026 articles