Identifying Proteins Using Matrix-Assisted Laser Desorption/Ionization In-Source Fragmentation Data Combined with Database Searching

Abstract
Metastable ion decay in matrix-assisted laser desorption/ionization (MALDI) has become a routine method for obtaining primary structures of peptides. Significant fragmentation occurs in the MALDI ion source and can be observed via delayed ion extraction TOF-MS. In-source decay (ISD) can provide C- and N-terminal primary sequence data for even moderate-sized peptides (n series fragmentation that occurs in ISD has been exploited to obtain partial C-terminal sequences for proteins as large as human apotransferrin (75 kDa). Two approaches for combining this ISD MALDI-generated partial sequence information with protein database searching techniques are presented. In one approach, cyanogen bromide is used to cleave relatively large peptide fragments from a sample of human apotransferrin. One of the larger cleavage products (6034.84 Da) was isolated by HPLC and subjected to ISD MALDI analysis. An easily identified cn fragment ion series allowed two noncontiguous segments of the peptide's sequence to be determined (about 55% of the total sequence). This partial sequence information was used to search protein and oligonucleotide sequence databases. In addition to uniquely identifying human apotransferrin in a protein sequence database, an example of the use of this ISD MALDI-determined partial sequence information to search expressed sequence tag databases is presented. Such searches have the potential for rapidly identifying new genes that code for target proteins. An alternate approach for obtaining partial sequence information on proteins is also demonstrated that utilizes ISD MALDI fragmentation of the intact protein to generate partial sequence information. This approach is shown to generate about 5−7% of a protein's sequence, usually near the C-terminus of the protein. Examples of the ISD MALDI fragmentation data obtained from intact (reduced) human apotransferrin and intact (nonreduced) bovine serum albumin (66 kDa) proteins are presented.

This publication has 49 references indexed in Scilit: