Proteomic Parsimony through Bipartite Graph Analysis Improves Accuracy and Transparency
- 4 August 2007
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 6 (9), 3549-3557
- https://doi.org/10.1021/pr070230d
Abstract
Assembling peptides identified from LC−MS/MS spectra into a list of proteins is a critical step in analyzing shotgun proteomics data. As one peptide sequence can be mapped to multiple proteins in a database, naïve protein assembly can substantially overstate the number of proteins found in samples. We model the peptide−protein relationships in a bipartite graph and use efficient graph algorithms to identify protein clusters with shared peptides and to derive the minimal list of proteins. We test the effects of this parsimony analysis approach using MS/MS data sets generated from a defined human protein mixture, a yeast whole cell extract, and a human serum proteome after MARS column depletion. The results demonstrate that the bipartite parsimony technique not only simplifies protein lists but also improves the accuracy of protein identification. We use bipartite graphs for the visualization of the protein assembly results to render the parsimony analysis process transparent to users. Our approach also groups functionally related proteins together and improves the comprehensibility of the results. We have implemented the tool in the IDPicker package. The source code and binaries for this protein assembly pipeline are available under Mozilla Public License at the following URL: http://www.mc.vanderbilt.edu/msrc/bioinformatics/.
Keywords: parsimony analysis • bipartite graph • shotgun proteomics • LC−MS/MS • protein assembly
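The parsimony idea in the abstract can be illustrated with a minimal Python sketch. It is not the IDPicker implementation: the function names, the toy peptide and protein labels, and the use of a greedy set-cover heuristic to approximate the minimal protein list are all assumptions made for illustration. The sketch builds the peptide-protein bipartite graph, merges proteins that match exactly the same peptides, splits the graph into clusters of proteins linked by shared peptides, and then picks a small set of protein groups that explains every peptide in each cluster.

```python
from collections import defaultdict


def build_graph(peptide_to_proteins):
    """Invert a peptide -> proteins mapping into protein -> set of peptides."""
    protein_to_peptides = defaultdict(set)
    for peptide, proteins in peptide_to_proteins.items():
        for protein in proteins:
            protein_to_peptides[protein].add(peptide)
    return dict(protein_to_peptides)


def collapse_indistinguishable(protein_to_peptides):
    """Merge proteins with identical peptide sets into one group node."""
    by_peptides = defaultdict(list)
    for protein, peptides in protein_to_peptides.items():
        by_peptides[frozenset(peptides)].append(protein)
    return {tuple(sorted(names)): set(peps) for peps, names in by_peptides.items()}


def connected_clusters(group_to_peptides):
    """Return connected components: groups linked through shared peptides."""
    peptide_to_groups = defaultdict(set)
    for group, peptides in group_to_peptides.items():
        for peptide in peptides:
            peptide_to_groups[peptide].add(group)

    seen, clusters = set(), []
    for start in sorted(group_to_peptides):
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], []
        while stack:  # depth-first walk over group -> peptide -> group edges
            group = stack.pop()
            component.append(group)
            for peptide in group_to_peptides[group]:
                for neighbor in peptide_to_groups[peptide]:
                    if neighbor not in seen:
                        seen.add(neighbor)
                        stack.append(neighbor)
        clusters.append(sorted(component))
    return clusters


def greedy_cover(cluster, group_to_peptides):
    """Greedy set cover (an approximation, assumed here): repeatedly pick the
    group that explains the most still-unexplained peptides."""
    uncovered = set().union(*(group_to_peptides[g] for g in cluster))
    chosen = []
    while uncovered:
        best = max(cluster, key=lambda g: len(group_to_peptides[g] & uncovered))
        chosen.append(best)
        uncovered -= group_to_peptides[best]
    return chosen


def minimal_protein_list(peptide_to_proteins):
    """Run the full sketch: graph, collapse, cluster, cover."""
    groups = collapse_indistinguishable(build_graph(peptide_to_proteins))
    result = []
    for cluster in connected_clusters(groups):
        result.extend(greedy_cover(cluster, groups))
    return result


# Hypothetical toy data: five database proteins match five peptides.
example = {
    "PEP1": ["A"],
    "PEP2": ["A", "B"],
    "PEP3": ["A", "C"],
    "PEP4": ["D", "E"],
    "PEP5": ["D", "E"],
}
print(minimal_protein_list(example))  # [('A',), ('D', 'E')]
```

In this toy case a naïve assembly would report five proteins, while the sketch reports two groups: protein A alone accounts for the peptides shared with B and C, and D and E are reported together because no peptide distinguishes them. A greedy cover is not guaranteed to be minimal in general, so it should be read as a stand-in for whichever exact or heuristic method a real pipeline uses.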