A scalable strategy for high-throughput GFP tagging of endogenous human proteins
Preprint
- 24 May 2016
- preprint
- Published by Cold Spring Harbor Laboratory in bioRxiv
- p. 055285
- https://doi.org/10.1101/055285
Abstract
A central challenge of the post-genomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9/sgRNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.SIGNIFICANCE STATEMENT: The function of a large fraction of the human proteome still remains poorly characterized. Tagging proteins with a functional sequence is a powerful way to access function, and inserting tags at endogenous genomic loci allows the preservation of a near-native cellular background. To characterize the cellular role of human proteins in a systematic manner and in a native context, we developed a method for tagging endogenous human proteins with GFP that is both rapid and readily applicable at a genome-wide scale. Our approach allows studying both localization and interaction partners of the protein target. Our results pave the way for the large-scale generation of endogenously tagged human cell lines for a systematic functional interrogation of the human proteome.All Related Versions
- Published version: Proceedings of the National Academy of Sciences, 113 (25), 201606731.
This publication has 39 references indexed in Scilit:
- Yeast Proteome Dynamics from Single Cell Imaging and Automated AnalysisCell, 2015
- Systematic Discovery of Human Gene Function and Principles of Modular Organization through Phylogenetic ProfilingCell Reports, 2015
- The Contribution of Systematic Approaches to Characterizing the Proteins and Functions of the Endoplasmic ReticulumCold Spring Harbor Perspectives in Biology, 2013
- ‘Unknown’ proteins and ‘orphan’ enzymes: the missing half of the engineering parts list – and how to find itBiochemical Journal, 2009
- Global landscape of protein complexes in the yeast Saccharomyces cerevisiaeNature, 2006
- Finishing the euchromatic sequence of the human genomeNature, 2004
- Global analysis of protein expression in yeastNature, 2003
- Global analysis of protein localization in budding yeastNature, 2003
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- A simple and efficient method for direct gene deletion inSaccharomyces cerevisiaeNucleic Acids Research, 1993