A scalable strategy for high-throughput GFP tagging of endogenous human proteins

Abstract
A central challenge of the post-genomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9/sgRNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.SIGNIFICANCE STATEMENT: The function of a large fraction of the human proteome still remains poorly characterized. Tagging proteins with a functional sequence is a powerful way to access function, and inserting tags at endogenous genomic loci allows the preservation of a near-native cellular background. To characterize the cellular role of human proteins in a systematic manner and in a native context, we developed a method for tagging endogenous human proteins with GFP that is both rapid and readily applicable at a genome-wide scale. Our approach allows studying both localization and interaction partners of the protein target. Our results pave the way for the large-scale generation of endogenously tagged human cell lines for a systematic functional interrogation of the human proteome.