Counting the Zinc-Proteins Encoded in the Human Genome
- 15 December 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 5 (1), 196-201
- https://doi.org/10.1021/pr050361j
Abstract
Metalloproteins are proteins capable of binding one or more metal ions, which may be required for their biological function, for regulation of their activities, or for structural purposes. Genome sequencing projects have provided a huge number of protein primary sequences. Although a rich and ever-increasing portfolio of bioinformatic tools has enabled many elaborate analyses and annotations, metal-binding properties remain difficult to predict as well as to investigate experimentally. Consequently, present knowledge about metalloproteins is only partial. The present bioinformatic research proposes a strategy to answer the question of how many and which proteins encoded in the human genome may require zinc for their physiological function. This is achieved by a combination of approaches: (i) searching the proteome for zinc-binding patterns that are, in turn, obtained from all available X-ray data; (ii) using libraries of metal-binding protein domains based on multiple sequence alignments of known metalloproteins obtained from the Pfam database; and (iii) mining the annotations of human gene sequences, which are based on any type of information available. It is found that 1684 proteins in the human proteome are independently identified by all three approaches as zinc-proteins, 746 are identified by two, and 777 are identified by only one method. By assuming that all proteins identified by at least two approaches are truly zinc-binding and inspecting the proteins identified by a single method, it can be proposed that ca. 2800 human proteins are potentially zinc-binding in vivo, corresponding to 10% of the human proteome, with an uncertainty of 400 sequences. Available functional information suggests that the large majority of human zinc-binding proteins are involved in the regulation of gene expression.
The most abundant class of zinc-binding proteins in humans is that of zinc-fingers, with Cys4 and Cys2His2 being the most common types of coordination environment.
Keywords: zinc • metalloproteins • zinc finger • metalloprotease
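The tallying step described in the abstract (grouping proteins by how many of the three approaches identify them) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `tally_by_support`, the method labels, and the protein IDs are hypothetical, and only the three counts at the end are taken from the abstract.

```python
from collections import Counter


def tally_by_support(hits_by_method):
    """Return (support, by_level).

    support  maps each protein ID to the number of methods that flagged it.
    by_level maps a support level (1, 2, 3, ...) to the number of proteins
    flagged by exactly that many methods.
    """
    support = Counter()
    for hits in hits_by_method.values():
        support.update(set(hits))  # each method counts at most once per protein
    by_level = Counter(support.values())
    return support, by_level


# Toy input: hypothetical protein IDs, one set of hits per approach.
hits = {
    "structure_patterns": {"P1", "P2", "P3", "P5"},
    "pfam_domains": {"P1", "P2", "P4"},
    "annotations": {"P1", "P3", "P6"},
}
support, by_level = tally_by_support(hits)

# Counts reported in the abstract.
all_three, exactly_two, only_one = 1684, 746, 777

# Proteins accepted outright under the "at least two approaches" assumption.
accepted = all_three + exactly_two
# Upper bound if every singly identified protein were also zinc-binding.
candidates = accepted + only_one

print(dict(by_level))
print(accepted, candidates)
```

With the reported counts, `accepted` is 2430 and `candidates` is 3207, so the stated estimate of ca. 2800 with an uncertainty of 400 lies between these two bounds.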