Genes mirror geography within Europe
Top Cited Papers
- 31 August 2008
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 456 (7218), 98-101
- https://doi.org/10.1038/nature07331
Abstract
The power of the latest massively parallel synthetic DNA sequencing technologies is demonstrated in two major collaborations that shed light on the nature of genomic variation with ethnicity. The first describes the genomic characterization of an individual from the Yoruba ethnic group of west Africa. The second reports a personal genome of a Han Chinese, the group comprising 30% of the world's population. These new resources can now be used in conjunction with the Venter, Watson and NIH reference sequences. A separate study looked at genetic ethnicity on the continental scale, based on data from 1,387 individuals from more than 30 European countries. Overall there was little genetic variation between countries, but the differences that do exist correspond closely to the geographic map. Statistical analysis of the genome data places 50% of the individuals within 310 km of their reported origin. As well as its relevance for testing genetic ancestry, this work has implications for evaluating genome-wide association studies that link genes with diseases. Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations1,2,3,4,5. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing6; an individual’s DNA can be used to infer their geographic origin with surprising accuracy—often to within a few hundred kilometres.Keywords
This publication has 27 references indexed in Scilit:
- The Population Reference Sample, POPRES: A Resource for Population, Disease, and Pharmacological Genetics ResearchAmerican Journal of Human Genetics, 2008
- Identification of ten loci associated with height highlights new biological pathways in human growthNature Genetics, 2008
- Genome-wide association analysis identifies 20 loci that influence adult heightNature Genetics, 2008
- The CoLaus study: a population-based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndromeBMC Cardiovascular Disorders, 2008
- Worldwide Human Relationships Inferred from Genome-Wide Patterns of VariationScience, 2008
- PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage AnalysesAmerican Journal of Human Genetics, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- Measuring European Population Stratification with Microarray Genotype DataAmerican Journal of Human Genetics, 2007
- Principal components analysis corrects for stratification in genome-wide association studiesNature Genetics, 2006
- Reconstructing Genetic Ancestry Blocks in Admixed IndividualsAmerican Journal of Human Genetics, 2006