A New Approach for Using Genome Scans to Detect Recent Positive Selection in the Human Genome

Top Cited Papers
Open Access
Abstract
Genome-wide scanning for signals of recent positive selection is essential for a comprehensive and systematic understanding of human adaptation. Here, we present a genomic survey of recent local selective sweeps, especially aimed at those nearly or recently completed. A novel approach was developed for such signals, based on contrasting the extended haplotype homozygosity (EHH) profiles between populations. We applied this method to the genome single nucleotide polymorphism (SNP) data of both the International HapMap Project and Perlegen Sciences, and detected widespread signals of recent local selection across the genome, consisting of both complete and partial sweeps. A challenging problem of genomic scans of recent positive selection is to clearly distinguish selection from neutral effects, given the high sensitivity of the test statistics to departures from neutral demographic assumptions and the lack of a single, accurate neutral model of human history. We therefore developed a new procedure that is robust across a wide range of demographic and ascertainment models, one that indicates that certain portions of the genome clearly depart from neutrality. Simulations of positive selection showed that our tests have high power towards strong selection sweeps that have undergone fixation. Gene ontology analysis of the candidate regions revealed several new functional groups that might help explain some important interpopulation differences in phenotypic traits. The evolution of new functions and adaptation to new environments occurs by positive selection, whereby beneficial mutations increase in frequency and eventually become fixed in a population. Detecting such selection in humans is crucial for understanding the importance of past genetic adaptations and their role in contemporary common diseases. Methods have already been developed for detecting the signature of positive selection in large, genome-scale datasets (such as the “HapMap”). Positive selection is expected to more rapidly increase the frequency of an allele, and hence, the length of the haplotype (extent of DNA segment) associated with the selected allele, relative to those that are not under selection. Such methods compare haplotype lengths within a single population. Here, we introduce a new method that compares the lengths of haplotypes associated with the same allele in different populations. We demonstrate that our method has greater power to detect selective sweeps that are fixed or nearly so, and we construct a statistical framework that shows that our method reliably detects positive selection. We applied our method to the HapMap data and identified approximately 500 candidate regions in the human genome that show a signature of recent positive selection. Further targeted studies of these regions should reveal important genetic adaptations in our past.