“REVERSE ECOLOGY” AND THE POWER OF POPULATION GENOMICS
Open Access
- 5 December 2008
- Vol. 62 (12), 2984-2994
- https://doi.org/10.1111/j.1558-5646.2008.00486.x
Abstract
Rapid and inexpensive sequencing technologies are making it possible to collect whole genome sequence data on multiple individuals from a population. This type of data can be used to quickly identify genes that control important ecological and evolutionary phenotypes by finding the targets of adaptive natural selection, and we therefore refer to such approaches as “reverse ecology.” To quantify the power gained in detecting positive selection using population genomic data, we compare three statistical methods for identifying targets of selection: the McDonald–Kreitman test, the mkprf method, and a likelihood implementation for detecting dN/dS > 1. Because the first two methods use polymorphism data we expect them to have more power to detect selection. However, when applied to population genomic datasets from human, fly, and yeast, the tests using polymorphism data were actually weaker in two of the three datasets. We explore reasons why the simpler comparative method has identified more genes under selection, and suggest that the different methods may really be detecting different signals from the same sequence data. Finally, we find several statistical anomalies associated with the mkprf method, including an almost linear dependence between the number of positively selected genes identified and the prior distributions used. We conclude that interpreting the results produced by this method should be done with some caution.Keywords
This publication has 58 references indexed in Scilit:
- Proportionally more deleterious genetic variation in European than in African populationsNature, 2008
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- A Scan for Positively Selected Genes in the Genomes of Humans and ChimpanzeesPLoS Biology, 2005
- Population History and Natural Selection Shape Patterns of Genetic Variation in 132 GenesPLoS Biology, 2004
- The Genetic Architecture of Parallel Armor Plate Reduction in Threespine SticklebacksPLoS Biology, 2004
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- Life with 6000 GenesScience, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Adaptive protein evolution at the Adh locus in DrosophilaNature, 1991
- Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selectionNature, 1988