“REVERSE ECOLOGY” AND THE POWER OF POPULATION GENOMICS

Open Access

5 December 2008

journal article
Published by Wiley in Evolution

Vol. 62 (12), 2984-2994
https://doi.org/10.1111/j.1558-5646.2008.00486.x

Abstract

Rapid and inexpensive sequencing technologies are making it possible to collect whole genome sequence data on multiple individuals from a population. This type of data can be used to quickly identify genes that control important ecological and evolutionary phenotypes by finding the targets of adaptive natural selection, and we therefore refer to such approaches as “reverse ecology.” To quantify the power gained in detecting positive selection using population genomic data, we compare three statistical methods for identifying targets of selection: the McDonald–Kreitman test, the mkprf method, and a likelihood implementation for detecting d_N/d_S > 1. Because the first two methods use polymorphism data we expect them to have more power to detect selection. However, when applied to population genomic datasets from human, fly, and yeast, the tests using polymorphism data were actually weaker in two of the three datasets. We explore reasons why the simpler comparative method has identified more genes under selection, and suggest that the different methods may really be detecting different signals from the same sequence data. Finally, we find several statistical anomalies associated with the mkprf method, including an almost linear dependence between the number of positively selected genes identified and the prior distributions used. We conclude that interpreting the results produced by this method should be done with some caution.

Keywords

This publication has 58 references indexed in Scilit:

Proportionally more deleterious genetic variation in European than in African populations
Nature, 2008
PAML 4: Phylogenetic Analysis by Maximum Likelihood
Molecular Biology and Evolution, 2007
A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees
PLoS Biology, 2005
Population History and Natural Selection Shape Patterns of Genetic Variation in 132 Genes
PLoS Biology, 2004
The Genetic Architecture of Parallel Armor Plate Reduction in Threespine Sticklebacks
PLoS Biology, 2004
Sequencing and comparison of yeast species to identify genes and regulatory elements
Nature, 2003
Life with 6000 Genes
Science, 1996
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
Nucleic Acids Research, 1994
Adaptive protein evolution at the Adh locus in Drosophila
Nature, 1991
Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection
Nature, 1988

Cited by 126 articles