Estimation of Nucleotide Diversity, Disequilibrium Coefficients, and Mutation Rates from High-Coverage Genome-Sequencing Projects
- 21 August 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 25 (11), 2409-2419
- https://doi.org/10.1093/molbev/msn185
Abstract
Recent advances in sequencing strategies have made it feasible to rapidly obtain high-coverage genomic profiles of single individuals, and soon it will be economically feasible to do so with hundreds to thousands of individuals per population. While offering unprecedented power for the acquisition of population-genetic parameters, these new methods also introduce a number of challenges, most notably the need to account for the binomial sampling of parental alleles at individual nucleotide sites and to eliminate bias from various sources of sequence errors. To minimize the effects of both problems, methods are developed for generating nearly unbiased and minimum-sampling-variance estimates of a number of key parameters, including the average nucleotide heterozygosity and its variance among sites, the pattern of decomposition of linkage disequilibrium with physical distance, and the rate and molecular spectrum of spontaneously arising mutations. These methods provide a general platform for the efficient utilization of data from population-genomic surveys, while also providing guidance for the optimal design of such studies.Keywords
This publication has 18 references indexed in Scilit:
- A genome-wide view of the spectrum of spontaneous mutations in yeastProceedings of the National Academy of Sciences, 2008
- DNA from Pre-Clovis Human Coprolites in Oregon, North AmericaScience, 2008
- Population genetic analysis of shotgun assemblies of genomic sequences from multiple individualsGenome Research, 2008
- The impact of next-generation sequencing technology on geneticsTrends in Genetics, 2008
- Patterns of damage in genomic DNA sequences from a NeandertalProceedings of the National Academy of Sciences, 2007
- Accuracy and quality of massively parallel DNA pyrosequencingGenome Biology, 2007
- Estimating recombination rates from population-genetic dataNature Reviews Genetics, 2003
- A Comparison of Estimators of the Population Recombination RateMolecular Biology and Evolution, 2000
- Sequencing errors and molecular evolutionary analysis.Molecular Biology and Evolution, 1992
- Linkage disequilibrium among multiple neutral alleles produced by mutation in finite populationTheoretical Population Biology, 1975