Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates
- 14 April 1992
- journal article
- research article
- Published by Hindawi Limited in Genetics Research
- Vol. 59 (2), 139-147
- https://doi.org/10.1017/s0016672300030354
Abstract
It is known that under neutral mutation at a known mutation rate a sample of nucleotide sequences, within which there is assumed to be no recombination, allows estimation of the effective size of an isolated population. This paper investigates the case of very long sequences, where each pair of sequences allows a precise estimate of the divergence time of those two gene copies. The average divergence time of all pairs of copies estimates twice the effective population number and an estimate can also be derived from the number of segregating sites. One can alternatively estimate the genealogy of the copies. This paper shows how a maximum likelihood estimate of the effective population number can be derived from such a genealogical tree. The pairwise and the segregating sites estimates are shown to be much less efficient than this maximum likelihood estimate, and this is verified by computer simulation. The result implies that there is much to gain by explicitly taking the tree structure of these genealogies into account.Keywords
This publication has 35 references indexed in Scilit:
- Gene Genealogies within the Organismal Pedigrees of Random-Mating PopulationsEvolution, 1990
- Gene Trees and Organismal Histories: A Phylogenetic Approach to Population BiologyEvolution, 1989
- The coalescent in two partially isolated diffusion populationsGenetics Research, 1988
- The Infinitely-Many-Sites Model as a Measure-Valued DiffusionThe Annals of Probability, 1987
- Mitochondrial DNA and human evolutionNature, 1987
- On the genealogy of nested subsamples from a haploid populationAdvances in Applied Probability, 1984
- Testing the Constant-Rate Neutral Allele Model with Protein Sequence DataEvolution, 1983
- On the genealogy of large populationsJournal of Applied Probability, 1982
- The probabilities of rooted tree-shapes generated by random bifurcationAdvances in Applied Probability, 1971
- Random processes in geneticsMathematical Proceedings of the Cambridge Philosophical Society, 1958