The Impact of Fossils and Taxon Sampling on Ancient Molecular Dating Analyses

Abstract
The number and complexity of molecular dating studies has increased over the past decade. Along with a broadening acceptance of their utility has come significant controversy over the methods and models that are appropriate, as well as the accuracy of the estimates yielded by molecular clock analyses. Radically different age estimates have been published for the same divergences from analyses of different datasets with different fossil constraints obtained with different methods, and the underlying explanation for these differences is often unclear. Here we utilize two previously published datasets to examine the effect of fossil calibrations and taxon sampling on the age estimates for two deep eukaryote divergences in an attempt to discern the relative impact of these factors. Penalized likelihood, non-parametric rate smoothing, and Bayesian methods were utilized to generate age estimates for the origin of the Metazoa from a 7-gene dataset and for the divergence of Eukaryotes from a 129-gene dataset. From these analyses, it is clear that the fossil calibrations chosen and the method for applying constraints to these nodes have a large impact on age estimates, while the degree of taxon sampling within a dataset is less important in terms of the resulting age estimates. Concerns and recommendations for addressing these two factors when initiating a dating analysis are discussed.