Ancient DNA (aDNA) sequences have been proliferating rapidly over the past two decades, with datasets now ranging from population-scale genetic profiles (
Lambert et al. 2002;
Shapiro et al. 2004) to palaeogenomic libraries (e.g.
Noonan et al. 2005). In many cases, the ages of aDNA sequences are known from radiocarbon dating or stratigraphic context, allowing detailed investigations of molecular evolutionary processes through time (
Drummond et al. 2002). These known ages can be used as calibrations for estimating substitution rates, divergence times and various demographic parameters in the absence of any other calibration points (
Rambaut 2000;
Drummond et al. 2002).
One of the notable characteristics of aDNA datasets is that they have generally yielded elevated estimates of substitution rates (for a summary, see
Ho et al. (2007)). Typically, such estimates have been intermediate between the mutation rates inferred from pedigree studies (e.g.
Denver et al. 2000;
Howell et al. 2003) and the long-term substitution rates obtained from phylogenetic studies (e.g.
Brown et al. 1979). These results suggest that rate estimates depend on the length of the observational period, with higher rates being measured over shorter periods of time (
Ho et al. 2005). Hypothesized causes for this pattern include the persistence of slightly deleterious mutations, sequence damage, calibration error or substitutional saturation (Ho
et al.
2005,
2007;
Woodhams 2006). Although the relative contributions of these factors are difficult to quantify, it is clear that the time dependency of rate estimates represents a considerable problem for molecular studies of recent evolutionary events.
Recently, the ability of Bayesian phylogenetic analysis to infer substitution rates from dated aDNA sequences was questioned (
Emerson 2007). This criticism was based on anomalous results obtained from a sequence analysis of four dated Neanderthals and four modern humans, in which
Emerson (2007) estimated the substitution rate to be 1.98 substitutions per site

Myr
−1. A subsequent re-analysis was unable to replicate these findings (
Ho et al. 2007). Validation of various aspects of the method has been performed previously (Drummond
et al.
2002,
2006), but not in a simulation study using parameter settings that are biologically relevant to aDNA data. Comprehensive testing of the method is desirable due to its pivotal role in many aDNA studies (e.g.
Lambert et al. 2002;
Shapiro et al. 2004).
In order to assess the accuracy and precision of the substitution rates estimated using Bayesian analysis, we present a study based on datasets generated by simulation under known conditions. The robustness of the estimates to the assumed demographic model is also assessed. We then estimate intraspecific substitution rates from a collection of nearly 1500 sequences obtained from a diverse range of animals, plants and bacteria. Demographic model selection is performed using Bayes factors.