Molecular clocks let biologists estimate when two species diverged from a common ancestor by measuring how much their DNA sequences differ. The core idea is straightforward: if mutations accumulate at a roughly constant rate over time, then the amount of genetic difference between two species is proportional to how long ago they split. This concept is one of the most powerful tools in evolutionary biology for putting dates on the tree of life.

The theoretical foundation comes from neutral theory. Because most mutations at the molecular level are selectively neutral (they don't help or hurt the organism), they accumulate through genetic drift at a rate that approximates the mutation rate itself. That predictability is what makes the "clock" tick.

Strict vs. relaxed clocks

A strict molecular clock assumes the substitution rate is uniform across all lineages and constant over time. This is a useful starting assumption, but it often doesn't hold up. Mice and elephants, for example, have very different generation times, so their per-year mutation rates differ substantially.

A relaxed molecular clock accounts for this by allowing rates to vary between branches of a phylogenetic tree. Primates, for instance, tend to evolve more slowly at the molecular level than rodents. Relaxed clock models are now standard in most modern analyses because they produce more realistic divergence estimates across diverse lineages.

Concept of molecular clocks, Evolution - Wikipedia

Estimating divergence times

The basic logic for estimating when two species diverged works like this:

Compare DNA sequences between the two species and quantify how different they are.
Calculate genetic distance using a substitution model that corrects for multiple mutations at the same site. Common models include Jukes-Cantor (simplest, assumes equal substitution rates) and Kimura two-parameter (distinguishes transitions from transversions).
Apply the molecular clock equation to convert distance into time:

$T = \frac{D}{2R}$

where $T$ is the time since divergence, $D$ is the genetic distance, and $R$ is the substitution rate per unit time. You divide by 2 because mutations accumulate independently along both lineages after the split.

Build a phylogenetic tree with branch lengths representing genetic distances, then convert those lengths to time using the estimated rate.

Modern approaches often use Bayesian methods (implemented in software like BEAST, MrBayes, or PhyloBayes) that incorporate uncertainty in rate estimates and prior knowledge from calibration data, producing a range of likely divergence times rather than a single point estimate.

Concept of molecular clocks, Phylogenetic tree - wikidoc

Factors and Calibration

Factors affecting clock accuracy

Several biological variables cause substitution rates to differ across lineages, which is exactly why strict clocks often fail:

Generation time — Organisms with shorter generations copy their DNA more often per year, accumulating more mutations. Bacteria evolve far faster per year than redwoods.
Metabolic rate — Higher metabolic rates may generate more DNA-damaging reactive oxygen species. Hummingbirds, for example, likely experience higher mutation rates than tortoises.
Population size — In small populations (like island species), genetic drift is stronger, which can fix slightly deleterious mutations that would be purged in larger populations. This shifts the effective substitution rate.
Selection pressure — Genes under strong positive selection (like immune system genes) evolve faster than the neutral rate, while genes under strong purifying selection (like ribosomal RNA genes) evolve much more slowly.
DNA repair efficiency — Species with more effective DNA repair mechanisms accumulate fewer mutations. Naked mole rats, for instance, have unusually robust DNA repair compared to other rodents.
Environmental mutagens — UV radiation, chemical exposure, and other environmental factors can elevate mutation rates in exposed lineages.

Because of all this variation, choosing the right genes and the right clock model matters enormously for getting accurate divergence estimates.

Calibrating the clock

A molecular clock is useless without calibration. You need at least one known time point to anchor the rate. Here are the main calibration strategies:

Fossil calibration — The most common approach. A dated fossil provides a minimum age for the divergence of the group it belongs to. For example, Archaeopteryx (~150 million years ago) sets a minimum age for the bird-dinosaur split. Fossils give minimum ages because the true divergence almost certainly predates the oldest known fossil.
Biogeographic events — Geological events with known dates can calibrate splits. The formation of the Isthmus of Panama (~3 million years ago) separated marine populations on either side, providing a calibration point for species that diverged because of it.
Secondary calibrations — These use divergence times estimated in previous molecular studies as calibration points for new analyses. They're convenient but carry forward any errors from the original study.
Tip-dating — This method incorporates extinct taxa (known from fossils with radiometric dates) directly into the phylogenetic analysis, allowing the fossil ages themselves to inform rate estimation.
Cross-validation — Testing consistency across multiple independent calibration points helps identify unreliable calibrations. If one fossil-based estimate conflicts sharply with several others, it may reflect a misidentified fossil or an incorrect date.

Combining multiple calibration strategies and using relaxed clock models gives the most robust divergence time estimates. No single calibration point should be trusted in isolation.

2,589 studying →