DNA and protein sequences are some of the strongest evidence for evolution because they let you directly compare the genetic blueprints of different species. The core idea is simple: the more similar two species' DNA sequences are, the more recently they shared a common ancestor.

Some sequences are conserved across wildly different species, meaning they've barely changed over hundreds of millions of years. Histones (proteins that package DNA) and ribosomal RNA are good examples. These molecules perform such critical functions that almost any mutation in them is harmful, so natural selection keeps them nearly identical across species. The fact that a yeast cell and a human cell use nearly the same histone proteins points strongly to common ancestry.

Comparative genomics takes this further by analyzing entire genomes side by side, identifying homologous genes and sequences shared between species. From these comparisons, scientists build phylogenetic trees that visualize how species are related. Primate phylogenies, for instance, consistently show humans and chimpanzees sharing about 98-99% of their DNA, with gorillas as a slightly more distant relative.

Two types of molecular homology are worth knowing:

Orthologous genes are similar genes found in different species, inherited from a common ancestor. Hemoglobin genes across vertebrates are a classic example.
Paralogous genes are similar genes found within the same species, resulting from gene duplication. The α-globin and β-globin genes in humans arose this way.

Protein structure conservation provides additional evidence. Cytochrome c, a protein used in cellular respiration, has a nearly identical 3D structure across species from yeast to humans. Functional domains within proteins tend to be preserved through evolution because they're essential for the protein to work.

DNA sequences as evolutionary evidence, Phylogenetic Trees | Biology for Non-Majors I

Molecular clocks for evolutionary timing

A molecular clock uses the rate at which mutations accumulate in DNA or protein sequences to estimate when two species diverged from a common ancestor. The underlying assumption is that mutations build up at a roughly constant rate over time, like ticks on a clock.

These clocks need to be calibrated against known events from the fossil record or geology (like the formation of a land bridge or the age of a particular fossil).

There are two main types:

Protein-based clocks, which count amino acid substitutions
DNA-based clocks, which count nucleotide substitutions

To estimate a divergence time, you follow these steps:

Compare sequences from two species and count the number of differences (this gives you the genetic distance, $K$ ).
Determine the substitution rate ( $r$ ), typically from calibration against the fossil record.
Apply the formula: $T = \frac{K}{2r}$ , where $T$ is the time since divergence. You divide by 2 because mutations have been accumulating independently in both lineages since they split.

Molecular clocks have been used to date the human-chimpanzee split (roughly 6-7 million years ago) and to estimate when primates evolved trichromatic color vision.

Limitations to keep in mind: Mutation rates aren't perfectly constant. They vary between lineages, are affected by generation time (organisms that reproduce faster accumulate mutations faster), and can be skewed by natural selection pressures. So molecular clock estimates are always approximations, not exact dates.

DNA sequences as evolutionary evidence, Phylogenetic tree - Wikipedia

Molecular vs. other evolutionary evidence

Each type of evolutionary evidence has distinct strengths, and they work best together.

Molecular evidence provides direct genetic information and is especially powerful for organisms that don't fossilize well, like bacteria and other soft-bodied organisms. It can also reveal cryptic species, organisms that look nearly identical but are genetically distinct enough to be separate species.

The fossil record offers physical evidence of past life forms and provides morphological details you can't get from DNA alone. Fossils also let you directly observe evolutionary transitions, like the series of fossils documenting whale evolution from land-dwelling ancestors to fully aquatic mammals.

Comparative anatomy reveals structural homologies (like the shared bone pattern in vertebrate limbs), demonstrates adaptive radiations (like Darwin's finches diversifying into different beak shapes), and identifies vestigial structures (like the human appendix or pelvic bones in whales).

These lines of evidence reinforce each other. Molecular data frequently corroborates phylogenies originally built from fossils, and anatomical features support relationships identified through genetics. When discrepancies arise, scientists use integrative approaches that combine molecular and morphological data to resolve them. Mammal phylogeny, for example, was significantly revised when molecular data revealed relationships that anatomy alone had gotten wrong.

Mutations in molecular evolution

Mutations are the raw material of evolution. Without them, there would be no genetic variation for natural selection to act on.

Types of mutations:

Point mutations: substitutions (one nucleotide swapped for another), insertions, or deletions of single nucleotides
Chromosomal mutations: larger-scale changes like inversions, translocations, and duplications of chromosome segments

Mutation is one of three main sources of genetic variation in populations, alongside recombination (shuffling of alleles during meiosis) and gene flow (migration of alleles between populations).

Natural selection operates at the molecular level in three ways:

Positive selection favors beneficial mutations and spreads them through a population. Lactase persistence in human populations with a history of dairy farming is a well-known example.
Negative (purifying) selection removes harmful mutations, like those causing lethal developmental defects.
Neutral evolution describes mutations that have no effect on fitness and drift randomly in frequency through genetic drift. The neutral theory of molecular evolution, proposed by Motoo Kimura, argues that most molecular-level changes are neutral rather than adaptive.

Molecular adaptation can occur through changes that alter protein function or gene regulation. Antibiotic resistance in bacteria often involves mutations that change the shape of a protein targeted by the drug. Gene duplication followed by neofunctionalization is another important mechanism: one copy keeps doing the original job while the duplicate is free to evolve a new function. The large family of olfactory receptor genes in mammals evolved this way.

Evolutionary rates vary across the genome. Immune system genes evolve rapidly because they're in a constant arms race with pathogens, while ribosomal genes are highly conserved because ribosomes must function precisely. Factors like generation time and population size also influence how fast sequences evolve.

Even synonymous mutations (nucleotide changes that don't alter the amino acid) aren't always truly neutral. In organisms like E. coli, certain codons are translated more efficiently than others, leading to codon bias, a pattern where natural selection favors particular synonymous codons to optimize translation speed.

2,589 studying →