Fiveable

🙈Evolutionary Biology Unit 8 Review

QR code for Evolutionary Biology practice questions

8.3 Measuring genetic variation in populations

8.3 Measuring genetic variation in populations

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🙈Evolutionary Biology
Unit & Topic Study Guides

Understanding Genetic Variation

Importance of genetic variation

Genetic variation refers to DNA sequence differences among individuals within a population. These differences arise from mutations, recombination, and gene flow. This variation is the raw material that natural selection acts on, allowing populations to adapt to changing environments like shifting climates or new pathogens. Without it, a population has no ability to evolve.

Genetic variation can be measured at multiple levels:

  • Individual level: Heterozygosity measures how much genetic diversity exists within a single organism (how many loci carry two different alleles).
  • Population level: Allele frequencies describe the genetic makeup of a group. Shifts in these frequencies over time are the definition of evolution in population genetics.
  • Species level: Comparing genetic diversity between populations reveals patterns of evolutionary history, migration, and isolation.

A population's genetic variation directly shapes its fitness and long-term survival potential. Populations with low variation are more vulnerable to disease outbreaks, environmental shifts, and inbreeding depression.

Importance of genetic variation, Population Genetics | Boundless Biology

Methods for measuring variation

Scientists use several laboratory techniques to detect and quantify genetic variation. Each method has different strengths depending on the question being asked.

  • Allozyme electrophoresis separates protein variants by size and charge, detecting variation in enzyme-coding loci. This was one of the earliest methods for measuring variation and helped establish that natural populations carry far more genetic diversity than previously assumed.
  • DNA sequencing determines the exact nucleotide sequence of a stretch of DNA, providing the most comprehensive genetic information. Large-scale projects like the Human Genome Project rely on this approach.
  • Microsatellite analysis examines short tandem repeat sequences (e.g., CACACA repeated 5–20 times). These repeats mutate quickly, making them useful for fine-scale questions like population structure, parentage analysis, and wildlife forensics.
  • Restriction Fragment Length Polymorphism (RFLP) uses restriction enzymes to cut DNA at specific recognition sites. Differences in fragment lengths between individuals reveal underlying sequence variation. This technique was foundational for early genetic fingerprinting.
  • Single Nucleotide Polymorphism (SNP) genotyping identifies single base-pair differences across the genome. SNPs are the most common type of genetic variation in most organisms and are widely used in personalized medicine and genome-wide association studies.
Importance of genetic variation, Sources of Genetic Variation | Boundless Anatomy and Physiology

Analyzing Genetic Variation Data

Calculations of genetic diversity

Several metrics quantify how much genetic variation a population carries. Each captures a slightly different dimension of diversity.

Heterozygosity measures the proportion of heterozygous individuals in a population. Comparing observed to expected values is a direct way to test whether a population is in Hardy-Weinberg equilibrium.

  • Observed heterozygosity is simply the fraction of individuals that are heterozygous at a given locus:

Ho=number of heterozygotestotal number of individualsH_o = \frac{\text{number of heterozygotes}}{\text{total number of individuals}}

  • Expected heterozygosity is what you'd predict under Hardy-Weinberg assumptions, calculated from allele frequencies:

He=1i=1kpi2H_e = 1 - \sum_{i=1}^{k} p_i^2

where pip_i is the frequency of the iith allele and kk is the total number of alleles at that locus. If HoH_o is much lower than HeH_e, something like inbreeding or population subdivision may be occurring.

Proportion of polymorphic loci tells you what fraction of loci in a population carry more than one allele:

P=number of polymorphic locitotal number of lociP = \frac{\text{number of polymorphic loci}}{\text{total number of loci}}

A locus is typically considered polymorphic if the most common allele has a frequency below 0.95 (or below 0.99, depending on the threshold used). Higher PP values indicate greater overall genetic variation.

Nucleotide diversity (π\pi) measures the average number of nucleotide differences per site between pairs of DNA sequences in a population:

π=i<jπijn(n1)/2\pi = \frac{\sum_{i<j} \pi_{ij}}{n(n-1)/2}

where πij\pi_{ij} is the proportion of nucleotide sites that differ between sequences ii and jj, and nn is the number of sequences sampled. This metric is especially useful for comparing diversity across species or genomic regions.

Fixation index (FSTF_{ST}) quantifies genetic differentiation between populations. It ranges from 0 to 1, where 0 means populations are genetically identical (complete gene flow) and 1 means they share no alleles (complete isolation). Values above roughly 0.25 are generally considered high differentiation.

Applications of variation data

Measuring genetic variation isn't just an academic exercise. These data have direct, practical uses across biology.

  • Conservation biology: Assessing genetic diversity in endangered species like the California condor helps identify populations at risk of inbreeding depression. Breeding programs use this data to maximize genetic variation and long-term viability.
  • Evolutionary studies: Genetic variation data allow researchers to reconstruct phylogenetic relationships, estimate divergence times between lineages, and detect signatures of natural selection (e.g., selective sweeps that reduce local diversity).
  • Human genetics: Population-level variation data have been central to tracing human migration patterns, supporting models like the Out of Africa hypothesis. SNP data also help identify disease-associated genetic variants through genome-wide association studies.
  • Agriculture and animal breeding: Marker-assisted selection uses known genetic variants to breed for desirable traits, such as drought resistance in crops, more efficiently than traditional methods.
  • Environmental monitoring: Tracking changes in genetic diversity over time can reveal the impacts of pollution, habitat fragmentation, or climate change on populations. Coral reef populations, for example, have been monitored this way to assess ecosystem health.