Chemical Structure and Function of DNA
Components of DNA nucleotides
DNA (deoxyribonucleic acid) is built from repeating subunits called nucleotides. Each nucleotide has three parts:
- Deoxyribose sugar: a five-carbon (pentose) sugar arranged in a ring. The carbons are numbered 1' through 5', and these numbers matter for understanding how the strand is oriented.
- Phosphate group: negatively charged, which is why DNA is acidic. It attaches to the 5' carbon of the sugar.
- Nitrogenous base: attached to the 1' carbon of the deoxyribose. This is the part that carries genetic information.
The four nitrogenous bases fall into two categories based on their ring structure:
- Purines (double-ringed): Adenine (A) and Guanine (G)
- Pyrimidines (single-ringed): Thymine (T) and Cytosine (C)
A quick way to remember: purines are the bigger molecules (two rings), pyrimidines are smaller (one ring). A purine always pairs with a pyrimidine, which keeps the width of the double helix consistent.
Nucleotides are linked together by phosphodiester bonds. These bonds connect the phosphate group on the 5' carbon of one nucleotide to the 3' hydroxyl group on the deoxyribose of the next nucleotide. The repeating sugar-phosphate chain forms the backbone of each DNA strand, providing structural support. The nitrogenous bases project inward from the backbone, forming the "rungs" of the double helix.

Base pairing rules in DNA
In double-stranded DNA, the bases on opposite strands pair according to strict rules:
- Adenine (A) pairs with Thymine (T) through two hydrogen bonds
- Guanine (G) pairs with Cytosine (C) through three hydrogen bonds
Because G-C pairs have three hydrogen bonds versus two for A-T pairs, regions of DNA with a higher G-C content are more thermally stable and harder to denature (separate).
This complementary base pairing is the foundation of the Watson-Crick model of DNA structure, and it has two major functional consequences:
- Replication accuracy: When DNA is copied, each strand serves as a template. Because A can only pair with T and G can only pair with C, the new strand is an exact complement of the original. This is how genetic information is faithfully passed on.
- Transcription: During transcription, the template strand of DNA is read to build a complementary RNA strand (mRNA). The same base pairing logic applies, except RNA uses uracil (U) in place of thymine.
The complementary relationship also means that if you know the sequence of one strand, you can always determine the sequence of the other.

Antiparallel structure of DNA
The two strands of the double helix don't run in the same direction. One strand runs 5' to 3', while the complementary strand runs 3' to 5'. This opposite orientation is called the antiparallel arrangement.
Why does directionality exist in the first place? It comes from the phosphodiester bonds. Each bond links the 5' phosphate of one nucleotide to the 3' hydroxyl of the next, so every strand has a distinct 5' end (free phosphate) and a 3' end (free hydroxyl group).
The antiparallel arrangement matters for several reasons:
- DNA polymerase only works in one direction. It adds nucleotides to the 3' end of a growing strand, synthesizing exclusively in the 5' → 3' direction. Because the two template strands point in opposite directions, replication proceeds differently on each: the leading strand is synthesized continuously, while the lagging strand is synthesized in short fragments (Okazaki fragments) that are later joined together.
- Optimal hydrogen bonding. The base pairs align most stably when the strands are antiparallel. This maximizes the strength of hydrogen bonds between complementary bases and contributes to the overall structural stability of the helix.
- Strand separation. Enzymes like helicase unwind the double helix during replication and transcription. The antiparallel orientation allows both strands to be accessed and read by the cellular machinery.
DNA Replication and Genetic Information
DNA replication follows the semiconservative model: after replication, each new double helix contains one original (parental) strand and one newly synthesized strand. This was demonstrated by the Meselson-Stahl experiment and ensures that genetic information is preserved across cell divisions.
The sequence of nucleotides along a DNA strand encodes the genetic code, a set of rules that determines how nucleotide triplets (codons) are translated into specific amino acids during protein synthesis. The flow of information from DNA → RNA → protein is often called the central dogma of molecular biology.