Microbial taxonomy is the system scientists use to organize microorganisms into groups based on shared traits. Without it, there would be no standardized way to name, study, or communicate about the millions of microbial species on Earth. This section covers the classification hierarchy, the characteristics used to sort microbes, and how modern genetic tools have transformed the way we understand microbial relationships.
Microbial Taxonomy and Classification
Classification of microorganisms
The taxonomic hierarchy organizes microorganisms into increasingly specific groups based on shared characteristics. From broadest to most specific, the levels are:
Domain → Kingdom → Phylum → Class → Order → Family → Genus → Species
A common mnemonic: Dear King Philip Came Over For Good Spaghetti.
Once an organism is classified, it receives a formal name using binomial nomenclature, a two-part naming system consisting of the genus and species. The genus is capitalized, the species is lowercase, and the whole name is italicized (e.g., Escherichia coli). This system gives every organism a unique, standardized name that researchers worldwide can use.
To place a microorganism within this hierarchy, scientists evaluate several types of characteristics:
- Morphology: Cell shape (cocci are spherical, bacilli are rod-shaped, spirilla are spiral), cell arrangement (pairs, chains, clusters), and the presence or absence of structures like flagella, capsules, or endospores
- Staining properties: The Gram stain reaction divides bacteria into Gram-positive (thick peptidoglycan layer, stains purple) and Gram-negative (thin peptidoglycan, outer membrane, stains pink). The acid-fast stain identifies organisms like Mycobacterium that have waxy cell walls resistant to standard staining.
- Biochemical and metabolic properties: Oxygen requirements (obligate aerobe, obligate anaerobe, facultative anaerobe), nutrient utilization, enzyme production, and fermentation end products all help differentiate species that may look identical under a microscope
- Genetic composition: DNA base composition (G+C content), DNA-DNA hybridization, and 16S rRNA gene sequencing provide the highest-resolution classification data
Approaches to microbial taxonomy
Historical approaches relied on what scientists could directly observe or test in the lab:
- Microscopic observations of cell morphology and colony appearance on agar plates
- Biochemical tests measuring nutrient utilization, enzyme activity, and fermentation products
These methods work well for initial identification, but they have limits. Two organisms can look identical under a microscope yet be genetically distinct, or they can appear different simply because they're growing under different conditions.
Modern approaches use genetic methods that reveal deeper evolutionary relationships:
- G+C content: The ratio of guanine + cytosine to total bases in an organism's DNA. Similar G+C content suggests (but doesn't prove) relatedness.
- DNA-DNA hybridization: Measures genetic similarity by seeing how well single-stranded DNA from two organisms binds together. Two organisms are generally considered the same species if they show ≥70% hybridization.
- 16S rRNA gene sequencing: The 16S rRNA gene is present in all bacteria and archaea and contains both highly conserved regions (useful for universal primers) and variable regions (useful for distinguishing species). Two organisms with ≥97% sequence similarity in this gene are typically considered the same species. This technique has become the gold standard for determining phylogenetic relationships.
Polyphasic taxonomy integrates all of these data types together. Rather than relying on any single method, it combines morphological, biochemical, and genetic evidence to produce a more accurate and comprehensive classification. This is the current best practice.
Interpretation of phylogenetic trees
A phylogenetic tree is a diagram that represents the evolutionary relationships among organisms. Reading these trees correctly is an important skill.
The key components:
- Nodes: Points where branches split, representing a common ancestor of the groups that diverge from that point
- Branches: Lines connecting nodes, representing evolutionary lineages
- Branch length: In many trees, longer branches indicate greater genetic divergence (more evolutionary change)
There are two main types of phylogenetic trees:
Rooted trees have a defined starting point (the root) that represents the most recent common ancestor of all organisms on the tree. An outgroup, a distantly related organism, is used to determine where the root goes and which direction evolution proceeded.
Unrooted trees show the relationships among organisms without specifying a common ancestor or direction of evolution. They can be converted to rooted trees once an appropriate outgroup is identified.
When interpreting a tree, remember that organisms on neighboring branches are not necessarily most closely related. What matters is how recently they share a common ancestor (their most recent shared node). Closely related organisms share a more recent node; distantly related organisms diverge further back.
Phylogenetic trees are used to understand evolutionary history, identify novel or uncultured microorganisms, design targeted PCR primers, and develop molecular diagnostic tests.
Modern Classification Methods
Several key concepts underpin how modern classification works:
- Phenotype refers to an organism's observable characteristics, such as shape, colony color, or growth rate. Phenotype is influenced by both genetics and environmental conditions, which means the same organism can look different depending on how it's grown.
- Genotype is the organism's actual genetic makeup. It determines the potential traits an organism can express, even if environmental conditions prevent some of those traits from showing up.
- Phylogeny is the study of evolutionary relationships among organisms. Phylogenetic trees are the primary tool for visualizing these relationships.
- Cladistics is a classification method that groups organisms based on shared derived characteristics, meaning traits that evolved in a common ancestor and were inherited by its descendants. Only shared derived traits (not ancestral ones) are used to define groups.
- Molecular systematics uses genetic and molecular data (DNA sequences, protein sequences, RNA structure) to determine evolutionary relationships. It has largely replaced older classification methods that relied solely on physical traits, because genetic data is less influenced by environmental variation.