Definition of birth-death processes
A birth-death process is a continuous-time Markov chain where the state can only change by (a "birth") or (a "death") at any given transition. The rates at which these transitions happen are governed by state-dependent birth rates and death rates . These processes show up constantly in population dynamics, queueing theory, and epidemiology because many real systems evolve through incremental additions and removals.
Discrete-time vs continuous-time
Birth-death models exist in both discrete and continuous time. In discrete time, transitions happen at fixed intervals (every day, every generation, etc.). In continuous time, transitions can occur at any moment, and the waiting time in each state follows an exponential distribution. This unit focuses on the continuous-time version, where the exponential holding times give the process its memoryless property.
State space and transitions
The state space is typically , representing a count of something: individuals in a population, customers in a queue, infected people in an epidemic. The defining structural constraint is that transitions only occur between adjacent states:
- From state to state at rate (birth)
- From state to state at rate (death)
No jumps of size 2 or larger are allowed. This gives the generator matrix a tridiagonal structure, which is what makes many birth-death calculations tractable.
Birth and death rates
- Birth rates govern how quickly the system moves upward from state . Think of new customers arriving or new infections occurring.
- Death rates govern how quickly the system moves downward. Think of service completions or recoveries.
These rates can be constant (the same for every state) or state-dependent (varying with ). State-dependent rates are common: for example, in a population model, the total birth rate might scale linearly with population size, so .
Transition probabilities
Transition probabilities describe the likelihood of being in state after time , given the process started in state . Computing these is central to understanding both the short-term dynamics and long-run behavior of the chain.
Chapman-Kolmogorov equations
The Chapman-Kolmogorov equations express the idea that you can break any time interval into two pieces and sum over all intermediate states:
This is just the law of total probability applied to the Markov property. In matrix form, it says , which means the transition probability matrices form a semigroup. These equations are the foundation for deriving the Kolmogorov forward and backward differential equations, which give you the actual ODEs for computing .
Transition rate matrix (generator matrix)
For continuous-time chains, the key object is the infinitesimal generator matrix , not a one-step transition probability matrix. For a birth-death process, has the tridiagonal form:
- Off-diagonal entries: and
- Diagonal entries:
- All other entries are zero
The embedded discrete-time chain has transition probabilities and , which describe where the chain goes next (ignoring when).
Stationary distribution
The stationary distribution is the long-run probability of being in each state. For a continuous-time chain, you find it by solving:
For birth-death processes, the tridiagonal structure of means you can use detailed balance (also called the local balance equations):
This gives the explicit solution:
where is determined by the normalization condition. A stationary distribution exists if and only if the chain is irreducible and the sum converges (positive recurrence).
Poisson processes
The Poisson process is the simplest birth-death process: set all birth rates to a constant and all death rates to zero. It models pure arrivals with no departures.
Relationship to birth-death processes
A Poisson process with rate is a birth-death chain on with for all and for all . The state only increases, so it counts the cumulative number of events. The number of events in an interval of length follows a Poisson distribution with mean .
Poisson process properties
Three properties characterize a Poisson process:
- Memoryless property: The time until the next event doesn't depend on how long you've already waited. This comes directly from the exponential distribution of inter-arrival times.
- Stationary increments: The distribution of the number of events in any interval depends only on the interval's length, not its location in time.
- Independent increments: Event counts in non-overlapping intervals are independent random variables.
Exponential inter-arrival times
The time between consecutive events follows an exponential distribution with rate :
- Expected inter-arrival time:
- Variance:
The memoryless property of the exponential distribution () is what makes the Poisson process Markovian.
Applications of birth-death processes
Population growth models
Birth-death processes model population dynamics where each individual independently gives birth at rate and dies at rate . With state-dependent rates and , you get a linear birth-death process. If , the population tends to grow exponentially; if , it tends toward extinction. More sophisticated models add carrying capacity through nonlinear rate functions, similar in spirit to the logistic growth model.
Queueing systems
Queueing theory is one of the richest application areas. The classic M/M/1 queue has Poisson arrivals at rate and exponential service at rate , giving a birth-death process with constant rates. The stationary distribution exists when the traffic intensity , and the expected number of customers in the system is . The M/M/c queue extends this to servers, where the death rate becomes in state .
Epidemiology and disease spread
In the stochastic SIS model, infected individuals recover at rate and infect susceptible individuals at a rate proportional to the number of infected. This creates a birth-death process on the number of infected individuals. The stochastic SIR model is similar but with permanent recovery. Birth-death formulations let you compute quantities like the probability of a major outbreak and the expected time to disease extinction, which deterministic ODE models cannot provide.
Limiting behavior
The limiting behavior describes what happens to the process as . Does it settle into an equilibrium? Does it drift to infinity? Does it get absorbed?
Transient vs recurrent states
- A state is recurrent if the process returns to it with probability 1. It's positive recurrent if the expected return time is finite.
- A state is transient if there's a positive probability of never returning.
For an irreducible birth-death process on , all states share the same classification. The chain is positive recurrent (and has a stationary distribution) if and only if .
Absorbing states and absorption probabilities
If state 0 has but , it's not absorbing. But if , then state 0 is absorbing: once the process reaches 0, it stays there forever. Absorption probabilities from state can be computed by solving a system of linear equations derived from first-step analysis on the embedded chain, or equivalently using the fundamental matrix of the transient portion of the chain.
Long-run distribution
For an irreducible, positive recurrent birth-death process, the long-run distribution equals the stationary distribution found by solving with . The detailed balance solution gives:
If the chain is not positive recurrent (the sum diverges), no stationary distribution exists, and the process is either null recurrent or transient.

First passage times
The first passage time is the time it takes the process to reach state for the first time, starting from state . These random variables capture the timescales of the system: how long until a queue empties, how long until a population reaches a threshold, etc.
Definition and properties
- is always non-negative.
- By the Markov property, the distribution of depends only on and , not on the prior history.
- For birth-death processes, reaching state from state requires passing through every intermediate state (since only transitions are allowed). This means can be decomposed as a sum of independent passage times through consecutive states.
Expected first passage times
For a birth-death process, the expected first passage time from state to state satisfies:
for , with boundary condition (you're already there). The term is the expected holding time in state , and the remaining terms account for which direction the chain jumps.
Because transitions are only to adjacent states, the expected time to go from state to state (one step up) can be computed directly, and longer passages are sums of these one-step times.
Variance of first passage times
The variance measures the spread around the expected passage time. It can be computed by setting up a second system of equations for using the same first-step conditioning approach, then applying . High variance indicates the passage time is unpredictable, which matters in applications like service guarantees in queueing.
Extinction probabilities
Extinction means the process reaches state 0 and stays there (when state 0 is absorbing). This is especially relevant in population models: will a small population die out, or will it survive?
Conditions for extinction
For a birth-death process on with state 0 absorbing, extinction from state is certain (probability 1) if and only if:
Intuitively, if death rates dominate birth rates sufficiently, the process will always drift down to 0. If birth rates are strong enough that this sum converges, there's a positive probability of escaping to infinity (survival).
Calculation of extinction probabilities
The extinction probability from state satisfies the recursion from first-step analysis on the embedded chain:
with boundary condition . This is a second-order linear difference equation. The solution is:
For the simple case where and (linear birth-death), the extinction probability from state is when , and 1 when .
Time to extinction
The expected time to extinction can be computed using the first passage time framework with state 0 as the target. For the linear birth-death process starting from state 1 with , the expected extinction time is . In general, these calculations involve solving the appropriate system of equations using the generator matrix.
Birth-death processes with immigration
Adding immigration means individuals enter the system from an external source at some rate, independent of the current state. This prevents the population from staying at zero and often guarantees the existence of a stationary distribution.
Definition and properties
The transition rates become:
- State : rate , where is the immigration rate
- State : rate (for )
The immigration rate acts as a constant upward push regardless of the current state. Because of this, state 0 is never absorbing (even if , the rate out of state 0 is at least ).
Stationary distribution with immigration
The stationary distribution is found from with , using the modified rates. Detailed balance gives:
so . The immigration term makes it more likely that this sum converges (and a stationary distribution exists), since the numerator grows but the death rates in the denominator can still dominate.
For example, the M/M/1 queue is a birth-death process with constant arrival rate (which you can think of as pure immigration with and ) and service rate . Its stationary distribution is geometric: where .
Applications in queueing theory
- M/M/1 queue with immigration: Models a single server where customers arrive from outside (Poisson arrivals at rate ) and are served at rate . The immigration framework applies when arrivals are purely external.
- M/M/ queue: Every arriving customer gets their own server immediately. With arrival rate and per-server service rate , the stationary distribution is Poisson with mean . This is a birth-death process with (immigration) and .
Branching processes
Branching processes model populations where each individual independently produces a random number of offspring. They're closely related to birth-death processes but allow for multiple offspring per reproduction event.
Relationship to birth-death processes
In a branching process, each individual in generation independently produces a random number of offspring according to some distribution . A birth-death process is the special case where the offspring distribution is concentrated on : each individual either dies (0 offspring) or is replaced by two (itself plus one new, effectively 1 net birth). The branching process framework is more general because it allows jumps larger than in a single generation.
Generating functions
The probability generating function (PGF) of the offspring distribution is:
This is the main analytical tool. If is the population size at generation , its PGF satisfies the recursion:
From the PGF you can extract moments: (the mean number of offspring per individual), and .
Extinction probabilities in branching processes
The extinction probability (starting from a single individual) is the smallest non-negative solution to:
The classification depends on the mean offspring number :
- Subcritical (): Extinction is certain (). The population shrinks on average each generation.
- Critical (): Extinction is still certain (), but it takes longer on average.
- Supercritical (): There's a positive probability of survival (). The extinction probability is the unique root of in .
Simulation of birth-death processes
When analytical solutions are hard to obtain (complex state-dependent rates, finite populations with boundaries, etc.), simulation provides a way to estimate stationary distributions, passage times, and extinction probabilities numerically.
Monte Carlo methods
Monte Carlo simulation for birth-death processes means generating many independent sample paths and using them to estimate quantities of interest. For example, to estimate the extinction probability from state 5, you simulate thousands of trajectories starting at state 5 and compute the fraction that reach state 0. The accuracy improves with more samples, scaling as where is the number of trajectories.
Gillespie algorithm
The Gillespie algorithm (also called the stochastic simulation algorithm) is the standard method for simulating continuous-time Markov chains exactly. For a birth-death process, it works as follows:
-
Initialize: Set the current state and time .
-
Compute the total rate: (the total rate of leaving state ).
-
Generate the holding time: Draw from an exponential distribution with rate . Update .
-
Determine the event type: With probability , a birth occurs (). With probability , a death occurs ().
-
Repeat from step 2 until a stopping condition is met (e.g., exceeds a time limit, or the process reaches an absorbing state).
This algorithm is exact: it produces sample paths with the correct statistical properties, unlike approximate time-discretization schemes. Each step requires only two random number draws (one for the waiting time, one for the event type), making it computationally efficient.