Definition of stochastic differential equations
Stochastic differential equations (SDEs) extend ordinary differential equations to systems driven by randomness. They combine a deterministic component describing average behavior with a stochastic component capturing random fluctuations, typically driven by a Wiener process (Brownian motion).
The general form of an SDE is:
- is the drift coefficient, governing the deterministic trend of the process
- is the diffusion coefficient, controlling the intensity of random fluctuations
- is a Wiener process, a continuous-time Gaussian process with independent increments and
The drift term pulls the process in a predictable direction, while the diffusion term adds noise scaled by . Understanding how these two terms interact is central to everything that follows.
Solutions of stochastic differential equations
Existence of solutions
Existence theorems specify when an SDE actually has a solution. The standard sufficient conditions are:
- Lipschitz continuity of the coefficients: there exists a constant such that for all
- Linear growth condition:
These conditions prevent the coefficients from blowing up or oscillating too wildly. Existence results can be global (solution defined for all ) or local (defined only up to some stopping time).
Uniqueness of solutions
Under the same Lipschitz condition, uniqueness holds in two senses:
- Pathwise (strong) uniqueness: any two solutions built on the same Wiener process agree almost surely for all
- Uniqueness in distribution (weak uniqueness): any two solutions share the same probability law, even if constructed on different probability spaces
Strong uniqueness implies weak uniqueness, but not the other way around.
Explicit solutions vs numerical methods
Closed-form solutions exist only in special cases. The most important example is the linear SDE with constant coefficients, which leads to geometric Brownian motion. For the vast majority of SDEs, you'll need numerical approximation. The general workflow is:
- Discretize the time interval into steps of size
- Simulate increments of the Wiener process
- Update the approximation step by step using a scheme like Euler-Maruyama or Milstein
Itô integral
Definition of Itô integral
The Itô integral extends classical integration to allow integration with respect to a Wiener process. For an adapted process , it's defined as:
The crucial feature: the integrand is evaluated at the left endpoint of each subinterval. This left-endpoint choice is what makes the Itô integral non-anticipating (it only uses information available at the current time).
The Itô integral is a martingale with zero mean: .
Properties of Itô integral
- Linearity:
- Adaptedness: the integrand must be adapted to the filtration generated by (no peeking into the future)
- Continuity: the integral is a continuous function of the upper limit
- Quadratic variation: the quadratic variation of equals , which is generally nonzero
Note: The Itô integral itself has nonzero quadratic variation. This is a key distinction from ordinary integrals and is precisely what gives rise to the extra term in Itô's lemma.
Itô isometry
The Itô isometry connects the second moment of a stochastic integral to a deterministic integral:
This identity is indispensable for computing variances and proving convergence results. It essentially lets you move between "stochastic world" and "deterministic world" when working with second moments.
Itô's lemma
Statement of Itô's lemma
Itô's lemma is the stochastic chain rule. If satisfies and is twice continuously differentiable, then:
The term has no analogue in ordinary calculus. It arises because the Wiener process has nonzero quadratic variation: . This is the single most important thing to remember about Itô's lemma.

Applications of Itô's lemma
- Deriving SDE dynamics for transformed processes: if you know and want or , Itô's lemma gives the answer directly
- Financial mathematics: the Black-Scholes PDE is derived by applying Itô's lemma to a portfolio of options and stock
- Computing moments and distributions of stochastic processes
- Stochastic optimal control: Itô's lemma underpins the Hamilton-Jacobi-Bellman equation
Generalized Itô's lemma
For a function of multiple Itô processes, the generalized formula includes:
- Partial derivatives with respect to each process and time
- Cross-variation terms , which account for correlations between driving Wiener processes
This is essential for multi-dimensional SDEs and models with multiple sources of uncertainty (e.g., multi-asset option pricing).
Stochastic exponential
Definition of stochastic exponential
The stochastic exponential (Doléans-Dade exponential) of a semimartingale is the unique solution to:
For a continuous process , the explicit form is:
where is the quadratic variation of . The subtraction of is a direct consequence of Itô's lemma applied to the exponential function.
Properties of stochastic exponential
- Strictly positive: for all , since it's an exponential
- Multiplicative:
- Inverse: is itself a stochastic exponential
- Local martingale: is always a local martingale; it's a true martingale if Novikov's condition holds
- Reduces to the ordinary exponential when the stochastic component vanishes
Relationship to geometric Brownian motion
Geometric Brownian motion (GBM) is the most common SDE in finance:
Its solution is the stochastic exponential:
The correction ensures , which you can verify using Itô's lemma. GBM stays strictly positive, making it suitable for asset price modeling. The Black-Scholes option pricing model is built on this foundation.
Stratonovich integral
Definition of Stratonovich integral
The Stratonovich integral uses a midpoint rule instead of the left-endpoint rule:
The circle notation distinguishes it from the Itô integral. Because of the midpoint evaluation, the Stratonovich integral obeys the ordinary chain rule without the extra second-derivative correction term.
Comparison to Itô integral
| Feature | Itô | Stratonovich |
|---|---|---|
| Evaluation point | Left endpoint | Midpoint |
| Chain rule | Modified (extra term) | Ordinary chain rule |
| Martingale property | Yes (integral is a martingale) | Not in general |
| Non-anticipating | Yes | No (uses future values in construction) |
| The Stratonovich integral is not non-anticipating, which makes it less natural for filtering and prediction problems. However, it's often preferred in physics because physical systems modeled as limits of smooth noise naturally yield Stratonovich SDEs. |
Stratonovich calculus
To convert between the two frameworks, use the Itô-Stratonovich correction:
This means any Stratonovich SDE can be rewritten as an Itô SDE (and vice versa) by adding or subtracting the correction term. The choice between the two depends on your application: Itô is standard in finance and probability theory; Stratonovich is common in physics and engineering.

Linear stochastic differential equations
Homogeneous linear equations
A homogeneous linear SDE has the form:
The solution is given by the stochastic exponential:
This generalizes the deterministic exponential solution to the stochastic setting. These equations model phenomena like population dynamics with random growth rates.
Inhomogeneous linear equations
Inhomogeneous linear SDEs add an external forcing term:
The solution combines a homogeneous solution with a particular solution found via variation of parameters.
Variation of parameters formula
This is the stochastic analogue of the classical ODE technique:
- Solve the corresponding homogeneous equation to get the fundamental solution
- Write the particular solution as
- The full solution is the sum of the homogeneous solution (from initial conditions) and this particular integral
The formula reduces the inhomogeneous problem to computing stochastic integrals, which can then be handled analytically or numerically.
Numerical methods for SDEs
Euler-Maruyama method
The simplest and most widely used scheme. Given :
where .
- Strong convergence order: 0.5 (error in individual paths scales as )
- Weak convergence order: 1.0 (error in expectations scales as )
Strong convergence matters when you care about pathwise accuracy. Weak convergence matters when you only need accurate expectations (e.g., option pricing).
Milstein method
The Milstein method adds a correction term to Euler-Maruyama:
This achieves strong convergence order 1.0, a significant improvement. The extra term involves , the derivative of the diffusion coefficient. In multiple dimensions, implementing Milstein requires simulating iterated stochastic integrals (Lévy areas), which adds complexity.
Runge-Kutta methods for SDEs
Stochastic Runge-Kutta methods adapt the classical ODE approach by using multiple evaluations of and within each time step. Higher-order methods (e.g., order 1.5) provide better accuracy but at greater computational cost. They're most useful when high precision is needed and the SDE has smooth coefficients.
Applications of stochastic differential equations
Financial mathematics
- The Black-Scholes model uses GBM () to price European options
- Stochastic volatility models like the Heston model couple an SDE for the asset price with a separate SDE for the variance:
- Interest rate models (Vasicek, Cox-Ingersoll-Ross) use mean-reverting SDEs to capture the dynamics of short rates
Physics and engineering
- The Langevin equation describes Brownian particle motion in a fluid, balancing friction against thermal noise
- In control engineering, SDEs model systems with stochastic disturbances and form the basis for robust controller design
- Stochastic PDEs (extensions of SDEs to spatial domains) appear in turbulence modeling and quantum field theory
Biology and ecology
- Stochastic Lotka-Volterra equations add noise to predator-prey dynamics, capturing environmental variability and demographic stochasticity
- Epidemic models (stochastic SIR) use SDEs to account for randomness in disease transmission
- In neuroscience, SDEs model the stochastic firing patterns of neurons, where membrane potential fluctuates due to random synaptic inputs