Definition of Itô integral
The Itô integral extends the idea of integration to stochastic processes, specifically allowing you to integrate with respect to Brownian motion. Classical (Riemann or Lebesgue) integrals can't handle this because Brownian motion paths are nowhere differentiable and have infinite variation. The Itô integral solves this problem and provides the foundation for all of stochastic calculus.
Itô integral vs Riemann integral
Three key differences separate the Itô integral from the ordinary Riemann integral:
- Random integrands and integrators. The Itô integral is built for stochastic processes with random fluctuations, not smooth deterministic functions.
- Quadratic variation matters. Brownian motion has nonzero quadratic variation (), which means the accumulated "roughness" of the path contributes to the integral. Riemann integration never encounters this because smooth functions have zero quadratic variation.
- Different limiting procedure. The Riemann integral is a limit of sums where you evaluate the integrand at any point in each subinterval. The Itô integral specifically evaluates the integrand at the left endpoint of each subinterval. This left-endpoint (non-anticipating) choice is what makes the integral adapted to the filtration and gives it the martingale property. Choosing the midpoint instead leads to the Stratonovich integral, which has different properties.
Itô integral for simple processes
The construction starts with simple (step) processes, which are piecewise constant and adapted to the filtration. A simple process has the form:
where each is -measurable (known at time ). For such a process, the Itô integral with respect to Brownian motion is defined as:
Notice that is evaluated at the left endpoint of each interval. Because the increments are independent of , this construction guarantees two things:
- The integral has zero mean: .
- The integral is a martingale with respect to the Brownian filtration.
Itô isometry
The Itô isometry is the key tool for extending the integral beyond simple processes. For a square-integrable, adapted process :
This says the -norm of the stochastic integral equals the -norm of the integrand computed with ordinary (Lebesgue) integration. The isometry is what lets you control the "size" of the stochastic integral using deterministic-style estimates, and it's the engine behind the extension to general integrands.
Extension to square-integrable processes
With the Itô isometry in hand, you extend the integral to all adapted, square-integrable processes, meaning processes satisfying:
The extension works in three steps:
- Approximate. Given a general square-integrable adapted process , construct a sequence of simple processes that converge to in the sense.
- Integrate the approximations. Each is already well-defined by the simple-process definition.
- Take the limit. The Itô isometry guarantees that these integrals form a Cauchy sequence in , so they converge to a unique limit. That limit is defined to be .
The result is independent of which approximating sequence you choose, so the integral is well-defined.
Properties of Itô integral
The Itô integral inherits several structural properties from its construction. These properties are not just theoretical niceties; you'll use them constantly when manipulating SDEs and applying Itô's lemma.
Linearity
For adapted, square-integrable processes and and constants :
This follows directly from the linearity of the sum in the simple-process definition and carries through the extension.
Continuity
The Itô integral is continuous in its integrand in the sense. If in , meaning:
then:
This is a direct consequence of the Itô isometry. It also turns out that the sample paths are almost surely continuous, which is important for the theory of SDEs.
Martingale property
If is adapted and square-integrable, then the process:
is a martingale with respect to the Brownian filtration . Concretely, this means:
- for all
- for all
The second condition says that the best prediction of the future value of the integral, given all information up to time , is just its current value. This is a direct consequence of evaluating the integrand at the left endpoint. The martingale property also implies for all , which you'll use repeatedly.
Itô processes
An Itô process combines a deterministic drift with a stochastic diffusion driven by Brownian motion. These processes are the central objects you'll work with in stochastic calculus.
Definition and examples
An Itô process satisfies a stochastic differential equation (SDE) of the form:
The term is the drift coefficient (the deterministic trend), and is the diffusion coefficient (the intensity of random fluctuations). In integral form, this reads:
where the first integral is an ordinary Lebesgue integral and the second is an Itô integral.
Three important examples:
- Geometric Brownian motion (GBM): . Both drift and diffusion scale with the current value, so the process stays positive. This is the standard model for stock prices in the Black-Scholes framework.
- Ornstein-Uhlenbeck (OU) process: . The drift pulls back toward the long-run mean at rate , making it mean-reverting. Used for interest rates and physical systems with a restoring force.
- Cox-Ingersoll-Ross (CIR) process: . Like OU but with diffusion proportional to , which prevents the process from going negative (under appropriate parameter conditions). Widely used for interest rate modeling.
Quadratic variation of Itô processes
The quadratic variation of an Itô process captures the cumulative "roughness" contributed by the diffusion term. For :
Only the diffusion coefficient contributes. The drift term has zero quadratic variation because it behaves like a smooth function. This is why the extra second-derivative term appears in Itô's lemma: the nonzero quadratic variation of Brownian motion (, or informally ) generates a correction that doesn't exist in ordinary calculus.
The informal multiplication rules that follow from this are:
You'll use these constantly when applying Itô's lemma.
Stochastic differential equations
SDEs of the form describe the evolution of systems with randomness. Existence and uniqueness of solutions typically require:
- Lipschitz continuity of and in (uniformly in )
- Linear growth bounds on and
Under these conditions, a unique strong solution exists for any initial condition with finite second moment. Solving SDEs analytically is only possible in special cases (GBM, OU, etc.). In general, you rely on Itô's lemma for transformations or numerical methods like the Euler-Maruyama scheme.

Itô's lemma
Itô's lemma is the stochastic version of the chain rule. It tells you how to compute the differential of a smooth function applied to an Itô process. The critical difference from ordinary calculus is an extra second-derivative term arising from the quadratic variation of Brownian motion.
Statement of Itô's lemma
Let satisfy , and let be twice continuously differentiable in and once in (i.e., ). Then is also an Itô process with:
Expanding using the multiplication rules (, all other products vanish), this becomes:
where all partial derivatives are evaluated at .
Comparison with deterministic chain rule
In ordinary calculus, for a smooth function :
Itô's lemma has the same two terms, plus the correction:
This term exists because Brownian motion has nonzero quadratic variation. In the deterministic case, and the correction vanishes, recovering the ordinary chain rule. The correction is sometimes called the Itô correction and is the single most important thing to remember about stochastic calculus.
Applying Itô's lemma: step-by-step
Here's how to apply Itô's lemma in practice:
- Identify the Itô process. Write down and read off and .
- Identify the function. Determine such that your target process is .
- Compute partial derivatives. Calculate , , and .
- Substitute into the formula. Plug everything into:
- Simplify. Collect terms and identify the new drift and diffusion coefficients of .
Classic example: Solve GBM by applying Itô's lemma to . You get , , so:
This shows is a Brownian motion with drift, giving the explicit solution .
Applications of Itô's lemma
- Black-Scholes equation. Apply Itô's lemma to an option price where follows GBM. Combined with a hedging argument, this yields the Black-Scholes PDE.
- Deriving moment equations. Apply Itô's lemma to (or higher powers) to derive ODEs for the moments of an SDE solution.
- Change of variables for SDEs. Transform a complicated SDE into a simpler one. The GBM-to-log transform above is the prototypical example.
- Physics and engineering. Analyze dynamics of particles subject to thermal noise (Langevin equations), study diffusion processes, and derive Fokker-Planck equations for probability densities.
Stochastic calculus
Beyond the Itô integral and Itô's lemma, stochastic calculus includes several additional tools that round out the theory and enable more advanced applications.
Stochastic integration by parts
For two Itô processes and , the product rule takes the form:
In integral form:
The quadratic covariation is defined as:
where and are the diffusion coefficients of and (assuming they're driven by the same Brownian motion). The extra term is the product-rule analogue of the Itô correction in Itô's lemma. In deterministic calculus, ; here you get an additional cross-variation term.
Integration with respect to martingales
The Itô integral generalizes beyond Brownian motion to integration with respect to continuous square-integrable martingales. A martingale satisfies:
- for all
- for
The construction mirrors the Brownian case, with the quadratic variation replacing . The isometry becomes:
A central result here is the martingale representation theorem: every square-integrable martingale adapted to the Brownian filtration can be written as a stochastic integral with respect to . This is the theoretical backbone of hedging arguments in finance.
Girsanov's theorem
Girsanov's theorem lets you change the probability measure so that an Itô process with drift becomes a martingale (or equivalently, a Brownian motion) under the new measure.
Suppose under the original measure , where is an adapted process satisfying the Novikov condition:
Define the Radon-Nikodým derivative (also called the exponential martingale):
Then under the new measure defined by , the process is a standard Brownian motion.
In finance, Girsanov's theorem is how you move from the "real-world" measure to the risk-neutral measure , under which discounted asset prices are martingales. This simplifies derivative pricing to computing expected values under .
Applications of Itô calculus
Financial mathematics and the Black-Scholes model
The Black-Scholes model assumes the stock price follows GBM:
To price a European option with payoff :
- Apply Itô's lemma to to get .
- Construct a self-financing portfolio that hedges the option (the "delta hedge").
- Since the hedged portfolio is riskless, it must earn the risk-free rate , yielding the Black-Scholes PDE:
- Solve with the appropriate boundary condition to get the Black-Scholes formula.
Alternatively, use Girsanov's theorem to switch to the risk-neutral measure and compute .
Stochastic differential equations in physics
SDEs appear throughout physics whenever thermal or quantum noise is present:
- Langevin equation: , describing a particle subject to friction and random thermal kicks. Itô calculus lets you compute velocity distributions and diffusion coefficients.
- Fokker-Planck equation: Given an SDE for , Itô's lemma (applied to test functions) yields a PDE for the probability density , connecting the stochastic and PDE perspectives.
- Stochastic Schrödinger equations: Model quantum systems coupled to noisy environments, with Itô calculus providing the rigorous framework for their analysis.
Filtering theory and stochastic control
Filtering is the problem of estimating a hidden state from noisy observations. The observation process is typically modeled as an Itô process, and the optimal filter satisfies a stochastic PDE:
- For linear Gaussian systems, this reduces to the Kalman-Bucy filter, a set of ODEs for the conditional mean and covariance.
- For nonlinear systems, the Kushner-Stratonovich equation gives the evolution of the conditional distribution.
Stochastic control seeks optimal decisions in the presence of randomness. The value function satisfies the Hamilton-Jacobi-Bellman (HJB) equation, a nonlinear PDE derived using Itô's lemma and the dynamic programming principle. Applications range from portfolio optimization in finance to robotic path planning and resource management.