Definition of stochastic integrals
Stochastic integrals extend the concept of integration to stochastic processes, allowing you to integrate one random process with respect to another. In classical calculus, you integrate a function against a smooth, deterministic variable. Stochastic integrals do something analogous, but the integrator is a random process like Brownian motion, which behaves far less predictably.
This machinery is essential for modeling systems driven by noise: asset prices in finance, particle diffusion in physics, noisy signals in engineering. Without stochastic integrals, you can't rigorously write down or solve stochastic differential equations.
Intuition behind stochastic integration
The basic idea mirrors Riemann integration: approximate the integral by summing products of the integrand and small increments of the integrator. You partition the time interval, evaluate the integrand somewhere in each subinterval, multiply by the increment of the integrator (say, Brownian motion), and take a limit as the partition gets finer.
The core difficulty is that Brownian motion has unbounded variation and is nowhere differentiable. You can't treat like an ordinary differential. The paths are too rough for classical integration theory to apply, so you need a purpose-built framework to make the limit well-defined.
Formal definition
Stochastic integrals are defined as limits of Riemann-Stieltjes-type sums, where the integrand is a predictable process (adapted and left-continuous) and the integrator is a semimartingale.
The definition depends on the concept of quadratic variation, which measures accumulated squared increments. For Brownian motion , the quadratic variation over is simply .
A critical subtlety: where you evaluate the integrand within each subinterval changes the result. Evaluating at the left endpoint gives the Itô integral; evaluating at the midpoint gives the Stratonovich integral. Unlike in classical calculus, these two choices produce different answers because of the nonzero quadratic variation of the integrator.
Differences vs Riemann-Stieltjes integrals
- Integrator type: In Riemann-Stieltjes integrals, the integrator is a deterministic function of bounded variation. In stochastic integrals, it's a stochastic process with unbounded variation (like Brownian motion).
- Predictability requirement: Stochastic integrals require the integrand to be predictable, meaning its value at time depends only on information available just before . Riemann-Stieltjes integrals have no such constraint.
- Quadratic variation matters: The quadratic variation of the integrator directly affects the value of a stochastic integral and appears in key results like Itô's lemma. In the classical setting, quadratic variation is zero for functions of bounded variation, so it never enters the picture.
- Evaluation point matters: Choosing different evaluation points within subintervals yields the same limit for Riemann-Stieltjes integrals but different limits for stochastic integrals.
Stochastic integrals generalize Riemann-Stieltjes integrals to handle integrators that are too irregular for the classical theory.
Itô integrals
Itô integrals, introduced by Kiyoshi Itô, are the most widely used stochastic integrals, especially in mathematical finance. They use a non-anticipative (left-endpoint) evaluation rule, which means the integrand at each step only uses information available before the next increment of noise. This makes them natural for modeling situations where decisions are made without knowledge of future randomness.
Definition of Itô integrals
Given an adapted process satisfying and a Brownian motion , the Itô integral is:
where is a partition with mesh size going to zero. The limit is taken in the (mean-square) sense.
Notice that is evaluated at the left endpoint , not at or the midpoint. This is what makes it "non-anticipative." A key consequence: the Itô integral is a martingale with respect to the filtration generated by , provided the integrability condition holds.
Itô processes
An Itô process is a stochastic process that can be written as the sum of a deterministic drift integral and an Itô stochastic integral. Formally, is an Itô process if:
or equivalently in integral form:
Here is the drift coefficient (the deterministic trend) and is the diffusion coefficient (the intensity of random fluctuations). The classic example is geometric Brownian motion, used to model stock prices, where and are proportional to itself.
Itô's lemma
Itô's lemma is the stochastic calculus analog of the chain rule. It tells you how to compute the differential of a smooth function applied to an Itô process, and it's arguably the single most important tool in stochastic calculus.
Let satisfy , and let be twice continuously differentiable. Then:
The term is the Itô correction term. It has no analog in ordinary calculus and arises precisely because has nonzero quadratic variation (). This correction is what makes Itô calculus different from classical calculus and is the reason the Itô integral of with respect to is rather than .
Itô's lemma is the key to deriving the Black-Scholes PDE for option pricing.
Applications of Itô calculus
- Mathematical finance: Modeling asset prices (geometric Brownian motion), deriving option pricing formulas (Black-Scholes), interest rate models, and portfolio optimization.
- Physics: Describing Brownian motion of particles, diffusion processes, and Langevin dynamics in statistical mechanics.
- Engineering: Filtering and estimation in noisy systems (Kalman filter), stochastic control, and signal processing.
Stratonovich integrals
Stratonovich integrals, introduced by Ruslan Stratonovich, offer an alternative to Itô integrals. They evaluate the integrand at the midpoint of each subinterval, which changes the resulting value of the integral but preserves the classical chain rule.
Definition of Stratonovich integrals
Given a process and Brownian motion , the Stratonovich integral is defined as:
The notation distinguishes it from the Itô integral. Because the midpoint evaluation "peeks" at the future increment, the Stratonovich integral is not a martingale. This can be a disadvantage in probability-based arguments but is sometimes more natural in physical modeling.
Comparison vs Itô integrals
The two integrals are related by a correction term. When is itself a function of :
where is the quadratic covariation of and . For the specific case where for some smooth function , this becomes:
The correction term is exactly what compensates for the Itô correction in Itô's lemma. You can always convert between the two formulations, so the choice is one of convenience:
- Itô is preferred when martingale properties matter (finance, probability theory).
- Stratonovich is preferred when you want classical calculus rules to hold (physics, systems derived from smooth approximations of noise).
Stratonovich calculus
Stratonovich calculus preserves the ordinary chain rule, which makes it more intuitive if you're coming from a classical calculus background. However, the computations can become more involved because you lose the martingale property and the clean Itô isometry.

Chain rule for Stratonovich integrals
Let satisfy the Stratonovich SDE , and let be twice continuously differentiable. The chain rule is:
There is no second-order correction term. This looks exactly like the ordinary chain rule, which is the main appeal of the Stratonovich formulation.
Properties of stochastic integrals
Both Itô and Stratonovich integrals share some fundamental properties, though they differ in others. These properties are the workhorses you'll use repeatedly when manipulating stochastic integrals.
Linearity of integration
Stochastic integrals are linear. For adapted processes , and constants , :
This holds for both Itô and Stratonovich integrals and lets you break complex integrands into simpler pieces.
Isometry property
The Itô isometry connects the variance of a stochastic integral to a deterministic integral. For an adapted process :
This is extremely useful for computing second moments and proving convergence results. It says that the norm of the stochastic integral equals the norm of the integrand, treating the Itô integral as an "isometry" between function spaces.
Note: this property holds specifically for Itô integrals. Stratonovich integrals don't satisfy an isometry of this form because they aren't martingales.
Martingale property
If is adapted and satisfies , then the Itô integral is a martingale:
This means the expected future value of the integral, given current information, equals its current value. The martingale property is central to mathematical finance, where it underpins the theory of fair pricing and hedging.
Stratonovich integrals are not martingales in general, which is why Itô integrals are preferred in probabilistic and financial applications.
Quadratic variation
The quadratic variation of Brownian motion is:
This is a deterministic result despite being random, and it's the fundamental reason stochastic calculus differs from ordinary calculus. For a stochastic integral , the quadratic variation is:
This follows directly from the Itô isometry. Quadratic variation appears throughout stochastic calculus: in Itô's lemma, in the conversion between Itô and Stratonovich integrals, and in the definition of the integrals themselves.
Stochastic differential equations (SDEs)
SDEs combine everything above into a framework for modeling systems driven by noise. An SDE specifies how a process evolves through both a deterministic drift and a random diffusion term.
Definition of SDEs
An SDE takes the form:
This is shorthand for the integral equation:
- is the drift coefficient, governing the deterministic trend.
- is the diffusion coefficient, governing the intensity of noise.
- The stochastic integral can be interpreted in either the Itô or Stratonovich sense, yielding different equations with different solutions.
Strong vs weak solutions
- A strong solution is a process defined on the same probability space as the given Brownian motion , adapted to its filtration, and satisfying the integral equation pathwise (almost surely). Strong solutions are pathwise unique: if two strong solutions start at the same point, they agree for all time with probability 1.
- A weak solution requires only that there exists some probability space carrying both a Brownian motion and a process satisfying the SDE. The Brownian motion may not be the one you started with. Weak solutions are unique in distribution: any two weak solutions have the same probability law, but their individual paths may differ.
Strong solutions are more constructive and easier to work with numerically. Weak solutions are sufficient when you only care about distributional properties.
Existence and uniqueness of solutions
The standard sufficient conditions for a unique strong solution are:
-
Lipschitz continuity: There exists a constant such that for all , , :
-
Linear growth: There exists a constant such that for all , :
The Lipschitz condition prevents the coefficients from changing too abruptly, ensuring uniqueness. The linear growth condition prevents the solution from exploding to infinity in finite time, ensuring existence. Together, they guarantee a unique strong solution that exists for all time.
Numerical methods for SDEs
When analytical solutions aren't available, you approximate using numerical schemes:
-
Euler-Maruyama method: The simplest approach. Discretize time into steps of size and iterate: where . This achieves strong convergence of order .
-
Milstein method: Adds a correction term from the Itô-Taylor expansion: where . This achieves strong convergence of order .
-
Higher-order methods: Include additional terms from the Itô-Taylor expansion for better accuracy, at the cost of greater complexity.
The Euler-Maruyama method is the go-to starting point. The Milstein method is worth the extra effort when you need better pathwise accuracy.
Applications of stochastic integrals
Mathematical finance
Stochastic integrals are the language of modern quantitative finance. The Black-Scholes model assumes stock prices follow geometric Brownian motion:
Applying Itô's lemma to yields the Black-Scholes PDE, which gives closed-form option prices. More advanced models like the Heston stochastic volatility model use coupled SDEs to capture the empirical observation that volatility itself is random, with the variance process satisfying its own SDE.