๐Ÿ’นFinancial Mathematics

Key Concepts in Option Pricing Models

Study smarter with Fiveable

Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.

Get Started

Why This Matters

Option pricing models form the backbone of derivatives valuation and risk management, two areas that dominate quantitative finance exams and real-world applications. You're being tested not just on whether you can plug numbers into formulas, but on whether you understand why different models exist and when each approach is appropriate. The core principles here, no-arbitrage pricing, risk-neutral valuation, volatility modeling, and numerical methods, appear repeatedly across financial mathematics, from basic derivative pricing to advanced portfolio hedging.

Don't just memorize the Black-Scholes formula or the structure of a binomial tree. Focus on understanding what assumptions each model makes, what market behaviors it can (and can't) capture, and how models relate to one another. When an exam question asks you to choose an appropriate pricing method, you need to recognize whether the option is European or American, whether volatility is constant or stochastic, and whether the payoff depends on the entire price path. Know what problem each model solves, and you'll handle any question thrown at you.


Closed-Form Analytical Models

These models provide exact mathematical solutions for option prices, making them computationally efficient and ideal for European-style options. The key assumption enabling closed-form solutions is typically that the underlying follows a specific stochastic process with known distributional properties.

Black-Scholes-Merton Model

The BSM model is built on a no-arbitrage pricing foundation: option prices must be set so that no risk-free profit opportunity exists in an efficient market. To arrive at a tractable closed-form solution, the model assumes constant volatility and constant risk-free interest rates, along with continuous trading, no transaction costs, and no dividends (in the basic version).

Under these assumptions, the underlying asset price follows geometric Brownian motion (GBM), meaning log-returns are normally distributed and the stock price itself is log-normally distributed. The resulting pricing PDE simplifies to yield the famous formula for a European call:

C=S0N(d1)โˆ’Keโˆ’rTN(d2)C = S_0 N(d_1) - Ke^{-rT}N(d_2)

where:

  • d1=lnโก(S0/K)+(r+ฯƒ2/2)TฯƒTd_1 = \frac{\ln(S_0 / K) + (r + \sigma^2 / 2)T}{\sigma \sqrt{T}}
  • d2=d1โˆ’ฯƒTd_2 = d_1 - \sigma \sqrt{T}
  • S0S_0 is the current stock price, KK is the strike, rr is the risk-free rate, TT is time to expiration, ฯƒ\sigma is volatility, and N(โ‹…)N(\cdot) is the standard normal CDF

The model cannot directly price American options (which may be exercised early) and produces a flat implied volatility surface, which contradicts what we observe in real markets.

Garman-Kohlhagen Model

This model extends Black-Scholes to foreign exchange options by incorporating both the domestic risk-free rate rdr_d and the foreign risk-free rate rfr_f. In the standard BSM framework, a continuous dividend yield qq reduces the forward price of the underlying. For FX options, the foreign interest rate plays the role of that dividend yield, because holding the foreign currency earns the foreign risk-free rate. The closed-form solution for a European FX call becomes:

C=S0eโˆ’rfTN(d1)โˆ’Keโˆ’rdTN(d2)C = S_0 e^{-r_f T} N(d_1) - K e^{-r_d T} N(d_2)

This is essential for managing currency exposure in international portfolios.

Compare: Black-Scholes-Merton vs. Garman-Kohlhagen: both assume log-normal distributions and provide closed-form solutions, but Garman-Kohlhagen accounts for two interest rates instead of one. If a question asks about pricing currency options, Garman-Kohlhagen is your go-to model.


Lattice-Based Models

Lattice models discretize time and price movements into a tree structure, allowing step-by-step backward induction. This discrete framework makes them particularly powerful for American options, where early exercise decisions must be evaluated at each node.

Binomial Option Pricing Model

The binomial model uses a discrete-time framework where, at each time step, the stock price can move to one of two values: up by a factor uu or down by a factor dd. You price the option by working backward from the terminal payoffs at expiration to the present.

At each node, you calculate the risk-neutral expected value of the option one step ahead, discounted at the risk-free rate. For American options, you compare this continuation value against the immediate exercise value and take whichever is larger. This node-by-node comparison is what makes lattice models suitable for early exercise problems, something BSM can't handle directly.

As the number of time steps increases, the binomial model converges to the Black-Scholes price for European options, which provides useful intuition for why the continuous-time limit works.

Cox-Ross-Rubinstein (CRR) Model

The CRR model is a specific parameterization of the binomial model that produces a recombining tree: an up move followed by a down move lands at the same node as a down move followed by an up move. This means the number of nodes at each time step grows linearly (n+1n+1 nodes at step nn) rather than exponentially, which is critical for computational efficiency.

The up and down factors are set as u=eฯƒฮ”tu = e^{\sigma\sqrt{\Delta t}} and d=1/ud = 1/u, and the risk-neutral probability of an up move is derived directly from no-arbitrage conditions:

p=erฮ”tโˆ’duโˆ’dp = \frac{e^{r\Delta t} - d}{u - d}

The CRR tree's clear visual structure makes it an excellent tool for building intuition about how option values depend on the underlying price path.

Trinomial Tree Model

The trinomial model extends the lattice approach by allowing three possible movements per step: up, down, or unchanged. This finer granularity captures more potential price outcomes at each node.

The practical benefit is better convergence properties: you typically need fewer time steps to achieve the same pricing accuracy as a binomial tree. This advantage is especially noticeable for American options and options with complex early exercise features, such as Bermudan options.

Compare: Binomial vs. Trinomial Trees: both use backward induction and handle American options, but trinomial trees offer faster convergence and better accuracy for the same computational effort. Choose trinomial when precision matters; choose binomial when simplicity and transparency are priorities.


Stochastic Volatility Models

These models recognize that volatility itself is random and evolves over time. By treating volatility as a stochastic process, they capture real market phenomena like volatility clustering and the implied volatility smile.

Heston Model

In the Heston model, the variance vtv_t of the asset follows its own stochastic differential equation, specifically a mean-reverting square-root diffusion (also known as a CIR process):

dvt=ฮบ(ฮธโˆ’vt)โ€‰dt+ฮพvtโ€‰dWtvdv_t = \kappa(\theta - v_t)\,dt + \xi\sqrt{v_t}\,dW_t^v

where ฮบ\kappa is the speed of mean reversion, ฮธ\theta is the long-run variance, and ฮพ\xi is the volatility of volatility. The square-root term ensures variance stays non-negative (under certain parameter conditions).

A key feature is the correlation parameter ฯ\rho linking the Brownian motion driving the asset price to the one driving variance. When ฯ<0\rho < 0 (typical for equities), a drop in the stock price tends to coincide with a rise in volatility. This is the leverage effect, and it's what generates the implied volatility skew observed in equity markets.

The Heston model admits a semi-analytical solution via characteristic functions and Fourier inversion, making it much faster to calibrate than purely simulation-based stochastic volatility models.

Stochastic Volatility Models (General Framework)

The broader class of stochastic volatility models treats volatility as a random variable that evolves according to its own dynamics, allowing the model to adapt to changing market conditions.

Notable examples beyond Heston include the SABR model (Stochastic Alpha, Beta, Rho), which is widely used for interest rate derivatives and provides an approximate closed-form implied volatility formula. Each model specifies different volatility dynamics suited to different asset classes.

These models improve pricing accuracy for longer-dated options and exotic derivatives where the constant volatility assumption of BSM breaks down most visibly.

Compare: Black-Scholes vs. Heston: Black-Scholes assumes constant volatility and produces flat implied volatility across strikes, while Heston's stochastic volatility generates the smile/skew patterns observed in real markets. When exam questions reference implied volatility patterns, think stochastic volatility models.


Models for Discontinuous Price Movements

Markets don't always move smoothly. Sudden jumps from earnings announcements, geopolitical events, or market crashes require models that incorporate discontinuities. Jump-diffusion models combine continuous Brownian motion with discrete Poisson-driven jumps.

Jump-Diffusion Models

The most well-known version is the Merton jump-diffusion model, which adds a compound Poisson process to the standard GBM dynamics:

dSS=(rโˆ’ฮปkห‰)โ€‰dt+ฯƒโ€‰dWt+Jโ€‰dNt\frac{dS}{S} = (r - \lambda \bar{k})\,dt + \sigma\,dW_t + J\,dN_t

where NtN_t is a Poisson process with intensity ฮป\lambda (the average number of jumps per year), JJ is the random jump size (typically log-normally distributed), and kห‰\bar{k} is the expected percentage jump size. The Poisson process governs jump timing, so jumps arrive randomly but at a known average rate.

These models are essential for assets with event risk: think earnings announcements, FDA drug approvals, or merger news where prices gap overnight. The Merton model can be expressed as an infinite series of BSM-like terms, each weighted by the probability of nn jumps occurring.

Compare: Black-Scholes vs. Jump-Diffusion: Black-Scholes assumes continuous price paths and tends to underprice out-of-the-money options, while jump-diffusion models generate fatter tails and higher prices for deep OTM options. This distinction frequently appears in questions about model limitations.


Numerical Methods

When closed-form solutions don't exist due to American exercise features, path dependence, or complex payoffs, numerical methods provide the answer. These techniques trade computational cost for flexibility in handling virtually any derivative structure.

Monte Carlo Simulation

Monte Carlo pricing works by simulating thousands (or millions) of random price paths for the underlying asset under the risk-neutral measure, computing the option payoff along each path, and then averaging and discounting those payoffs back to the present.

The method's great strength is flexibility with path-dependent options like Asian options (payoff depends on the average price), lookback options (payoff depends on the maximum or minimum price), and barrier options (payoff depends on whether the price crosses a threshold). It also scales well to high-dimensional problems, such as basket options on multiple underlyings, where lattice methods become impractical.

The main drawback is computational cost, especially for achieving tight confidence intervals. Variance reduction techniques (antithetic variates, control variates, importance sampling) are commonly used to improve efficiency. Standard Monte Carlo is also poorly suited for American options, though extensions like the Longstaff-Schwartz least-squares regression method address this.

Finite Difference Methods

Finite difference methods solve the pricing PDE numerically by discretizing both time and the underlying asset price onto a grid. You replace the partial derivatives in the Black-Scholes PDE with finite difference approximations and solve the resulting system of equations.

Three common schemes offer different trade-offs:

  • Explicit scheme: simple to implement, but can be numerically unstable if the time step is too large relative to the price step
  • Implicit scheme: unconditionally stable, but requires solving a system of linear equations at each time step
  • Crank-Nicolson scheme: averages explicit and implicit approaches, offering second-order accuracy in time with unconditional stability

For American options, these methods handle the early exercise constraint through free boundary conditions that identify the optimal exercise boundary at each time step.

Compare: Monte Carlo vs. Finite Difference: Monte Carlo excels at high-dimensional problems and path-dependent payoffs, while finite difference methods are more efficient for low-dimensional American option pricing. Choose your method based on the derivative's specific features.


Quick Reference Table

ConceptBest Examples
Closed-form European pricingBlack-Scholes-Merton, Garman-Kohlhagen, Heston (semi-analytical)
American option valuationBinomial, Cox-Ross-Rubinstein, Trinomial, Finite Difference
Stochastic volatilityHeston, SABR, General SV models
Path-dependent optionsMonte Carlo Simulation
Jump/discontinuity modelingMerton Jump-Diffusion
Currency option pricingGarman-Kohlhagen
Numerical PDE solutionsFinite Difference Methods
Volatility smile captureHeston, SABR, Jump-Diffusion

Self-Check Questions

  1. Which two models both provide closed-form (or semi-analytical) solutions for European options but differ in their treatment of volatility, one assuming it's constant, the other modeling it as stochastic?

  2. You need to price an American put option on a stock. Which models from this guide would be appropriate, and why can't you use Black-Scholes directly?

  3. Compare Monte Carlo simulation with finite difference methods: what types of derivatives is each best suited for, and what are the computational trade-offs?

  4. Implied volatility data shows higher volatilities for out-of-the-money puts than at-the-money options. Which model limitation does this reveal, and which alternative models address it?

  5. The Garman-Kohlhagen model extends Black-Scholes for FX options. What specific modification does it make to the standard framework, and why is this necessary for currency derivatives?