Partial derivatives are the foundation of Calculus IV. They extend differentiation to functions with multiple inputs. Whenever you analyze a surface, optimize a multivariable function, or study how a physical system responds to changes, you're relying on partial derivatives. The rules here connect directly to gradient vectors, directional derivatives, optimization, and vector field analysis.
You'll be tested on more than computation. Examiners want to see that you understand when to apply each rule and why it works. The chain rule for partials drives related rates problems. Clairaut's theorem saves you time on mixed partials. The gradient ties everything together for optimization. Knowing what concept each rule captures, and when to reach for it, matters just as much as cranking through algebra.
Foundational Concepts and Notation
Before applying any rules, you need a solid understanding of what partial derivatives actually measure and how to communicate them precisely. A partial derivative isolates the rate of change in one direction while treating all other variables as constants.
Definition of Partial Derivatives
The formal definition mirrors the single-variable limit definition. For a function f(x,y):
fx(a,b)=limh→0hf(a+h,b)−f(a,b)
Measures single-variable change: a partial derivative tells you how f changes as one variable varies while all others stay fixed.
Enables independent analysis of each variable's contribution to the function's behavior.
Notation ∂x∂f uses the curly ∂ to distinguish partial differentiation from total (ordinary) differentiation.
Partial Derivative Notation
First-order partials use ∂x∂f, ∂y∂f, or subscript notation like fx, fy.
Mixed second derivatives written as ∂x∂y∂2f mean: differentiate first with respect to y, then with respect to x. Read right to left in Leibniz notation.
Higher-order notation extends naturally: ∂x2∂y∂3f means differentiate once with respect to y, then twice with respect to x.
Compare:∂x∂f vs. dxdf. Both use Leibniz-style notation, but the curly ∂ signals a multivariable context where other variables are held constant. If a problem uses ∂, you're in partial derivative territory.
Differentiation Rules for Partial Derivatives
The computational rules from single-variable calculus (product rule, quotient rule, chain rule) all extend to partial derivatives with one key modification: treat every variable except the one you're differentiating with respect to as a constant.
Partial Derivatives of Multivariable Functions
Here's the process:
Identify which variable you're differentiating with respect to.
∂x∂(x3y2)=3x2y2 (power rule; y2 is just a constant coefficient).
∂x∂(sin(xy))=ycos(xy) (chain rule; derivative of the inner function xy with respect to x is y).
So fx=3x2y2+ycos(xy).
Setting all first partials equal to zero simultaneously is how you find critical points of multivariable functions.
Chain Rule for Partial Derivatives
This is where things get genuinely tricky. The chain rule has different forms depending on the dependency structure.
Case 1: Single parameter. If z=f(x,y) where x=x(t) and y=y(t), then:
dtdz=∂x∂fdtdx+∂y∂fdtdy
Notice the result is an ordinary derivative dtdz because z ultimately depends on only one independent variable.
Case 2: Multiple parameters. If z=f(x,y) where x=x(s,t) and y=y(s,t), then:
∂s∂z=∂x∂f∂s∂x+∂y∂f∂s∂y
and similarly for ∂t∂z.
Tree diagrams are genuinely useful here. Draw z at the top, connect it to x and y, then connect those to s and t. Each path from z down to your target variable contributes one term (multiply along branches, add across paths).
Implicit Differentiation for Multivariable Functions
When a relationship is given as F(x,y,z)=0 and you can't easily solve for z, implicit differentiation gives you the partial derivatives directly.
Formula: If F(x,y,z)=0 and Fz=0, then:
∂x∂z=−FzFxand∂y∂z=−FzFy
Why this works: Differentiate F(x,y,z)=0 with respect to x using the chain rule (treating z as a function of x and y), then solve for ∂x∂z.
Compare: Standard partial differentiation vs. implicit differentiation. Both find partial derivatives, but implicit differentiation works when you can't isolate the dependent variable. Use implicit when you see equations like x2+y2+z2=1 rather than z=1−x2−y2.
Symmetry and Higher-Order Derivatives
When you take multiple partial derivatives, the order can matter. Clairaut's theorem tells you exactly when you can swap the order, which is most of the time for functions you'll encounter in this course.
Clairaut's Theorem (Equality of Mixed Partials)
Statement: If fxy and fyx are both continuous on an open region, then:
∂x∂y∂2f=∂y∂x∂2f
Computation shortcut: choose whichever differentiation order is easier. The result is identical as long as the continuity condition holds.
The continuity requirement is almost always satisfied for functions on exams. The exception to watch for is piecewise-defined functions at the boundary point, where the mixed partials can fail to be continuous and may differ.
Higher-Order Partial Derivatives
Pure second partials like fxx=∂x2∂2f measure concavity in the x-direction, just like single-variable second derivatives.
Mixed partials like fxy capture how the rate of change in one direction varies as you move in another direction. Think of it as measuring "twist."
The Hessian matrix collects all second partials into a matrix:
H=[fxxfxyfxyfyy]
The second derivative test uses the Hessian's determinant: D=fxxfyy−(fxy)2. At a critical point where ∇f=0:
D>0 and fxx>0: local minimum
D>0 and fxx<0: local maximum
D<0: saddle point
D=0: test is inconclusive
Compare:∂x2∂2f vs. ∂x∂y∂2f. Pure second partials measure curvature along axes, while mixed partials measure twist. Both appear in the discriminant D=fxxfyy−(fxy)2.
Gradient and Directional Analysis
The gradient packages all first partial derivatives into a single vector, unlocking powerful geometric interpretations. It points toward steepest increase, and its magnitude tells you how steep.
Gradient Vector
Definition: For f(x,y,z):
∇f=⟨∂x∂f,∂y∂f,∂z∂f⟩=⟨fx,fy,fz⟩
Points toward steepest ascent: moving in the direction of ∇f increases f as rapidly as possible.
Magnitude ∥∇f∥ equals the maximum rate of change of f at that point.
Normal to level curves/surfaces: ∇f at a point is perpendicular to the level curve (2D) or level surface (3D) passing through that point. This is why the gradient shows up in tangent plane equations.
Directional Derivatives
The directional derivative gives the rate of change of f in any direction, not just along coordinate axes.
Duf=∇f⋅u
where umust be a unit vector (∥u∥=1). If you're given a direction vector that isn't unit length, normalize it first.
Three facts worth memorizing:
MaximumDuf=∥∇f∥, occurring when \mathbf{u}} points in the gradient direction.
MinimumDuf=−∥∇f∥, occurring in the direction opposite the gradient.
Zero directional derivative occurs when u is perpendicular to ∇f, meaning you're moving along a level curve/surface.
Compare: Gradient vs. directional derivative. The gradient gives you the direction of maximum increase, while the directional derivative gives you the rate of change in any specified direction. A common exam question: "In what direction does f increase most rapidly?" Answer: the gradient direction.
Vector-Valued Extensions
When your function outputs a vector instead of a scalar, partial derivatives apply component-by-component. This extends naturally to analyzing vector fields and parametric surfaces.
Partial Derivatives of Vector-Valued Functions
For a parametric surface r(u,v)=⟨x(u,v),y(u,v),z(u,v)⟩, take partials of each component separately:
ru=⟨∂u∂x,∂u∂y,∂u∂z⟩
The vectors ru and rv are tangent to the surface at each point.
Their cross product ru×rv gives a normal vector to the surface, which you'll need for surface integrals and flux calculations.
Quick Reference Table
Concept
Key Formula / Idea
Basic computation
∂x∂f, ∂y∂f with other variables held constant
Chain rule
dtdz=fxdtdx+fydtdy; use tree diagrams for complex dependencies
Implicit differentiation
∂x∂z=−FzFx when F(x,y,z)=0
Clairaut's theorem
fxy=fyx when mixed partials are continuous
Second derivative test
D=fxxfyy−(fxy)2; check sign of D and fxx
Gradient vector
∇f=⟨fx,fy,fz⟩; points toward steepest ascent; normal to level surfaces
Directional derivatives
Duf=∇f⋅u where u is a unit vector
Self-Check Questions
If f(x,y)=x2y+exy, which rule do you use to find ∂x∂f, and what do you treat y as during the calculation?
Compare ∂x∂y∂2f and ∂y∂x∂2f. Under what condition are they equal, and why does this matter computationally?
Given ∇f=⟨3,−4⟩ at a point, what is the maximum rate of change of f, and in what direction does it occur?
When would you choose implicit differentiation over standard partial differentiation? Give an example equation where implicit differentiation is the better approach.
FRQ-style: A surface is defined by z=f(x,y). Explain how you would use the gradient to find a vector normal to the level curve f(x,y)=c and a vector normal to the surface itself. What's the relationship between these two normals?