Why This Matters
Partial derivatives are your gateway to understanding how multivariable functions behave—and in Calculus II, you're being tested on your ability to analyze functions that depend on more than one variable simultaneously. This isn't just an extension of single-variable calculus; it's a fundamentally different way of thinking about change. You'll need to master how functions respond when you tweak one variable while holding others fixed, how the gradient captures the direction of steepest increase, and how optimization works when you have multiple inputs to consider.
The concepts here connect directly to applications you'll see on exams: finding tangent planes, optimizing functions with constraints, and understanding how composite functions behave through the multivariable chain rule. Don't just memorize formulas—know why we treat other variables as constants, when Clairaut's theorem applies, and how the gradient relates to directional derivatives. These conceptual connections are exactly what FRQ prompts target.
Foundations: What Partial Derivatives Measure
Before diving into techniques, you need a rock-solid understanding of what partial derivatives actually represent. A partial derivative isolates the effect of one variable on a function's output while treating all other variables as constants.
Definition of Partial Derivatives
- Measures the rate of change of a function with respect to one variable while all other variables remain fixed—think of slicing a 3D surface with a vertical plane
- Fundamental to multivariable analysis—without this tool, you cannot analyze surfaces, optimize functions, or compute rates of change in higher dimensions
- Conceptually distinct from ordinary derivatives—you're examining change along one axis only, not total change
Notation for Partial Derivatives
- The symbol ∂ distinguishes partial derivatives from ordinary derivatives d—common notations include ∂x∂f, fx, and ∂xf
- Higher-order notation uses repeated symbols: ∂x2∂2f for second partials, ∂x∂y∂2f for mixed partials
- Subscript notation is faster for computation: fxx means differentiate twice with respect to x, while fxy means differentiate with respect to x then y
Calculating Partial Derivatives
- Treat all other variables as constants—if finding ∂x∂ of f(x,y)=x2y+y3, the y terms act like coefficients, giving 2xy
- Apply standard differentiation rules (product, quotient, chain) exactly as in single-variable calculus—the only difference is what you treat as constant
- Identify your variable clearly before differentiating—careless errors often come from forgetting which variable is "active"
Compare: Partial derivatives vs. ordinary derivatives—both measure instantaneous rate of change, but partial derivatives hold other variables fixed while ordinary derivatives allow all variables to change. On FRQs, always state explicitly which variables you're treating as constants.
Higher-Order Behavior: Second Derivatives and Beyond
Higher-order partial derivatives reveal curvature and concavity of multivariable functions—essential for optimization and understanding surface geometry.
Higher-Order Partial Derivatives
- Second partial derivatives like fxx and fyy measure how the rate of change itself changes—analogous to concavity in single-variable calculus
- Used in the second derivative test for classifying critical points as maxima, minima, or saddle points
- Notation matters: ∂x2∂2f means differentiate with respect to x twice, giving information about curvature along the x-direction
Mixed Partial Derivatives
- Differentiate with respect to different variables in succession—fxy means take ∂y∂ of ∂x∂f
- Capture variable interaction—a nonzero mixed partial indicates that changing one variable affects how the function responds to changes in another
- Order of notation: in ∂y∂x∂2f, you differentiate with respect to x first, then y (read right to left)
Clairaut's Theorem (Equality of Mixed Partials)
- States that fxy=fyx provided both mixed partials are continuous near the point—this is almost always satisfied for functions you'll encounter
- Simplifies calculations dramatically—you can compute mixed partials in whichever order is easier
- When to cite it: if an exam asks you to verify equality of mixed partials, check continuity conditions and invoke Clairaut's theorem
Compare: fxy vs. fyx—by Clairaut's theorem, these are equal for continuous functions, but the computation path differs. If one order involves simpler algebra, use it. This is a common exam shortcut.
The Chain Rule and Implicit Differentiation
When functions are composed or defined implicitly, you need specialized techniques. The multivariable chain rule tracks how changes propagate through nested dependencies.
The Chain Rule for Partial Derivatives
- Handles composite functions where intermediate variables depend on other variables—if z=f(x,y) and x=g(t), y=h(t), then dtdz=∂x∂fdtdx+∂y∂fdtdy
- Tree diagrams help visualize dependencies—draw arrows from the final output back through intermediate variables to independent variables
- Essential for related rates problems in multiple dimensions and for functions defined parametrically
Implicit Differentiation with Partial Derivatives
- Used when you can't solve explicitly for one variable—given F(x,y,z)=0, find ∂x∂z by differentiating both sides
- Formula shortcut: ∂x∂z=−FzFx when Fz=0—memorize this for speed on exams
- Apply chain rule implicitly—every term with z gets multiplied by ∂x∂z when differentiating with respect to x
Compare: Explicit vs. implicit differentiation—explicit is straightforward substitution, while implicit handles equations you can't solve for a single variable. FRQs often give surfaces in implicit form, so master the formula ∂x∂z=−FzFx.
Gradients and Directional Derivatives
The gradient unifies all first-order partial derivatives into a single vector that points toward steepest ascent. Directional derivatives generalize partial derivatives to arbitrary directions.
Gradient Vector
- Defined as ∇f=⟨fx,fy⟩ (or ⟨fx,fy,fz⟩ in 3D)—collects all first partial derivatives into one vector
- Points in the direction of steepest increase—this is the most important geometric fact about gradients
- Magnitude ∣∇f∣ gives the maximum rate of change of the function at that point
Directional Derivatives
- Measures rate of change in any direction—not just along coordinate axes like partial derivatives
- Computed as Duf=∇f⋅u where u is a unit vector in the desired direction—don't forget to normalize!
- Maximum value equals ∣∇f∣ (achieved when moving in the gradient direction); minimum is −∣∇f∣ (opposite direction)
Compare: Partial derivatives vs. directional derivatives—partial derivatives are special cases of directional derivatives along coordinate axes (i or j). If asked for the rate of change in a non-axis direction, you need the directional derivative formula.
Geometric Applications: Tangent Planes and Linearization
Partial derivatives let you construct linear approximations to surfaces—the multivariable analog of tangent lines.
Tangent Planes and Normal Lines
- Tangent plane equation: for z=f(x,y) at point (a,b,f(a,b)), the plane is z−f(a,b)=fx(a,b)(x−a)+fy(a,b)(y−b)
- Normal vector is ⟨fx,fy,−1⟩ for explicit surfaces, or ∇F for implicit surfaces F(x,y,z)=0
- Normal line passes through the point in the direction of the normal vector—used in optimization and geometric analysis
Taylor Series for Multivariable Functions
- Extends polynomial approximation to functions of several variables—the linear terms use first partials, quadratic terms use second partials
- First-order approximation: f(x,y)≈f(a,b)+fx(a,b)(x−a)+fy(a,b)(y−b)—this is the tangent plane equation
- Second-order terms involve fxx, fyy, and fxy—crucial for the second derivative test in optimization
Compare: Tangent plane vs. linearization—these are the same object with different names. "Tangent plane" emphasizes geometry; "linearization" emphasizes approximation. Both use the same formula.
Optimization and Applications
Partial derivatives are the primary tool for finding extrema of multivariable functions and modeling real-world systems.
Partial Derivatives in Optimization Problems
- Critical points occur where ∇f=0—set all first partial derivatives equal to zero and solve the system
- Second derivative test uses the discriminant D=fxxfyy−(fxy)2: if D>0 and fxx>0, you have a local minimum; D>0 and fxx<0 gives a maximum; D<0 means saddle point
- Boundary analysis required for constrained regions—check critical points and boundary extrema
Applications in Physics and Engineering
- Heat equation and diffusion use ∂t∂u and ∇2u (the Laplacian) to model temperature distribution over time
- Fluid dynamics relies on partial derivatives to express conservation laws and flow rates
- Economics uses partial derivatives for marginal analysis—∂x∂C gives marginal cost with respect to one input
Compare: Single-variable vs. multivariable optimization—single-variable uses f′(x)=0 and the second derivative sign; multivariable uses ∇f=0 and the discriminant D. The discriminant has no single-variable analog because saddle points don't exist in 1D.
Quick Reference Table
|
| Basic computation | Treating other variables as constants, standard differentiation rules |
| Higher-order derivatives | fxx, fyy, fxy for curvature and the second derivative test |
| Clairaut's theorem | fxy=fyx when mixed partials are continuous |
| Chain rule | Composite functions, parametric dependencies, tree diagrams |
| Implicit differentiation | ∂x∂z=−FzFx for implicit surfaces |
| Gradient | ∇f=⟨fx,fy⟩, direction of steepest ascent |
| Directional derivative | Duf=∇f⋅u, rate of change in direction u |
| Tangent planes | z−z0=fx(x−x0)+fy(y−y0) |
| Optimization | Critical points where ∇f=0, discriminant D=fxxfyy−fxy2 |
Self-Check Questions
-
If f(x,y)=x3y2+sin(xy), what is fxy? Verify that fxy=fyx and explain why Clairaut's theorem guarantees this.
-
Compare the gradient vector and the directional derivative: how does knowing ∇f at a point let you find the rate of change in any direction?
-
For the surface x2+y2+z2=9, use implicit differentiation to find ∂x∂z. What does this derivative represent geometrically?
-
A critical point has fxx=4, fyy=3, and fxy=5. Calculate the discriminant D and classify the critical point. What would change if fxy=2 instead?
-
Contrast finding the tangent plane to z=f(x,y) at a point versus finding the tangent plane to an implicitly defined surface F(x,y,z)=0. What role does the gradient play in each case?