The gradient vector $\nabla f(x, y, z)$ points in the direction of the maximum rate of change of a function $f(x, y, z)$ at a given point. Its components are the partial derivatives with respect to each variable:

$\nabla f(x, y, z) = \langle f_x(x, y, z),\; f_y(x, y, z),\; f_z(x, y, z) \rangle$

One geometric fact worth remembering: the gradient is always perpendicular to the level curves (in 2D) or level surfaces (in 3D) of the function at the point where it's calculated. This makes sense if you think about it: along a level surface, $f$ doesn't change at all, so the direction of greatest change must point away from that surface, i.e., normal to it.

The directional derivative $D_{\vec{u}}f(x, y, z)$ measures the rate of change of $f$ in the direction of a unit vector $\vec{u}$ . The core formula connecting it to the gradient is:

$D_{\vec{u}}f(x, y, z) = \nabla f(x, y, z) \cdot \vec{u}$

Because this is a dot product, you can also write it as:

$D_{\vec{u}}f = \|\nabla f\| \cos\theta$

where $\theta$ is the angle between $\nabla f$ and $\vec{u}$ . This form makes the sign behavior obvious:

Positive when $\vec{u}$ has a component along the gradient direction ( $\theta < 90°$ ), meaning $f$ is increasing.
Negative when $\vec{u}$ points partly against the gradient ( $\theta > 90°$ ), meaning $f$ is decreasing.
Zero when $\vec{u}$ is perpendicular to the gradient ( $\theta = 90°$ ), meaning you're moving along a level surface.

Relationship between Gradient and Directional Derivatives, Directional derivative - Wikipedia

Properties of Gradient and Directional Derivatives

Since $D_{\vec{u}}f = \|\nabla f\| \cos\theta$ , the extreme values follow directly from the range of cosine:

Maximum directional derivative: $\|\nabla f(x, y, z)\|$ , occurring when $\vec{u}$ is parallel to $\nabla f$ ( $\theta = 0$ , so $\cos\theta = 1$ ).
Minimum directional derivative: $-\|\nabla f(x, y, z)\|$ , occurring when $\vec{u}$ is antiparallel to $\nabla f$ ( $\theta = \pi$ , so $\cos\theta = -1$ ).

In other words, the gradient's magnitude tells you the steepest possible rate of increase, and the gradient's direction tells you which way to go to achieve it. Moving opposite to the gradient gives the steepest decrease.

If $f$ is differentiable at a point, then the directional derivative exists in every direction at that point and equals $\nabla f \cdot \vec{u}$ . This is sometimes called the gradient theorem for directional derivatives, and it's what justifies using the dot product formula rather than the limit definition every time.

Relationship between Gradient and Directional Derivatives, Directional Derivatives and the Gradient · Calculus

Differentiability and Chain Rule

Differentiability in Multivariable Functions

A function $f(x, y, z)$ is differentiable at a point if it's continuous there and its partial derivatives exist and are continuous in some neighborhood of the point. (A sufficient condition: if all partial derivatives are continuous near the point, then $f$ is differentiable there.)

Differentiability is a stronger condition than just having partial derivatives. For example, $f(x, y) = \sqrt{|xy|}$ has partial derivatives at $(0, 0)$ (both equal zero), but it's not differentiable there because the function doesn't behave linearly in all directions near that point.

When $f$ is differentiable at a point $(a, b, c)$ , two things follow:

The tangent plane exists, with the gradient vector $\nabla f$ as its normal.
The linear approximation is valid:

$L(x, y, z) = f(a, b, c) + f_x(a, b, c)(x - a) + f_y(a, b, c)(y - b) + f_z(a, b, c)(z - c)$

This approximation estimates $f$ near the point and also gives the equation of the tangent plane to the graph (or to the level surface, depending on context).

Multivariable Chain Rule

The chain rule for multivariable functions handles composite functions where the inputs are themselves functions of other variables.

If $z = f(x, y)$ is differentiable and $x = g(t)$ , $y = h(t)$ are differentiable functions of $t$ , then:

$\frac{dz}{dt} = \frac{\partial f}{\partial x}\frac{dx}{dt} + \frac{\partial f}{\partial y}\frac{dy}{dt}$

This extends naturally: if $f$ depends on $n$ variables and each variable depends on $m$ parameters, you sum over all intermediate variables for each parameter. Tree diagrams can help you keep track of which paths contribute to each derivative.

A common application is finding the rate of change of a quantity along a parametric curve. If a particle moves along $\vec{r}(t) = \langle x(t), y(t), z(t) \rangle$ through a scalar field $f$ , then the rate of change of $f$ along the particle's path is:

$\frac{df}{dt} = \nabla f \cdot \vec{r}\,'(t)$

Notice this is just the directional derivative formula scaled by the particle's speed. If $\vec{r}\,'(t)$ is a unit vector, you recover the directional derivative exactly. This is one of the cleanest ways to see how the chain rule and the gradient-directional derivative relationship are really the same idea from two different angles.