Derivatives
Partial Derivative
Let $U$ be an open subset of $\mathbb{R}^m$ , $\mathbf{f}: U \mapsto \mathbb{R}^n$ . The partial derivative of $\mathbf{f}$ with respect to the $i$ th variable is:
For instance, given $\mathbf{f}\begin{pmatrix}x_1 \\ x_2\end{pmatrix} = \begin{bmatrix}x_1^2 + x_2 \\ x_1 + x_2^2\end{bmatrix}$ , we have:
That is, calculate the derivative for each $f_i$ on the spot.
Jacobian Matrix and Derivatives
For function $f$ from $\mathbb{R} \mapsto \mathbb{R}$ , we have the following definition of derivative:
If the above limit exists, then we say that $f$ is derivable at point $x$ , and the derivative of $f$ at point $x$ is $m$ . We can have a better understanding of the derivative by rewriting the above equation as:
A useful interpretation for this equation is that the increment of $f$ at point $x$ can be approximated by a linear function $mh$ , and the approximation error is small compared to $h$ . This also leads us to the definition of derivative for higher dimensions:
If there exists such a linear transformation $\mathbf{M}$ that makes the above equation hold, then we say that $\mathbf{f}$ is derivable at point $\mathbf{a}$ , and the derivative of $\mathbf{f}$ at point $\mathbf{a}$ is the linear transformation $\mathbf{M}$ .
Since each partial derivative $D_i \mathbf{f}(\mathbf{a})$ is a column vector, it is very natural to combine all partial derivatives into a matrix, which is called the Jacobian matrix.
The Jacobian matrix of $\mathbf{f}(\mathbf{a})$ puts partial derivative for each component in the order from left to right.
Jacobian matrix is simply a matrix of partial derivatives, so the existence of Jacobian matrix only indicates that the function is partially derivable at point $\mathbf{a}$ . For the Jacobian matrix to be the function’s derivative, we have to show the following equation holds:
Although $\vec\nabla f(\mathbf{a}) = \left[Df(\mathbf{a})\right]^\mathsf{T}$ , do not confuse gradients with derivatives. Geometrically, gradients are only defined in spaces with an inner product. And, gradients can only be vectors, which means multi-valued functions do not have gradients (but they still can have Jacobian matrices and derivatives).
Directional Derivatives
Directional derivatives are an extension of partial derivatives, it is defined as:
If function $\mathbf{f}$ is also derivable, then we can calculate the directional derivative in the following method:
Note that $\vec{v}$ need not be a unit vector.
$C^p$ functions
A function is $C^p$ function if all its partial derivatives up to order $p$ exist and are continuous on $U$ , Specially:
- A function on $U \subseteq \mathbb{R}^n$ is $C^0$ function if it is continuous on $U$ (the derivative of order $0$ refers to the function itself);
- A function on $U \subseteq \mathbb{R}^n$ is $C^1$ function if all its partial derivatives exist and are continuous on $U$ .