Numerator Layout
- Some texts use to represent .
In Relation to the Gradient
Suppose and s.t. . Then the gradient is defined as follows.
Also we can define the Hessian as follows.
Because of the Clairaut’s theorem, usually we use the following form.
In Relation to Jacobian Matrix
Suppose and s.t. . Then the Jacobian matrix is defined as follows.
In Relation to the Chain Rule
For any possible combination of shapes for , the following chain rule holds.
References
(1) Matrix Calculus, Wikipedia (https://en.wikipedia.org/wiki/Matrix_calculus)
(2) Jacobian matrix and determinant, Wikipedia (https://en.wikipedia.org/wiki/Jacobian_matrix_and_determinant)