Numerator Layout

  • Some texts use to represent .

In Relation to the Gradient

Suppose and s.t. . Then the gradient is defined as follows.

Also we can define the Hessian as follows.

Because of the Clairaut’s theorem, usually we use the following form.

In Relation to Jacobian Matrix

Suppose and s.t. . Then the Jacobian matrix is defined as follows.

In Relation to the Chain Rule

For any possible combination of shapes for , the following chain rule holds.

References

(1) Matrix Calculus, Wikipedia (https://en.wikipedia.org/wiki/Matrix_calculus)
(2) Jacobian matrix and determinant, Wikipedia (https://en.wikipedia.org/wiki/Jacobian_matrix_and_determinant)