Linear Algebra Basics

Matrix derivatives

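For a function $f: \mathbb{R}^{n \times d} \to \mathbb{R}$ mapping from $n$-by-$d$ matrices to the real numbers, we define the derivative of $f$ with respect to $A$ to be:

$$
\nabla_A f(A) =
\begin{bmatrix}
\frac{\partial f}{\partial A_{11}} & \cdots & \frac{\partial f}{\partial A_{1d}} \\
\vdots & \ddots & \vdots \\
\frac{\partial f}{\partial A_{n1}} & \cdots & \frac{\partial f}{\partial A_{nd}}
\end{bmatrix}
$$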

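Thus, the gradient $\nabla_A f(A)$ is itself an $n$-by-$d$ matrix, whose $(i, j)$-element is $\partial f / \partial A_{ij}$. For example, suppose $A$ is a 2-by-2 matrix, and the function $f: \mathbb{R}^{2 \times 2} \to \mathbb{R}$ is given by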

Here, $A_{ij}$ denotes the $(i, j)$ entry of the matrix $A$. We then have
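
To make the entrywise definition concrete, here is a minimal Python sketch. The function `f` below is a hypothetical choice made for illustration, and the names `analytic_grad` and `numeric_grad` are likewise assumptions, not part of the notes. The sketch builds the gradient matrix by hand, one partial derivative per entry, and compares it against a central finite-difference estimate.

```python
def f(A):
    # Hypothetical function of a 2-by-2 matrix (assumed for illustration):
    # f(A) = 1.5 * A_11 + 5 * A_12^2 + A_21 * A_22
    return 1.5 * A[0][0] + 5.0 * A[0][1] ** 2 + A[1][0] * A[1][1]


def analytic_grad(A):
    # Entry (i, j) is the partial derivative of f with respect to A_ij,
    # worked out by hand for the f defined above.
    return [[1.5, 10.0 * A[0][1]],
            [A[1][1], A[1][0]]]


def numeric_grad(func, A, eps=1e-6):
    # Central finite differences: perturb one entry at a time and
    # record the resulting change in the function value.
    rows, cols = len(A), len(A[0])
    grad = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            plus = [row[:] for row in A]
            minus = [row[:] for row in A]
            plus[i][j] += eps
            minus[i][j] -= eps
            grad[i][j] = (func(plus) - func(minus)) / (2 * eps)
    return grad


A = [[1.0, 2.0], [3.0, 4.0]]
G_analytic = analytic_grad(A)   # same shape as A
G_numeric = numeric_grad(f, A)  # should agree up to rounding error
```

The gradient has the same 2-by-2 shape as the input, and each entry depends only on how `f` changes when the matching entry of `A` is perturbed, which is what the definition of $\nabla_A f(A)$ states.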