You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The gradient of of the matrix valued function F(X) = log(X) or the Hessian of the scalar valued function f(X) = tr(X * log(X)) has an analytic form e.g. Equation (64) from https://arxiv.org/pdf/1906.00037.pdf.
The code for the analytic form can become problematic if the values of some pair of eigenvalues in the input come close together. But this is not the case for the bad_W in the example, so I'm not sure what other sources of trouble to suspect.
The text was updated successfully, but these errors were encountered:
The gradient of of the matrix valued function
F(X) = log(X)
or the Hessian of the scalar valued functionf(X) = tr(X * log(X))
has an analytic form e.g. Equation (64) from https://arxiv.org/pdf/1906.00037.pdf.I've hit a few cases where the derivatives I compute manually differ from ForwardDiff, one example is the
bad_W
in: https://gist.github.com/lkapelevich/8e4e4020cc56e13a0a42e73c4fa247d3The code for the analytic form can become problematic if the values of some pair of eigenvalues in the input come close together. But this is not the case for the
bad_W
in the example, so I'm not sure what other sources of trouble to suspect.The text was updated successfully, but these errors were encountered: