Document subgradient convention #404

oxinabox · 2021-07-20T19:08:05Z

This is not written down anywhere in particular.
One writeup of it here
FluxML/Zygote.jl#1036 (comment)

mcabbott · 2021-07-27T01:00:58Z

We say "If the derivative is not defined, but the subgradient contains zero, then just say the derivative is 0".

See discussion of clamp in JuliaDiff/ForwardDiff.jl#480 (comment) for a possible counterexample. If y = clamp(x,0,1) has zero gradient at the endpoints, your parameter will tend to get stuck there; it's more useful to give the nonzero gradient so that, if moving into the bulk lowers the loss, it can see that.

oxinabox · 2021-07-27T08:28:30Z

Yeah, the wording I now have in mind is something like

You are free to chose any element of the subgradient. Choose the most useful. This will often mean choosing 0.

That counter example is a good one to include so can show that not always will the most useful it be zero.

Some more comments on the subgradient convention are here:
https://twitter.com/Awfidius/status/1419213506382028801

mcabbott · 2021-07-27T11:26:08Z

When all sub-gradients are finite, their mean is probably the neutral choice. But in the ForwardDiff clamp story, it's the one you can't have, as you can't evaluate both branches.

The mean is also the one used by FiniteDifferences, I think. Which causes some tests to fail with an implementation of maximum via findmax: JuliaDiff/ChainRules.jl#480.

oxinabox added the documentation Improvements or additions to documentation label Jul 23, 2021

oxinabox mentioned this issue Jul 27, 2021

Document what to do at nondifferentiable points #419

Merged

oxinabox mentioned this issue Aug 27, 2021

Fix rules for ^ JuliaDiff/ChainRules.jl#513

Merged

oxinabox closed this as completed in #419 Nov 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document subgradient convention #404

Document subgradient convention #404

oxinabox commented Jul 20, 2021

mcabbott commented Jul 27, 2021

oxinabox commented Jul 27, 2021

mcabbott commented Jul 27, 2021

Document subgradient convention #404

Document subgradient convention #404

Comments

oxinabox commented Jul 20, 2021

mcabbott commented Jul 27, 2021

oxinabox commented Jul 27, 2021

mcabbott commented Jul 27, 2021