Add a Warning / Error when sigmoid activation functions are used #212

Open
nicogross opened this issue Feb 4, 2025 · 2 comments
Labels
core (Feature/bug concerning core functionality) · enhancement (New feature or request)


@nicogross

The sigmoid activation function 1/(1+e^-x) is missing two properties required by LRP:
f(0) = 0 and sign(f(-x)) = -1 for x > 0 (the counterpart, sign(f(x)) = +1 for x > 0, does hold for the sigmoid).

I tried using the simple LRP-0 rule on a network computing sigmoid(2*(-1) + 1*1) and got unintuitive results.
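A quick numeric check of the two properties (tanh and ReLU shown here only for comparison) illustrates that the sigmoid violates both:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):
    return max(0.0, x)

for name, f in [("sigmoid", sigmoid), ("tanh", math.tanh), ("relu", relu)]:
    # required by LRP: f(0) = 0 and sign(f(-x)) = -1 for x > 0
    print(f"{name:8s} f(0) = {f(0.0):+.4f}   f(-2) = {f(-2.0):+.4f}")

# sigmoid  f(0) = +0.5000   f(-2) = +0.1192   -> violates both properties
# tanh     f(0) = +0.0000   f(-2) = -0.9640   -> satisfies both
# relu     f(0) = +0.0000   f(-2) = +0.0000   -> f(0) = 0, and f(-x) is never positive
```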

@chr5tphr
Owner

chr5tphr commented Feb 7, 2025

Hey Nico,

Thanks a lot for the issue. I think this makes sense. We could create another type, something like SignSymmetricActivation, to replace [Activation](https://github.com/chr5tphr/zennit/blob/e5699aa7e6fb98bec67505af917d0a17cd81d3b5/src/zennit/types.py#L101) in the basic LRP rules, so that only sign-symmetric activations are skipped rather than all activations. With #147, sigmoid would then use the gradient, but would raise a warning.
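Such a type could look roughly like the sketch below (only a sketch: the concrete subclass list and the reuse of the SubclassMeta / `__subclass__` pattern from zennit/types.py are assumptions, not a final design):

```python
# Sketch only: the subclass list and the SubclassMeta / __subclass__ pattern
# from zennit/types.py are assumptions here, not a final design.
import torch
from zennit.types import SubclassMeta


class SignSymmetricActivation(metaclass=SubclassMeta):
    '''Activations with f(0) = 0 and sign(f(x)) = sign(x), safe to skip in basic LRP rules.'''
    __subclass__ = (
        torch.nn.ReLU,
        torch.nn.LeakyReLU,
        torch.nn.Tanh,
        torch.nn.Hardtanh,
        torch.nn.Softsign,
    )

# Composites would then map their skip/pass rule to SignSymmetricActivation
# instead of Activation, so torch.nn.Sigmoid no longer matches and could raise
# a warning (or fall back to the gradient, as discussed for #147).
```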

Just for reference, could you share your un-intuitive results here?

chr5tphr added the enhancement and core labels on Feb 7, 2025
@nicogross
Author

nicogross commented Feb 7, 2025

Just a simple example:

f(x) = sigmoid(x1*w1 + x2*w2) = sigmoid(z1 + z2)
x1 = 2 and x2 = 1
w1 = -1 and w2 = 1
-> z1 = -2 and z2 = 1
f(x) = sigmoid(-2 + 1) = sigmoid(-1) = 0.2689

x1 (or z1) pushes the activation lower and x2 (or z2) pushes it higher.
x1 should therefore be assigned a negative or small relevance and x2 a larger one, but LRP-0 gives:
R1 = 0.5379 and R2 = -0.2689
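Written out in plain Python (assuming the output relevance is initialized to the model output f(x), as is usual for LRP):

```python
import math

x = [2.0, 1.0]     # x1, x2
w = [-1.0, 1.0]    # w1, w2

z = [xi * wi for xi, wi in zip(x, w)]      # contributions: z1 = -2, z2 = 1
z_sum = sum(z)                             # pre-activation: -1
out = 1.0 / (1.0 + math.exp(-z_sum))       # sigmoid(-1) = 0.2689

# LRP-0: distribute the output relevance R = f(x) proportionally to the z_j
R = [zj / z_sum * out for zj in z]
print(R)   # [0.5378..., -0.2689...]
```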

Maybe the following paper helps to understand why:
WB-LRP (https://www.sciencedirect.com/science/article/abs/pii/S0031320324007076) pointed out that DeepLIFT can be reformulated as LRP where the reference (pre-)activations are all set to 0. This means that for negative pre-activations, the point on the sigmoid y = sigma(x) is compared to the point (0, 0), which does not lie on the sigmoid curve. This leads to a negative slope (y - 0)/(x - 0) for x < 0, y > 0.
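Numerically, that chord slope for the pre-activation z = -1 from the example above is (sigmoid(-1) - 0)/(-1 - 0) = -0.2689, and multiplying the contributions z1, z2 by it reproduces exactly the LRP-0 relevances:

```python
import math

z_sum = -1.0                                   # pre-activation from the example
slope = (1.0 / (1.0 + math.exp(-z_sum)) - 0.0) / (z_sum - 0.0)
print(slope)                                   # -0.2689...: negative whenever z < 0

print([zj * slope for zj in (-2.0, 1.0)])      # [0.5378..., -0.2689...] = R1, R2
```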
