add centered version of RMSProp #1778

ludvigk · 2021-11-23T13:30:58Z

Added centered version of RMSProp, which exists in other frameworks (pytorch, TF).

Not sure if it makes more sense to have a separate CenteredRMSProp (or CRMSProp) struct for the centered version.
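
For readers unfamiliar with the variant, here is a minimal sketch of what "centered" means, assuming the usual formulation behind the PyTorch and TensorFlow options of the same name: besides the running average of squared gradients, a running average of the gradients themselves is kept and its square is subtracted, so the denominator estimates the gradient's variance rather than its uncentered second moment. The function name `centered_rmsprop_step!`, the plain-array state, and the default `ϵ` are illustrative assumptions, not the code in this PR.

```julia
# Hypothetical standalone update step on plain arrays; not Flux's implementation.
function centered_rmsprop_step!(x, g, acc, g_ave;
                                η = 0.001, ρ = 0.9, ϵ = 1e-8, centered = true)
    # Running average of squared gradients (same as plain RMSProp).
    @. acc = ρ * acc + (1 - ρ) * g^2
    if centered
        # Running average of the gradients themselves.
        @. g_ave = ρ * g_ave + (1 - ρ) * g
    end
    # If centered == false and g_ave was initialised to zero,
    # this reduces to the plain RMSProp step.
    @. x -= η * g / (sqrt(acc - g_ave^2) + ϵ)
    return x
end

x     = [1.0, -2.0]
acc   = zeros(2)
g_ave = zeros(2)
centered_rmsprop_step!(x, [0.5, -0.5], acc, g_ave)
```
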

PR Checklist

Tests are added
Entry in NEWS.md
Documentation, if applicable
API changes require approval from a committer (different from the author, if applicable)

DhairyaLGandhi

Thanks for looking into this! I've left a couple minor thoughts.

src/optimise/optimisers.jl

Co-authored-by: Dhairya Gandhi <[email protected]>

ludvigk · 2021-11-23T14:24:04Z

> Thanks for looking into this! I've left a couple minor thoughts.

Thanks, Python brings bad habits.

src/optimise/optimisers.jl

ToucheSir · 2021-11-23T16:42:43Z

src/optimise/optimisers.jl

+  centered::Bool
+  state::IdDict


Do we consider this an API change? Might have to add a deprecation warning + getproperty overload forwarding acc to state.

I assume we'll have to overload setproperty! also then, to avoid issues where the optimiser state is saved to properly resume training? Still, if it's saved in a .bson file, there will be problems. The easiest way to avoid the API change is perhaps to just define the centered version as a different optimiser. I like the idea of just having one optimiser with the centered flag though.

Another option is to create the new RMSProp version under a different name, and use a deprecation warning for RMSProp.

Might just leave acc as it is as the symbol. That would resolve the deprecation. Would there be a problem?

Does that help? The IdDict doesn't contain the same information anymore, so I don't see that helping.

It does contain the same information when centered = false though, which is exactly what we want for backwards compatibility.

My concern was that both renaming and adding fields seem to be SemVer breaks. The former is easier to address, but the latter would not play well with code that uses reflection. Serialization isn't as big of a concern because optimizer state is basically useless once serialized (IdDict references will no longer match up).
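
The `getproperty`/`setproperty!` forwarding discussed in this thread could look roughly like the sketch below. `ForwardingRMSProp` is a made-up stand-in type, and the field list is an assumption based on the diff fragment above; it only illustrates the mechanism, not what Flux does or should do.

```julia
# Hypothetical stand-in for an optimiser whose `acc` field was renamed to `state`.
mutable struct ForwardingRMSProp
    η::Float64
    ρ::Float64
    centered::Bool
    state::IdDict{Any,Any}
end

# Reads of the old name are forwarded to the new field, with a deprecation warning.
function Base.getproperty(o::ForwardingRMSProp, name::Symbol)
    if name === :acc
        Base.depwarn("`acc` is deprecated, use `state` instead", :getproperty)
        return getfield(o, :state)
    end
    return getfield(o, name)
end

# Writes to the old name are forwarded as well.
function Base.setproperty!(o::ForwardingRMSProp, name::Symbol, value)
    if name === :acc
        Base.depwarn("`acc` is deprecated, use `state` instead", :setproperty!)
        return setfield!(o, :state, value)
    end
    # Mirror the default behaviour: convert to the declared field type.
    return setfield!(o, name, convert(fieldtype(ForwardingRMSProp, name), value))
end

opt = ForwardingRMSProp(0.001, 0.9, false, IdDict())
opt.acc === opt.state   # old name still resolves to the same dictionary
```

This covers property access only. Code that inspects `fieldnames` would still see the renamed and added fields, which is the reflection concern raised above.
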

DhairyaLGandhi

Could move centered to a type parameter like RMSProp{true/false} to avoid adding a field to the optimiser.
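
A rough sketch of the type-parameter idea, under the assumption that the existing fields stay `η`, `ρ` and `acc`. `ParamRMSProp` and `iscentered` are hypothetical names used only for illustration.

```julia
# Hypothetical type: the `centered` flag lives in the type, not in a field.
mutable struct ParamRMSProp{centered}
    η::Float64
    ρ::Float64
    acc::IdDict{Any,Any}
end

# Positional constructor for an explicit parameter, e.g. ParamRMSProp{true}().
ParamRMSProp{C}(η = 0.001, ρ = 0.9) where {C} = ParamRMSProp{C}(η, ρ, IdDict())

# Convenience constructor; defaults to the uncentered variant.
ParamRMSProp(η = 0.001, ρ = 0.9; centered::Bool = false) =
    ParamRMSProp{centered}(η, ρ)

# The flag is recovered by dispatch on the type parameter.
iscentered(::ParamRMSProp{C}) where {C} = C

iscentered(ParamRMSProp{true}())   # true
iscentered(ParamRMSProp())         # false
fieldnames(ParamRMSProp)           # (:η, :ρ, :acc)
```

The field list is unchanged in this sketch, although the type itself gains a parameter, so code that matches on the concrete type would still notice the difference.
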

DhairyaLGandhi · 2021-12-24T08:17:02Z

src/optimise/optimisers.jl

+  if o.centered
+    @. Δ_ave = ρ * Δ_ave + (1 - ρ) * Δ
+  end
+  @. Δ *= η / (√(acc - Δ_ave^2) + ϵ)


Suggested change

-  @. Δ *= η / (√(acc - Δ_ave^2) + ϵ)
+  @. Δ *= η / (√(acc - Δ_ave * conj(Δ_ave)) + ϵ)
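
The suggestion presumably targets complex-valued gradients, where squaring and multiplying by the conjugate differ. A small illustration with an arbitrary value:

```julia
z = 3.0 + 4.0im

z^2           # -7.0 + 24.0im : still complex, not a magnitude
z * conj(z)   # 25.0 + 0.0im  : squared magnitude, zero imaginary part
abs2(z)       # 25.0          : the same quantity as a real number
```

For real inputs the two forms agree, so the change does not affect real-valued parameters.
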

mcabbott · 2022-02-05T17:47:15Z

src/optimise/optimisers.jl

@@ -110,39 +110,51 @@ function apply!(o::Nesterov, x, Δ)
 end

 """
-    RMSProp(η = 0.001, ρ = 0.9)
+    RMSProp(η = 0.001, ρ = 0.9, centered = false)


On master there is RMSProp(η::Real, ρ::Real, ϵ::Real).

Probably this option should be controlled by a keyword, matching FluxML/Optimisers.jl#51

mcabbott · 2022-03-05T04:51:16Z

This need not hold up v0.13, since it adds a feature without breaking anything.

cossio · 2022-05-11T12:19:52Z

@ludvigk are you planning to follow up on this?

If not, I'd be willing to open a new PR with this.

ludvigk · 2022-05-12T07:48:59Z

@cossio

> @ludvigk are you planning to follow up on this?
>
> If not, I'd be willing to open a new PR with this.

I forgot about this. Feel free to open a new PR, or I can update this one this weekend.

codecov-commenter · 2022-05-20T10:44:20Z

Codecov Report

Merging #1778 (fbffc5c) into master (1f82da4) will decrease coverage by 0.10%.
The diff coverage is 62.50%.

@@            Coverage Diff             @@
##           master    #1778      +/-   ##
==========================================
- Coverage   87.94%   87.84%   -0.11%     
==========================================
  Files          19       19              
  Lines        1485     1489       +4     
==========================================
+ Hits         1306     1308       +2     
- Misses        179      181       +2

| Impacted Files | Coverage Δ | |
|---|---|---|
| src/optimise/optimisers.jl | `92.77% <62.50%> (-0.98%)` | ⬇️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

ludvigk · 2022-05-22T20:51:38Z

I added the suggestions from here to the pull request. Changed "state" back to "acc", and merged with the new epsilon argument for RMSProp. Not sure what's left?

mcabbott · 2022-05-23T00:32:05Z

I think these two signatures are a bit too do-what-I-mean:

RMSProp(η::Real, ρ::Real, centered::Bool, ϵ::Real)
RMSProp(η::Real, ρ::Real, ϵ::Real)

4 is a lot of positional arguments, especially if their order isn't fixed. I would prefer it to be only a keyword argument. Ideally such a keyword should match FluxML/Optimisers.jl#51 .
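
A sketch of the keyword-only alternative. `KwRMSProp` is a hypothetical type, and the defaults are illustrative (`η` and `ρ` follow the docstring quoted earlier in this PR, the `ϵ` default is an assumption); the actual keyword name would need to match FluxML/Optimisers.jl#51.

```julia
# Hypothetical type for illustration; not the definition in Flux.
struct KwRMSProp
    η::Float64
    ρ::Float64
    ϵ::Float64
    centered::Bool
    acc::IdDict{Any,Any}
end

# Positional arguments keep one fixed order; `centered` can only be passed by keyword.
KwRMSProp(η::Real = 0.001, ρ::Real = 0.9, ϵ::Real = 1e-8; centered::Bool = false) =
    KwRMSProp(η, ρ, ϵ, centered, IdDict())

opt_plain    = KwRMSProp()                         # uncentered, all defaults
opt_centered = KwRMSProp(0.01; centered = true)    # custom η, centered variant
```

With this shape a call like `KwRMSProp(0.01, 0.9, true, 1e-8)` has no matching method, so the ambiguity between the two positional signatures does not arise.
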

add centered version of RMSProp

615f1bc

DhairyaLGandhi reviewed Nov 23, 2021

ludvigk and others added 2 commits November 23, 2021 15:21

Update src/optimise/optimisers.jl

68b8fec

Co-authored-by: Dhairya Gandhi <[email protected]>

Update src/optimise/optimisers.jl

594d7bb

Co-authored-by: Dhairya Gandhi <[email protected]>

ToucheSir reviewed Nov 23, 2021

DhairyaLGandhi reviewed Dec 24, 2021

ToucheSir mentioned this pull request Dec 29, 2021

make eps a parameter of optimisers #1819

Merged

1 task

ToucheSir added this to the v0.13 milestone Feb 4, 2022

ToucheSir mentioned this pull request Feb 5, 2022

v0.13 deprecations #1751

Merged

mcabbott mentioned this pull request Feb 5, 2022

Centred RMSProp FluxML/Optimisers.jl#51

Merged

mcabbott reviewed Feb 5, 2022

mcabbott added the enhancement label Feb 20, 2022

mcabbott removed this from the v0.13 milestone Mar 5, 2022

Reverted back from state to acc

be133a6

ludvigk force-pushed the master branch from e21e79f to be133a6 Compare May 20, 2022 09:23

ludvigk added 2 commits May 20, 2022 11:29

Fixed missing end

0af2a1a

Fixed wrong epsilon reference

fbffc5c
