add cosine restart learning rate #2953

Open

wants to merge 3 commits into base: devel
Conversation

hellozhaoming

No description provided.

@codecov

codecov bot commented Oct 27, 2023

Codecov Report

Attention: 39 lines in your changes are missing coverage. Please review.

Comparison: base (2fe6927) 75.36% vs. head (05052c1) 75.07%.
Report is 16 commits behind head on devel.

Additional details and impacted files
@@            Coverage Diff             @@
##            devel    #2953      +/-   ##
==========================================
- Coverage   75.36%   75.07%   -0.30%     
==========================================
  Files         245      220      -25     
  Lines       24648    20297    -4351     
  Branches     1582      903     -679     
==========================================
- Hits        18577    15238    -3339     
+ Misses       5140     4526     -614     
+ Partials      931      533     -398     
Files Coverage Δ
deepmd/common.py 83.65% <ø> (ø)
deepmd/utils/argcheck.py 96.16% <100.00%> (+0.08%) ⬆️
deepmd/train/trainer.py 84.56% <54.54%> (-0.50%) ⬇️
deepmd/utils/learning_rate.py 48.57% <22.72%> (-43.74%) ⬇️

... and 62 files with indirect coverage changes


"""Get the start lr."""
return self.start_lr_

def value(self, step: int) -> float:
Collaborator

You may not need to implement the value method if you do not print the learning-rate information at the beginning of the training:
https://github.com/hellozhaoming/deepmd-kit/blob/05052c195308f61b63ce2bab130ce0e8cba60604/deepmd/train/trainer.py#L566
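For context, a minimal sketch of what such a value method could compute for a plain cosine schedule; stop_lr_ and decay_steps_ are hypothetical attribute names here, only start_lr_ appears in the diff above:

```python
import numpy as np


# Sketch only, not the PR's actual code.
def value(self, step: int) -> float:
    """Return the learning rate at a given training step (NumPy mirror of the TF schedule)."""
    alpha = self.stop_lr_ / self.start_lr_            # floor as a fraction of the start lr
    step = min(step, self.decay_steps_)               # clamp once the schedule has finished
    cosine_decay = 0.5 * (1.0 + np.cos(np.pi * step / self.decay_steps_))
    decayed = (1.0 - alpha) * cosine_decay + alpha
    return self.start_lr_ * decayed
```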

@wanghan-iapcm changed the base branch from master to devel on October 27, 2023, 13:00
@njzjz (Member) left a comment


Please run pre-commit to format and lint the code: https://docs.deepmodeling.com/projects/deepmd/en/master/development/coding-conventions.html#run-scripts-to-check-the-code. Or you can submit from a non-protected branch and let pre-commit.ci do it for you.

Unit tests should be added for the two new learning rate classes.
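As a starting point, a hedged sketch of what such a test could look like; LearningRateCos, its constructor arguments, and the value method it exercises are assumptions for illustration rather than the PR's actual API:

```python
import unittest

import numpy as np

# Hypothetical import: the actual class name added by this PR may differ.
from deepmd.utils.learning_rate import LearningRateCos


class TestCosineLearningRate(unittest.TestCase):
    def test_value_matches_formula(self):
        start_lr, stop_lr, decay_steps = 1.0e-3, 1.0e-5, 1000
        lr = LearningRateCos(start_lr=start_lr, stop_lr=stop_lr, decay_steps=decay_steps)
        alpha = stop_lr / start_lr
        for step in (0, 100, 500, 999, 1000, 2000):
            # Reference value computed directly from the cosine-decay formula.
            clamped = min(step, decay_steps)
            cosine = 0.5 * (1.0 + np.cos(np.pi * clamped / decay_steps))
            expected = start_lr * ((1.0 - alpha) * cosine + alpha)
            self.assertAlmostEqual(lr.value(step), expected, places=12)


if __name__ == "__main__":
    unittest.main()
```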

@@ -125,6 +125,7 @@ def gelu_wrapper(x):
"softplus": tf.nn.softplus,
"sigmoid": tf.sigmoid,
"tanh": tf.nn.tanh,
"swish": tf.nn.swish,
Member


It seems that swish has been renamed to silu: tensorflow/tensorflow#41066
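If older TensorFlow versions still need to be supported, one possible (hedged) way to handle the rename is to resolve the attribute at lookup time; the dictionary name follows the diff above:

```python
import tensorflow as tf

# tf.nn.swish was renamed to tf.nn.silu (tensorflow/tensorflow#41066); prefer the new
# name and fall back to the old one on TensorFlow versions that only ship swish.
ACTIVATION_FN_DICT = {
    "tanh": tf.nn.tanh,
    "swish": getattr(tf.nn, "silu", None) or getattr(tf.nn, "swish", None),
}
```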

)
else:
for fitting_key in self.fitting:
if self.lr_type == "exp":
Member


It's not good practice to switch on the learning-rate type in the Trainer. Instead, implement the method LearningRate.log_start (LearningRate should be an abstract base class inherited by all learning-rate classes) and call self.lr.log_start(self.sess) here.
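A rough sketch of the suggested structure, assuming the build/start_lr method names used by the existing exponential schedule; the logging details are placeholders:

```python
import logging
from abc import ABC, abstractmethod

log = logging.getLogger(__name__)


class LearningRate(ABC):
    """Base class that every learning-rate schedule inherits from (sketch only)."""

    @abstractmethod
    def build(self, global_step, stop_step=None):
        """Return the TF tensor holding the learning rate at global_step."""

    @abstractmethod
    def start_lr(self) -> float:
        """Return the learning rate at step 0."""

    def log_start(self, sess):
        """Log schedule information at the start of training.

        The default only prints the starting learning rate; subclasses can
        override it (and use sess to evaluate their lr tensor), so the Trainer
        simply calls self.lr.log_start(self.sess) without switching on lr_type.
        """
        log.info("start training at lr %.2e", self.start_lr())
```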

[Argument("exp", dict, learning_rate_exp())],
[Argument("exp", dict, learning_rate_exp()),
Argument("cos", dict, learning_rate_cos()),
Argument("cosrestart", dict, learning_rate_cosrestarts())],
Member


You may need to add some documentation to variants (doc="xxx"). Otherwise, no one knows what they are.
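For instance, something along these lines; the doc strings are placeholders and the surrounding Variant call is shown only for context, so the real argcheck.py layout may differ:

```python
from dargs import Argument, Variant

# Placeholder doc strings; adjust the wording to whatever the schedules actually do.
doc_lr_exp = "Exponentially decaying learning rate."
doc_lr_cos = "Cosine-decay learning rate, decaying from start_lr to a final value."
doc_lr_cosrestart = "Cosine-decay learning rate with periodic warm restarts."

variant = Variant(
    "type",
    [
        Argument("exp", dict, learning_rate_exp(), doc=doc_lr_exp),
        Argument("cos", dict, learning_rate_cos(), doc=doc_lr_cos),
        Argument("cosrestart", dict, learning_rate_cosrestarts(), doc=doc_lr_cosrestart),
    ],
    optional=True,
    default_tag="exp",
)
```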

Comment on lines +113 to +118
```python
global_step = min(global_step, decay_steps)
cosine_decay = 0.5 * (1 + cos(pi * global_step / decay_steps))
decayed = (1 - alpha) * cosine_decay + alpha
decayed_learning_rate = learning_rate * decayed
```
Member



The function returns the cosine decayed learning rate while taking into account
possible warm restarts.
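For reference, TensorFlow ships a warm-restart schedule that can be driven directly from the global step; a hedged sketch of how a cosrestart build method might wrap it (the attribute names and the t_mul/m_mul defaults are assumptions, not this PR's actual code):

```python
import tensorflow.compat.v1 as tf


def build(self, global_step, stop_step=None):
    # Sketch only: first_decay_steps_, start_lr_ and stop_lr_ are assumed attributes.
    return tf.train.cosine_decay_restarts(
        learning_rate=self.start_lr_,
        global_step=global_step,
        first_decay_steps=self.first_decay_steps_,
        t_mul=2.0,   # each restart period lasts twice as long as the previous one
        m_mul=1.0,   # restart from the full initial learning rate
        alpha=self.stop_lr_ / self.start_lr_,  # floor, as a fraction of start_lr
        name="cosine_restart_learning_rate",
    )
```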
(quoted diff line: a stray code-fence marker left at the end of the docstring)

Member

This line should be removed.
