Move function `make_optimizer_and_scheduler` to policy #401

michel-aractingi · 2024-09-02T08:05:17Z

What this does

Move function make_optimizer_and_scheduler in train.py to the policy class. The function becomes increasingly long as we add new policies. After this PR the optimizer and scheduler of each policy can be created by calling policy.make_optimizer_and_scheduler().

How it was tested

Tested by running a training command for diffusion policy, act and tdmpc to make sure there are not syntax issues. Also the unit tests in test policies.

#ACT
python lerobot/scripts/train.py     policy=act     env=aloha     env.task=AlohaInsertion-v0 dataset_repo_id=lerobot/aloha_sim_insertion_human

#diffusion
python lerobot/scripts/train.py    hydra.run.dir=outputs/train/diffusion_pusht   hydra.job.name=diffusion_pusht   policy=diffusion env=pusht   env.task=PushT-v0   dataset_repo_id=lerobot/pusht device=cuda

#tdmpc 
python lerobot/scripts/train.py env=pusht policy=tdmpc_pusht device=cuda

Cadene

FYI I wanted to avoid this design (method from the policy instantiating the optimizer), but I dont think we can ^^

Cadene · 2024-09-02T08:31:10Z

tests/test_policies.py

-    optimizer, _ = make_optimizer_and_scheduler(cfg, policy)
+    optimizer, _ = policy.make_optimizer_and_scheduler(cfg)


Could we add optimizer, _ = policy.make_optimizer_and_scheduler(cfg) to another place in our unit tests?

It feels like this code logic should be tested for all policies, not just act. Thanks ;)

@Cadene Should we change test_act_backbone_lr to a more general function for all policies like:

@pytest.mark.parametrize( "env_name,policy_name", [ ("pusht", "tdmpc"), ("pusht", "diffusion"), ("pusht", "vqbet"), ("aloha", "act") ], ) def test_policy_backbone_lr(env_name, policy_name): """ Test that the ACT policy can be instantiated with a different learning rate for the backbone. """ cfg = init_hydra_config( DEFAULT_CONFIG_PATH, overrides=[ f"env={env_name}", f"policy={policy_name}", f"device={DEVICE}", "training.lr_backbone=0.001", "training.lr=0.01", ], ) ....

alexander-soare

Thanks for drafting this up @michel-aractingi.

So, thinking about this more deeply, I don't think the policy should return the optimizer and scheduler objects. This is because someone should be able to freely decide what these should be. Eg: Want to use exponential decaying LR on ACT? Fine.

What I do think should be provided by the policy is a list of parameter groups for optimization (which the user may use or may ignore - but they are there as a suggestion and for convenience).

So, supposing you agree with me, I would consider:

Changing the method name to get_optimizer_param_groups, and returning a list of param groups for an optimizer.
Using a standard training.lr_scheduler parameter in all policies. It can be None.
Using a standard optimizer parameter in all policies. It is required.
Using Hydra's class instantiatiation tooling to handle creating the optimizer and scheduler objects. https://hydra.cc/docs/advanced/instantiate_objects/overview/
- And using the param groups from the policy to pass to the optimizer class on instantiation.
What we would achieve with this is 1 common interface for all policies for setting the optimizer and scheduler via the hydra config.

michel-aractingi added 2 commits September 2, 2024 07:53

moved make optimizer and scheduler function to inside policy

bbce0ea

modified tests dirs

3034272

michel-aractingi requested a review from alexander-soare September 2, 2024 08:05

pass entire config to make_optimizer

06fc9b8

Cadene reviewed Sep 2, 2024

View reviewed changes

alexander-soare reviewed Sep 2, 2024

View reviewed changes

alexander-soare self-assigned this Sep 2, 2024

michel-aractingi marked this pull request as draft September 6, 2024 09:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move function `make_optimizer_and_scheduler` to policy #401

Move function `make_optimizer_and_scheduler` to policy #401

michel-aractingi commented Sep 2, 2024

Cadene left a comment

Cadene Sep 2, 2024

michel-aractingi Sep 2, 2024

alexander-soare left a comment

		optimizer, _ = make_optimizer_and_scheduler(cfg, policy)
		optimizer, _ = policy.make_optimizer_and_scheduler(cfg)

Move function make_optimizer_and_scheduler to policy #401

Are you sure you want to change the base?

Move function make_optimizer_and_scheduler to policy #401

Conversation

michel-aractingi commented Sep 2, 2024

What this does

How it was tested

Cadene left a comment

Choose a reason for hiding this comment

Cadene Sep 2, 2024

Choose a reason for hiding this comment

michel-aractingi Sep 2, 2024

Choose a reason for hiding this comment

alexander-soare left a comment

Choose a reason for hiding this comment

Move function `make_optimizer_and_scheduler` to policy #401

Move function `make_optimizer_and_scheduler` to policy #401