[Update] Optimizer visualization improved #345

AidinHamedi · 2025-02-11T19:27:22Z

Problem (Why?)

The original code's output plots were highly unstable and failed to fully demonstrate the optimizers' potential. The visualizations lacked clarity, and the optimization process often produced erratic results, making it difficult to evaluate the true performance of each optimizer. Additionally, the loss system did not account for boundary violations.

Solution (What/How?)

Code Refactoring:
- Added detailed comments and docstrings to improve code readability and maintainability.
- Organized configuration constants (e.g., OPTIMIZERS_IGNORE, OPTIMIZATION_STEPS) at the top of the script for easier tuning and experimentation.
- Improved the structure of the code by separating concerns into distinct sections (e.g., configuration, test functions, optimization logic, visualization).
Enhanced Loss System:
- Introduced a boundary penalty to penalize solutions that go outside the valid range, ensuring the optimizers stay within the defined search space.
- Combined the final position loss (distance to the global minimum) with the average loss during optimization to encourage smoother and more stable convergence.
Improved Visualization:
- Added markers for the global minimum and final position to make the plots more informative.
- Included optimizer hyperparameters in the plot title for better context.

Other Changes (Bug Fixes, Small Refactors)

Rastrigin Function Enhancement:
- Added an option to make the Rastrigin function more challenging by setting DIFFICULT_RASTRIGIN to True. This changes the initial_state to a harder starting location and introduces noise to the function output.
Hyper Param Tunning:
- Added support for other hyper params not just lr like momentum etc...

Notes

While the majority of the optimizers now produce clean and stable plots, there are still a few that exhibit instability or unclear behavior. These include:

BSAM
ASGD
MSVAG
DAdaptAdam
Muon
AliG: execute_steps function modifying best_params

kozistr · 2025-02-13T06:32:00Z

@AidinHamedi, thanks for your awesome work! appreciate it :)

review page is so lacky cuz of lots of changes, so leave some reviews here

Your code and naming are already readable and intuitive enough, and all functions have a docstring. So, in my opinion, adding extra comments is not necessary. could you please remove all comments?
question about calculating the penalty (line 348), which is 75 * total_violation. could you give me more details about the constant 75? did you get this value empirically or just set a decent value?
max_queue_len to 6. I was just wondering, did you see any speed up by setting max_queue_len to 6?

others look great to me!

AidinHamedi · 2025-02-13T09:47:43Z

I really appreciate your feedback. Let me address your comments:

Removing comments: I will remove the unnecessary comments as you recommended. The code and naming are already clear and intuitive, so I agree that additional comments may be excessive.
Penalty calculation: The constant 75 was selected somewhat randomly as a reasonable multiplier for penalizing boundary violations. It wasn’t based on empirical data but was meant to provide a strong enough penalty to deter violations while keeping it proportional to the severity of the issue. If you have ideas for a more data-driven method or a better value, I’m open to making adjustments!
max_queue_len set to 6: You’re absolutely correct—after retesting, I found that setting max_queue_len to 6 doesn’t actually enhance the tuning speed. This was due to inadequate testing on my part, and I’ll either revert this change or modify it based on more thorough testing.

…ptimizer into Vis-Update

kozistr · 2025-02-13T11:20:26Z

I really appreciate your feedback. Let me address your comments:

Removing comments: I will remove the unnecessary comments as you recommended. The code and naming are already clear and intuitive, so I agree that additional comments may be excessive.

Penalty calculation: The constant 75 was selected somewhat randomly as a reasonable multiplier for penalizing boundary violations. It wasn’t based on empirical data but was meant to provide a strong enough penalty to deter violations while keeping it proportional to the severity of the issue. If you have ideas for a more data-driven method or a better value, I’m open to making adjustments!

max_queue_len set to 6: You’re absolutely correct—after retesting, I found that setting max_queue_len to 6 doesn’t actually enhance the tuning speed. This was due to inadequate testing on my part, and I’ll either revert this change or modify it based on more thorough testing.

1, 3: checked!
2: if 75 is a reasonable multiplier, then I think it's okay to go. I just want to know if is there any insight or method you use :)

Lastly, could you please run make run & make check?

AidinHamedi · 2025-02-14T07:33:29Z

For make run & make check: I reformatted the code to mitigate errors on check. However, the Makefile does not have a run target.
Regarding the number 75: I wanted a number that was big enough to highly discourage going outside of the boundaries. A higher value like 100 should have worked, but somewhat randomly, I chose the number 75 and thought that would be enough. No complicated reasoning was involved. Changing it to something like 50 or 100 doesn’t make a massive difference.

Hope that clarifies everything!

codecov · 2025-02-14T11:01:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (b82f7c4) to head (b35b22d).
Report is 7 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff            @@
##              main      #345   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files          111       111           
  Lines         8891      8891           
=========================================
  Hits          8891      8891

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kozistr · 2025-02-14T11:02:29Z

For make run & make check: I reformatted the code to mitigate errors on check. However, the Makefile does not have a run target.

Regarding the number 75: I wanted a number that was big enough to highly discourage going outside of the boundaries. A higher value like 100 should have worked, but somewhat randomly, I chose the number 75 and thought that would be enough. No complicated reasoning was involved. Changing it to something like 50 or 100 doesn’t make a massive difference.

Hope that clarifies everything!

oh, sorry for my typo. it was make format, not run lol

thanks for the clarification and thank you again for your contribution!

AidinHamedi and others added 3 commits February 11, 2025 22:20

update: visualize_optimizers

4cf8cd2

Update visualize_optimizers.py

9fa53e3

Update visualize_optimizers.py

e3a4887

AidinHamedi requested a review from kozistr as a code owner February 11, 2025 19:27

kozistr assigned AidinHamedi Feb 13, 2025

kozistr added documentation Improvements or additions to documentation enhancement New feature or request labels Feb 13, 2025

AidinHamedi added 2 commits February 13, 2025 13:19

Code refactor

f6a3c64

Merge branch 'Vis-Update' of https://github.com/AidinHamedi/pytorch_o…

f4e4597

…ptimizer into Vis-Update

Code format

b35b22d

kozistr merged commit 5f4e62f into kozistr:main Feb 14, 2025
4 checks passed

AidinHamedi deleted the Vis-Update branch February 14, 2025 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Update] Optimizer visualization improved #345

[Update] Optimizer visualization improved #345

AidinHamedi commented Feb 11, 2025 •

edited

Loading

kozistr commented Feb 13, 2025

AidinHamedi commented Feb 13, 2025

kozistr commented Feb 13, 2025

AidinHamedi commented Feb 14, 2025

codecov bot commented Feb 14, 2025 •

edited

Loading

kozistr commented Feb 14, 2025

[Update] Optimizer visualization improved #345

[Update] Optimizer visualization improved #345

Conversation

AidinHamedi commented Feb 11, 2025 • edited Loading

Problem (Why?)

Solution (What/How?)

Other Changes (Bug Fixes, Small Refactors)

Notes

kozistr commented Feb 13, 2025

AidinHamedi commented Feb 13, 2025

kozistr commented Feb 13, 2025

AidinHamedi commented Feb 14, 2025

codecov bot commented Feb 14, 2025 • edited Loading

Codecov Report

kozistr commented Feb 14, 2025

AidinHamedi commented Feb 11, 2025 •

edited

Loading

codecov bot commented Feb 14, 2025 •

edited

Loading