How to sync distributed model parameters when training in a continual-learning fashion? #3421

Open
Iranb opened this issue Mar 5, 2025 · 0 comments

Iranb commented Mar 5, 2025

When performing distributed continual learning tasks, it is common to expand the model's parameters as new tasks arrive. For example, I have defined an expand_classifier() method that grows the classifier and randomly initializes the newly added parameters.
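
For context, the expand_classifier() I mean is along the lines of the sketch below (a simplified illustration, not my exact code; the class and attribute names are just placeholders):

import torch
import torch.nn as nn

class PromptClassifier(nn.Module):
    def __init__(self, in_features, num_classes):
        super().__init__()
        self.classifier = nn.Linear(in_features, num_classes)

    def expand_classifier(self, new_classes=1):
        # Build a larger output layer, copy the existing weights into it,
        # and leave the newly added rows with their fresh random init.
        old = self.classifier
        new = nn.Linear(old.in_features, old.out_features + new_classes)
        with torch.no_grad():
            new.weight[: old.out_features].copy_(old.weight)
            new.bias[: old.out_features].copy_(old.bias)
        self.classifier = new.to(old.weight.device)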

How can I ensure that the newly added parameters are initialized identically on every GPU replica?

If I do the following:

if self.accelerator.is_main_process:
    self.model.module.prompt.expand_classifier()

how can I sync the expanded classifier across all of the distributed model replicas?
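
One approach I am considering (an unverified sketch using torch.distributed.broadcast; it assumes the expanded layer is reachable at model.module.prompt.classifier, which may not match everyone's code) is to run the expansion on every rank and then broadcast rank 0's values:

import torch
import torch.distributed as dist

def expand_and_sync_classifier(model):
    # Every rank calls expand_classifier() so that all replicas end up
    # with parameters of the same shape (each with its own random init).
    model.module.prompt.expand_classifier()

    # Overwrite every rank's random init with rank 0's values so the
    # expanded classifier is identical on all GPUs.
    with torch.no_grad():
        for param in model.module.prompt.classifier.parameters():
            dist.broadcast(param.data, src=0)

As far as I understand, DDP registers parameters when the model is wrapped, so I would probably also need to re-wrap / re-prepare the model after expanding it for gradients of the new parameters to be synchronized. Is that the right way to do it, or does accelerate provide something for this?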
