Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: (1) skipped last step (2) redundant validation and logging #409

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

dirtyDan0
Copy link

This PR solves these 2 following problems.

  1. Last step skipped

    self.global_steps += 1 before if self.global_steps >= self.total_training_steps makes the last step skipped.

    We start from step 1, and we expect self.total_training_steps in total.

    self.global_steps += 1
    if self.global_steps >= self.total_training_steps:

    When self.global_steps == self.total_training_steps-1:

    • we have only executed self.total_training_steps-1 steps

    • self.global_steps is updated to self.total_training_steps

    • self.global_steps >= self.total_training_steps is satisfied, and the training ends.

    Therefore, we should put self.global_steps += 1 at last

  2. redundant validation and logging

    If self.total_training_steps % self.config.trainer.test_freq == 0 :

@CLAassistant
Copy link

CLAassistant commented Feb 27, 2025

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants