Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to generate exemplars combination after training. #4

Open
LeonChengg opened this issue Feb 26, 2025 · 3 comments
Open

How to generate exemplars combination after training. #4

LeonChengg opened this issue Feb 26, 2025 · 3 comments

Comments

@LeonChengg
Copy link

After executing the train_example_selection.py script, I obtained the checkpoint files for the trained model. However, I am unsure how to generate and output the exemplars combination using this model. Could you provide guidance on the necessary steps?

@SHUMKASHUN
Copy link
Owner

Thanks for the interest in our work. Basically after training the model, you can check the self.sample_probs which contains the probability distribution for each examplar entry (i.e. 8 examplars for gsm8k). Then you take the argmax (just as the code in validation_step), you will get the index for exemplars combination.

@LeonChengg
Copy link
Author

I ran the training for 10 epochs, but unfortunately sadly, it still hasn’t converged, and the validation accuracy remains at 0.3. 😞

@SHUMKASHUN
Copy link
Owner

For most experiments we did with code-davinci-002 and text-davinci, they normally converge (or doesn't change much) at 4,5 epoches. Maybe you need to check if the variance-reduced algorithm works properly, i.e., produce meaningful loss. Because I remember for 10 runs (1 iteration update) need to have different accuracy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants