Question related to how to use the validation and training splits. #2423

Open
sorobedio opened this issue Oct 24, 2024 · 3 comments
Labels
asking questions For asking for clarification / support on library usage.

Comments

@sorobedio

Hello, I would like to know how I can use the validation split to evaluate the models, and similarly, how to use the training split for evaluation if needed. I haven't found an option where the user can specify the dataset split they want to use for model evaluation. Could you provide guidance on how to set this up?
Thank you.

@baberabb
Contributor

baberabb commented Oct 24, 2024

Hi! You can change which splits are used in the task YAML files. Evaluation uses the test split if one is provided; otherwise it uses the validation split. Example:

training_split: train
validation_split: validation
test_split: test

If a fewshot_split is not provided, the few-shot examples are drawn using the priority training > validation > test.
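
The two priority rules above can be sketched in a few lines of Python. This is only an illustration of the behaviour described in this comment, not the library's actual code: the function names `pick_eval_split` and `pick_fewshot_split` are hypothetical, and the config is assumed to be a plain dict holding the same keys as the task YAML.

```python
from typing import Optional


def pick_eval_split(config: dict) -> Optional[str]:
    """Hypothetical helper: evaluation prefers the test split, then validation."""
    return config.get("test_split") or config.get("validation_split")


def pick_fewshot_split(config: dict) -> Optional[str]:
    """Hypothetical helper: an explicit fewshot_split wins, then training > validation > test."""
    for key in ("fewshot_split", "training_split", "validation_split", "test_split"):
        if config.get(key):
            return config[key]
    return None


# Config mirroring the YAML example above.
full = {"training_split": "train", "validation_split": "validation", "test_split": "test"}
# Same config with test_split left out.
no_test = {"training_split": "train", "validation_split": "validation"}

print(pick_eval_split(full))      # test split is present, so it is used
print(pick_eval_split(no_test))   # no test split, so validation is used
print(pick_fewshot_split(full))   # no fewshot_split, so training is used
```

Going by the rule as stated, a task YAML that leaves out test_split would be evaluated on the validation split.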

@sorobedio
Author

I see. So the validation set is never used when both the training and test sets are present. Is the Open LM leaderboard following the same approach?
Thank you.

@baberabb baberabb added the asking questions For asking for clarification / support on library usage. label Oct 25, 2024
@baberabb
Contributor

> I see. So the validation set is never used when both the training and test sets are present. Is the Open LM leaderboard following the same approach? Thank you.

yes!


2 participants