Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LM1B: Jax #621

Open
6 tasks
dvsaisurya opened this issue Jan 24, 2024 · 0 comments
Open
6 tasks

LM1B: Jax #621

dvsaisurya opened this issue Jan 24, 2024 · 0 comments
Labels
✨ Feature Request Request for a new feature or enhancement of an existing one Future Version

Comments

@dvsaisurya
Copy link

Workload

LM1B

Task

Transformer language model training, resembling https://github.com/google/flax/tree/main/examples/lm1b .

Dataset

LM1B dataset. https://www.tensorflow.org/datasets/catalog/lm1b

Model

decoder-only Transformer, encoder+decoder based Transformer.

Reference Implementation

https://github.com/google/flax/tree/main/examples/lm1b


ToDo

  • Implement data input pipeline
    • Document specific dataset version in workload-specific README
  • Add model
    • Document model in workload-specific README
  • Provide sample submission (and sample tuning search space)
    • Document results of sample submission in workload-specific README (how long did it take to reach the target performance)
@priyakasimbeg priyakasimbeg added ✨ Feature Request Request for a new feature or enhancement of an existing one Future Version labels Mar 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
✨ Feature Request Request for a new feature or enhancement of an existing one Future Version
Projects
None yet
Development

No branches or pull requests

2 participants