Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature(Forge): Complete Bullet Dataset for Training #98

Open
1 of 3 tasks
justinthelaw opened this issue Jul 13, 2023 · 0 comments · Fixed by #173
Open
1 of 3 tasks

Feature(Forge): Complete Bullet Dataset for Training #98

justinthelaw opened this issue Jul 13, 2023 · 0 comments · Fixed by #173
Assignees
Labels
feature New feature or request

Comments

@justinthelaw
Copy link
Owner

justinthelaw commented Jul 13, 2023

Generate a data set that will be used to train a model for generation of Bullet Forge training data. Generally speaking, this model acts as a bullet interpreter that will take-in a set of the bullets that have been scraped form the internet, and spit back out generalized and context-rich sentences. This "bullet interpreter" will act as a standalone model capable of translating any bullet into easily comprehendible achievements that look as if they have been written by an Airman or Guardian.

There are two tasks involved with this issue:

  • Generate a first run of this bullet interpreter model, v0.1.0, using a corpus of 500 select bullets that have been run through ChatGPT
  • Generate a further 1000 select bullets and interpreted completions, totaling 1500 now, with human-cleaning involved to produce a v1.0.0 of the bullet interpreter
  • Use v1.0.0 of the bullet interpreter to generate all 33,600+ bullets and bullet interpretations to form the final training set for the bullet generation models
@justinthelaw justinthelaw converted this from a draft issue Jul 13, 2023
@justinthelaw justinthelaw self-assigned this Jul 13, 2023
@justinthelaw justinthelaw added feature New feature or request help wanted Extra attention is needed good first issue Good for newcomers labels Jul 13, 2023
@justinthelaw justinthelaw changed the title Feature(BulletForge): Complete Dataset for Training Feature(Forge): Complete Bullet Dataset for Training Jul 24, 2023
@justinthelaw justinthelaw removed help wanted Extra attention is needed good first issue Good for newcomers labels Aug 29, 2023
@justinthelaw justinthelaw removed this from Opera Aug 30, 2023
@justinthelaw justinthelaw moved this to 🏗 In Progress in Opera Aug 30, 2023
@justinthelaw justinthelaw linked a pull request Sep 14, 2023 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request
Projects
Status: 🏗 In Progress
3 participants