Re-implementation of https://arxiv.org/pdf/2402.04494 (focus on Figure A6)
- decoder transformer architecture (Llama)
- 9M parameters (context 78 or 79, 8 heads, 8 layers, embedding dim 256)
- use HF Llama implementation instead of llm.c in RookWorld (see the config sketch after this list)
- Likely hyperparams: LR 4e-4, BS 1024 (4x TPU v5 @ 95G)
- 40M data samples, BS 1024, 5e6 steps
- 14.7% train/test overlap!
- Predictor: base BC (State-Action)
- Also: SV (State-Value from E)
- Limited: AV (Action-Value per legal move; we only have the top-5 M/E, not all moves; see the target-format sketch after this list)
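
A minimal sketch of this setup with the Hugging Face Llama implementation, using the sizes listed above (8 layers, 8 heads, embedding dim 256, context ~79); `vocab_size` and `intermediate_size` are assumptions, not values from the paper:

```python
# Minimal sketch of the ~9M-parameter Llama via Hugging Face transformers.
# vocab_size and intermediate_size are assumptions, not values from the paper.
from transformers import LlamaConfig, LlamaForCausalLM

config = LlamaConfig(
    vocab_size=1024,             # assumption: small chess-specific vocab
    hidden_size=256,             # embedding dim from the notes above
    intermediate_size=1024,      # assumption: 4x hidden size
    num_hidden_layers=8,
    num_attention_heads=8,
    max_position_embeddings=79,  # context 78|79 without CoT
)
model = LlamaForCausalLM(config)
print(sum(p.numel() for p in model.parameters()))  # roughly 9M with this configuration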
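A sketch of the three predictor target formats, assuming FEN-serialized states and UCI moves; the serialization and the evaluation numbers shown are purely illustrative:

```python
# Sketch of the three predictor target formats. The serialization and the
# numeric values are illustrative assumptions, not the dataset's exact format.
state = "r1bqkbnr/pppp1ppp/2n5/4p3/4P3/5N2/PPPP1PPP/RNBQKB1R w KQkq - 2 3"  # example FEN

# BC (behavioral cloning): State -> Action (best move in UCI notation)
bc_sample = {"input": state, "target": "f1b5"}

# SV: State -> Value (position evaluation E, e.g. as a win probability)
sv_sample = {"input": state, "target": "0.52"}

# AV: Action-Value per legal move; here only the top-5 moves with evals (M+E)
av_sample = {
    "input": state,
    "targets": {"f1b5": 0.52, "f1c4": 0.50, "d2d4": 0.49, "b1c3": 0.47, "d2d3": 0.45},
}
```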
Experiments:
- Text Classification (instead of CLM as in RookWorld); switch the names ROOK and RookWorld: a classification model cannot be a world model (see the classification sketch after this list)
- Dataset: Lichess Games + Stockfish Self-Play + Optional Puzzles + Optional CoT
- Context: 78|79 without CoT, ~170 with CoT
- Multi-Task Tokenization (Arbiter)
- Distillation with Soft Tokens (M+E): KD loss vs. CrossEntropyLoss with probability targets (see the loss sketch after this list)
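
A sketch of the two loss variants, assuming the top-5 (M, E) pairs have already been converted into a probability distribution over the action space:

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_probs):
    """KL divergence between student and soft teacher distribution (KD loss)."""
    log_p = F.log_softmax(student_logits, dim=-1)
    return F.kl_div(log_p, teacher_probs, reduction="batchmean")

def soft_ce_loss(student_logits, teacher_probs):
    """CrossEntropyLoss with probability targets (supported since PyTorch 1.10)."""
    return F.cross_entropy(student_logits, teacher_probs)

# Assumption: teacher_probs is built from the top-5 (M, E) pairs, e.g. a softmax
# over the five engine evals placed at those action indices and zero elsewhere.
```

The two only differ by the teacher entropy, which is constant with respect to the student, so their gradients match; the choice mainly affects loss logging and where temperature scaling would be applied.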
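For the text-classification experiment above, a sketch of what the classifier variant could look like; `NUM_ACTIONS` and the other config values are assumptions (placeholders for what src/const.py defines):

```python
# Sketch: classification over the action space instead of causal LM.
# NUM_ACTIONS and the config values are assumptions (see src/const.py for the
# actual Action Space and Vocab definitions).
from transformers import LlamaConfig, LlamaForSequenceClassification

NUM_ACTIONS = 1968  # assumption: number of possible UCI moves in the action space

config = LlamaConfig(
    vocab_size=1024,
    hidden_size=256,
    intermediate_size=1024,
    num_hidden_layers=8,
    num_attention_heads=8,
    max_position_embeddings=79,
    num_labels=NUM_ACTIONS,  # one logit per action
    pad_token_id=0,          # needed so the classifier can locate the last non-padding token
)
model = LlamaForSequenceClassification(config)
# Output logits have shape (batch, NUM_ACTIONS); train with cross-entropy on the
# index of the reference move. A pure classifier can only rank actions, which is
# why it cannot double as a world model.
```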
Evals:
- Action Accuracy (see the eval sketch after this list)
- Puzzle Accuracy
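
A sketch of the action-accuracy eval as plain exact match against the reference move; `predict_move` is a hypothetical stand-in for the inference code in src/policy.py:

```python
# Sketch of the action-accuracy eval: exact match between predicted and
# reference move. `predict_move` is a hypothetical stand-in for src/policy.py.
def action_accuracy(model, samples, predict_move):
    correct = 0
    for state, reference_move in samples:  # (FEN, reference UCI move) pairs
        if predict_move(model, state) == reference_move:
            correct += 1
    return correct / max(len(samples), 1)
```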
|- data.py       Load/prepare dataset for training and evaluation
|- eval.py       Evaluate checkpoint on actions, puzzles, checkmate-in-one
|- train.py      Train from scratch
|- download.sh   Download train & test data
|- src/
|  |- model.py   Model & Tokenizer definition
|  |- policy.py  Inference code for evals & (TODO) data generation
|  |- const.py   Constant values like Action Space and Vocab
|- utils/        Data conversion scripts
|- data/         Train & Eval Data
TODO