We use a fork of LLaMA-Factory to perform training.
Step 1: Add the path to the training data, either produced by the scripts in the `tools` directory or provided by us, to the `file_name` field of the `Sky-T1` entry in `LLaMA-Factory/data/dataset_info.json`.
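For reference, the entry might look like the minimal sketch below. The file path is a placeholder, and the `formatting`/`columns` choices are assumptions for illustration; match them to the actual `Sky-T1` entry in the fork and to your data's schema.

```json
{
  "Sky-T1": {
    "file_name": "path/to/your/sky_t1_data.json",
    "formatting": "sharegpt",
    "columns": {
      "messages": "conversations",
      "system": "system"
    }
  }
}
```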
Step 2: Run

```bash
FORCE_TORCHRUN=1 NNODES=1 NODE_RANK=0 MASTER_PORT=29501 \
  llamafactory-cli train examples/train_full/qwen2_full_sft.yaml
```

to fine-tune from a 32B base model on 8 H100 GPUs. Here `FORCE_TORCHRUN=1` forces a `torchrun` launch, while `NNODES`, `NODE_RANK`, and `MASTER_PORT` configure the single-node distributed setup. Interested readers can refer to the detailed settings in `examples/train_full/qwen2_full_sft.yaml`.
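As a rough illustration of what such a full-parameter SFT config contains, here is a sketch in LLaMA-Factory's YAML format. The authoritative values live in `examples/train_full/qwen2_full_sft.yaml` in the fork; every value below (base model, context length, batch sizes, learning rate, DeepSpeed config) is an assumption for illustration, not the repo's actual settings.

```yaml
### model
model_name_or_path: Qwen/Qwen2.5-32B-Instruct  # assumed base model; check the repo's config

### method
stage: sft
do_train: true
finetuning_type: full
deepspeed: examples/deepspeed/ds_z3_config.json  # ZeRO-3 sharding is typical for full 32B SFT on 8 GPUs

### dataset
dataset: Sky-T1
template: qwen
cutoff_len: 16384  # illustrative; long reasoning traces need a large context window
overwrite_cache: true

### output
output_dir: saves/qwen2-32b/full/sft
logging_steps: 10
save_steps: 500

### train
per_device_train_batch_size: 1
gradient_accumulation_steps: 12
learning_rate: 1.0e-5
num_train_epochs: 3.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
```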