From ccae82a90a8f6a9acf0f6bae27be2e128f12cc5a Mon Sep 17 00:00:00 2001 From: Amirhossein Kazemnejad <2122102+kazemnejad@users.noreply.github.com> Date: Wed, 2 Oct 2024 12:20:11 -0400 Subject: [PATCH] Update README.md --- README.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.md b/README.md index ba56ba1..23f260a 100755 --- a/README.md +++ b/README.md @@ -10,6 +10,7 @@ - [Running the experiments](#running-the-experiments) - [Code Structure](#code-structure) - [Initial SFT Checkpoints](#initial-sft-checkpoints) +- [Acknowledgement](#acknowledgement) - [Citation](#citation) @@ -153,6 +154,10 @@ singularity exec --nv \ | DeepSeekMath | [🤗 Deepseekmath-SFT-GSM8K](https://huggingface.co/realtreetune/deepseekmath-7b-sft-GSM8K) | [🤗 Deepseekmath-SFT-MATH](https://huggingface.co/realtreetune/deepseekmath-7b-sft-MATH-v2) | | Rho1B | [🤗 Rhomath-SFT-GSM8K](https://huggingface.co/realtreetune/rho-1b-sft-GSM8K) | [🤗 Rhomath-SFT-MATH](https://huggingface.co/realtreetune/rho-1b-sft-MATH) | +## Acknowledgement + +This is the release codebase for VinePPO. It is developed by [@kazemnejad](https://github.com/kazemnejad) and [@miladink](https://github.com/miladink). + ## Code Structure - [`configs`](https://github.com/McGill-NLP/vineppo/tree/main/configs): Contains Jsonnet files for configuring experiment settings. - [`src/treetune`](https://github.com/McGill-NLP/vineppo/tree/main/src/treetune): The main directory for source code, encompassing: