Code for our EMNLP 2024 paper Preference-Guided Reflective Sampling for Aligning Language Models. We provide the code for generating data with PRS; the generated data can be used in iterative offline RL training for aligning a language model. For comparison, we also include baseline sampling methods such as random sampling.
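At a high level, PRS samples responses guided by an explicit preference, scores them with a reward model, and then asks the policy to refine the current best response over further rounds. The sketch below only illustrates this idea; the `generate` and `reward` callables, the prompt templates, and the `width`/`depth` parameters are hypothetical and do not mirror the actual implementation in this repository.

```python
# Illustrative sketch of preference-guided reflective sampling (PRS).
# `generate` and `reward` are hypothetical stand-ins for the policy model
# (e.g., served with vLLM) and the reward model (e.g., UltraRM-13b).

def prs_sample(prompt, preference, generate, reward, width=4, depth=2):
    """Return the highest-reward response found by iterative refinement."""
    # Round 0: sample initial responses conditioned on the stated preference.
    candidates = [generate(f"{prompt}\nPreference: {preference}") for _ in range(width)]
    best = max(candidates, key=reward)

    # Later rounds: ask the policy to reflect on the current best response
    # and propose refinements, then keep the best-scoring candidate.
    for _ in range(depth):
        refine_prompt = (
            f"{prompt}\nPreference: {preference}\n"
            f"Draft response: {best}\n"
            "Improve the draft so that it better satisfies the preference."
        )
        refinements = [generate(refine_prompt) for _ in range(width)]
        best = max(refinements + [best], key=reward)
    return best
```

The random-sampling baseline roughly corresponds to drawing all candidates in a single round, without the preference-guided refinement steps.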
- Oct. 2024: PRS is accepted to EMNLP 2024 as a main conference paper!
- Jun. 2024: Released the first version of the code for PRS.
To sample responses, in `run_best_of_N.eval_mode.sh` you need to specify:
- the data path of the prompts;
- the policy model, such as Mistral-7B-Instruct-v0.2;
- the reward model, such as UltraRM-13b.

We provide example prompt data from Alpaca (see `data/alpaca_gpt4.dev_set.num=100.w_preference_by_gpt-3.5.jsonl`).
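If you want to inspect the example prompt file before sampling, a small sketch like the following can help; the field names inside each line are not documented here, so the snippet only prints the keys that each JSON object actually contains.

```python
import json

# Peek at the example prompt file shipped with the repository.
# Each line of the JSONL file is one JSON object; the exact keys
# (e.g., a prompt and a preference) should be confirmed by inspection.
path = "data/alpaca_gpt4.dev_set.num=100.w_preference_by_gpt-3.5.jsonl"
with open(path) as f:
    for i, line in enumerate(f):
        record = json.loads(line)
        print(sorted(record.keys()))  # list the available fields
        if i == 2:                    # a few lines are enough
            break
```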
Then run:
`bash run_best_of_N.eval_mode.sh`
For PRS, you will get two files of responses, one with the initial responses and one with the refinements. You can combine them with `combine_for_tree_search.py`:

`python combine_for_tree_search.py path_to_initial_response path_to_refinement path_to_save`
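For reference, a minimal sketch of what such a combination step could look like is shown below; it simply merges the initial responses and the refinements per prompt so that a best-of-N selection can be run over the union. The assumed JSONL fields (`prompt`, `responses`) and the merging logic are hypothetical and may differ from what `combine_for_tree_search.py` actually does.

```python
import json
import sys

# Hypothetical sketch of merging initial responses with their refinements.
# Assumed format: one JSON object per prompt with a "prompt" field and a
# "responses" list; the real files produced by PRS may be structured differently.

def load_jsonl(path):
    with open(path) as f:
        return [json.loads(line) for line in f]

def main(init_path, refine_path, save_path):
    merged = {r["prompt"]: r for r in load_jsonl(init_path)}
    for r in load_jsonl(refine_path):
        # Append refinements to the candidate pool of the matching prompt.
        entry = merged.setdefault(r["prompt"], {"prompt": r["prompt"], "responses": []})
        entry["responses"] = entry.get("responses", []) + r.get("responses", [])
    with open(save_path, "w") as f:
        for record in merged.values():
            f.write(json.dumps(record) + "\n")

if __name__ == "__main__":
    main(sys.argv[1], sys.argv[2], sys.argv[3])
```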
- Install vLLM: we use vLLM to speed up model sampling, so you have to install vLLM from here.
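If you want to check your vLLM setup independently of the sampling script, a minimal generation sketch looks like the following; the model name is just the example used in this README, and the sampling parameters are placeholders rather than the values the script uses.

```python
from vllm import LLM, SamplingParams

# Minimal vLLM smoke test. The model and sampling parameters here are only
# examples; the actual sampling script configures these itself.
llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.2")
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=256)

outputs = llm.generate(["Explain preference-guided sampling in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```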
If you find our code helpful to your work, please cite our paper:
@inproceedings{hai_prs_emnlp,
author = {Ye, Hai and Ng, Hwee Tou},
booktitle = {Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
title = {Preference-Guided Reflective Sampling for Aligning Language Models},
url = {https://arxiv.org/pdf/2408.12163},
year = {2024}
}