# Alignment Task with the trl Library
This repository implements a command-line interface (CLI) for fine-tuning the gpt2-medium model with the trl library. Fine-tuning supports two training options: Reinforcement Learning from Human Feedback (RLHF) using Proximal Policy Optimization (PPO), which requires a reward model, or Direct Preference Optimization (DPO), which does not.
The implementation allows for a single iteration of the following stages:
- SFT (Supervised Fine-Tuning)
- (Optional) Reward Model Training
- RL with PPO or DPO (a DPO sketch follows below)
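As an illustration of the DPO variant of the final stage, here is a minimal sketch written against an older trl `DPOTrainer` interface (`beta` and `tokenizer` passed directly; newer releases move these into `DPOConfig`). The toy preference pairs and hyperparameters are invented for the example; the repo's actual scripts and configs are authoritative:

```python
# Minimal DPO sketch, NOT this repo's actual training script.
# Assumes an older trl release (~0.7) where DPOTrainer accepts beta/tokenizer directly.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

model = AutoModelForCausalLM.from_pretrained("gpt2-medium")
ref_model = AutoModelForCausalLM.from_pretrained("gpt2-medium")  # frozen reference policy
tokenizer = AutoTokenizer.from_pretrained("gpt2-medium")
tokenizer.pad_token = tokenizer.eos_token

# DPO expects pairwise preference data with "prompt", "chosen", and "rejected"
# columns; this toy dataset stands in for preprocessed Anthropic-hh data.
train_dataset = Dataset.from_dict({
    "prompt": ["\n\nHuman: How are you?\n\nAssistant:"],
    "chosen": [" I'm doing well, thanks for asking."],
    "rejected": [" None of your business."],
})

trainer = DPOTrainer(
    model=model,
    ref_model=ref_model,
    beta=0.1,  # strength of the implicit KL penalty toward the reference model
    args=TrainingArguments(
        output_dir="dpo_out",             # placeholder output path
        per_device_train_batch_size=1,
        remove_unused_columns=False,      # DPOTrainer manages its own columns
    ),
    train_dataset=train_dataset,
    tokenizer=tokenizer,
)
trainer.train()
```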
You can use the provided scripts to train different parts of the alignment iteration. For instance, to train the SFT model with a gpt2-medium backbone on the Anthropic-hh dataset, run the following command:
```bash
bash run_sft.sh
```
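For orientation, the SFT stage reduces to something like the sketch below. It assumes an older trl `SFTTrainer` interface where `dataset_text_field` and `max_seq_length` are passed directly (newer releases move them into `SFTConfig`); the sequence length is a placeholder, and the repo's script and config remain authoritative:

```python
# SFT sketch: fine-tune gpt2-medium on the preferred ("chosen") hh-rlhf responses.
from datasets import load_dataset
from trl import SFTTrainer

dataset = load_dataset("Anthropic/hh-rlhf", split="train")

trainer = SFTTrainer(
    model="gpt2-medium",           # trl loads the backbone from the Hub
    train_dataset=dataset,
    dataset_text_field="chosen",   # column used as plain training text
    max_seq_length=512,            # placeholder, not the repo's setting
)
trainer.train()
```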
Similarly, you can execute the other scripts, `run_reward.sh` and `run_rlhf.sh`, to train the reward model and perform RLHF, respectively.
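Under the hood, `run_reward.sh` presumably trains a sequence-classification reward model, and `run_rlhf.sh` runs a PPO loop that scores policy rollouts with it. The sketch below shows the shape of that loop, assuming trl's pre-1.0 PPO interface (`PPOTrainer`, `.generate`, `.step`); the reward-model path, prompt preprocessing, and hyperparameters are placeholders, not the repo's actual settings:

```python
# RLHF-with-PPO loop, sketched against trl's pre-1.0 API; newer trl versions
# restructure this interface. Paths and hyperparameters are placeholders.
import torch
from datasets import load_dataset
from transformers import AutoTokenizer, pipeline
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

config = PPOConfig(model_name="gpt2-medium", batch_size=8, mini_batch_size=2)
tokenizer = AutoTokenizer.from_pretrained(config.model_name)
tokenizer.pad_token = tokenizer.eos_token

# Build query tensors; real preprocessing would cut each dialogue at the human turn.
def tokenize(sample):
    sample["input_ids"] = tokenizer.encode(sample["chosen"])[:64]
    sample["query"] = tokenizer.decode(sample["input_ids"])
    return sample

dataset = load_dataset("Anthropic/hh-rlhf", split="train").map(tokenize)
dataset.set_format(type="torch")

def collator(data):  # keep variable-length queries as lists of tensors
    return {key: [d[key] for d in data] for key in data[0]}

model = AutoModelForCausalLMWithValueHead.from_pretrained(config.model_name)
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained(config.model_name)
ppo_trainer = PPOTrainer(config, model, ref_model, tokenizer,
                         dataset=dataset, data_collator=collator)

# "reward_out" stands in for the checkpoint produced by run_reward.sh.
reward_pipe = pipeline("text-classification", model="reward_out")
gen_kwargs = {"max_new_tokens": 64, "do_sample": True,
              "pad_token_id": tokenizer.eos_token_id}

for batch in ppo_trainer.dataloader:
    query_tensors = batch["input_ids"]
    # Roll out responses from the current policy.
    response_tensors = ppo_trainer.generate(query_tensors, return_prompt=False,
                                            **gen_kwargs)
    texts = [q + tokenizer.decode(r) for q, r in zip(batch["query"], response_tensors)]
    # Score each prompt+response pair with the trained reward model.
    rewards = [torch.tensor(out["score"]) for out in reward_pipe(texts)]
    # One PPO optimization step on this rollout batch.
    ppo_trainer.step(query_tensors, response_tensors, rewards)
```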
To run a standard training pass on the Anthropic-hh dataset, execute the script for the stage you need or follow the examples above.
Feel free to explore and modify the scripts and configs to adapt the fine-tuning process to your requirements.
To generate text with the fine-tuned models, use the following command:

```bash
python generate.py --pretrained_model_name_or_path RaushanTurganbay/GPT2_sft_and_dpo_tuned --max_new_tokens 256 {{other generation_args}}
```
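If you prefer to call the model directly, the equivalent with the plain transformers API looks roughly like this (the dialogue-style prompt is an assumption based on the Anthropic-hh format, and the sampling settings are illustrative):

```python
# Load the tuned checkpoint from the Hub and sample a response.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "RaushanTurganbay/GPT2_sft_and_dpo_tuned"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Prompt mirrors the Anthropic-hh dialogue style (an assumption).
prompt = "\n\nHuman: How do I stay motivated to exercise?\n\nAssistant:"
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=256, do_sample=True, top_p=0.9,
                        pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```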