Skip to content

Latest commit

 

History

History
8.6 MB

2022-Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.pdf

File metadata and controls

8.6 MB
Loading