alpaca_farm
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
842stars
Forks
63
Open issues
6
Watchers
842
Size
1.9 MB
PythonApache License 2.0
deep-learninginstruction-followinglarge-language-modelsnatural-language-processingreinforcement-learning-from-human-feedback
Created: May 3, 2023
Updated: Apr 10, 2026
Last push: Jul 1, 2024