alpaca_farm
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
844 stars
Forks
64
Open issues
6
Watchers
844
Size
1.9 MB
Python
Apache License 2.0
deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback
Created: May 3, 2023
Updated: May 26, 2026
Last push: Jul 1, 2024