
alpaca_farm

tatsu-lab/alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

846 stars

Forks

Open issues

Watchers

846

Size: 1.9 MB

Language: Python

License: Apache License 2.0

Topics: deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback

Created: May 3, 2023

Updated: Jul 12, 2026

Last push: Jul 1, 2024