⭐ Star AlbumentationsX on GitHub — 448+ stars and counting!

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,839stars

Forks

159

Open issues

Watchers

1,839

Size

28.1 MB

MIT License

Created: Apr 10, 2022

Updated: May 27, 2026

Last push: Jun 17, 2025

Archived