Star AlbumentationsX on GitHub — it powers this leaderboard
huggingface/trl
Train transformer language models with reinforcement learning.