Star AlbumentationsX on GitHub — it powers this leaderboard
huggingface/nanotron
Minimalistic large language model 3D-parallelism training