Star AlbumentationsX on GitHub — it powers this leaderboard
LAION-AI/dataset-spec
Describe the format of image/text datasets