Star AlbumentationsX on GitHub — it powers this leaderboard
gchq/synthetic-data-generator
Code for generating synthetic data for testing