⭐ Star AlbumentationsX on GitHub — 307+ stars and counting!
Shubhamsaboo/olmocr
Toolkit for linearizing PDFs for LLM datasets/training