Star AlbumentationsX on GitHub — it powers this leaderboard
toeverything/pdf-extract
A rust library for extracting content from pdfs