Star AlbumentationsX on GitHub — it powers this leaderboard

Star on GitHub
← Back to leaderboard
QuivrHQ

MegaParse

QuivrHQ/MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

7,342stars
Forks
416
Open issues
32
Watchers
7,342
Size
56.6 MB
PythonApache License 2.0
docxllmparserpdfpowerpoint
Created: May 29, 2024
Updated: Feb 27, 2026
Last push: Feb 21, 2025