QuivrHQ/MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Stars: 7,377
Forks: 419
Open issues: 32
Watchers: 7,377
Size: 56.6 MB
Language: Python
License: Apache License 2.0
Topics: docx, llm, parser, pdf, powerpoint
Created: May 29, 2024
Updated: May 26, 2026
Last push: Feb 21, 2025