Star AlbumentationsX on GitHub — it powers this leaderboard

Star on GitHub
← Back to leaderboard
opendatalab

MinerU

opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

54,481stars
Forks
4,519
Open issues
183
Watchers
54,481
Size
145.6 MB
PythonGNU Affero General Public License v3.0
ai4sciencedocument-analysisextract-datalayout-analysisocrparserpdfpdf-converterpdf-extractor-llmpdf-extractor-pretrainpdf-extractor-ragpdf-parserpython
Created: Feb 29, 2024
Updated: Feb 18, 2026
Last push: Feb 9, 2026