MinerU
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
54,482stars
Forks
4,521
Open issues
181
Watchers
215
Size
145.6 MB
PythonGNU Affero General Public License v3.0
extract-datalayout-analysisocrparserpdfpdf-converterpythondocument-analysispdf-parserpdf-extractor-llmpdf-extractor-pretrainpdf-extractor-ragai4science
Created: Feb 29, 2024
Updated: Feb 18, 2026
Last push: Feb 9, 2026