mineru-pdf

// Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

$ git log --oneline --stat

stars:370

forks:70

updated:February 19, 2026

SKILL.mdreadonly

SKILL.md Frontmatter

namemineru-pdf

descriptionParse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

MinerU PDF

Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested.

# Run from the skill directory
./scripts/mineru_parse.sh /path/to/file.pdf

Optional examples:

./scripts/mineru_parse.sh /path/to/file.pdf --format json
./scripts/mineru_parse.sh /path/to/file.pdf --tables --images

If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read:

Output root defaults to ./mineru-output/.
MinerU creates the per-document subfolder under the output root (e.g., ./mineru-output/<basename>/...).

Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.