mineru-pdf
Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Why use this skill?
Convert PDFs to Markdown or JSON locally with MinerU. High-fidelity document parsing for text, tables, and images without cloud dependencies.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/kesslerio/mineru-pdf-parser-clawdbot-skill
What This Skill Does
The mineru-pdf skill converts complex PDF documents into structured, machine-readable formats using the MinerU engine. It runs entirely on your local CPU, so files stay on your machine and parsing works offline. MinerU's layout analysis lets the skill extract text, tables, and images with high fidelity. Whether you are dealing with academic papers, technical documentation, or financial reports, the skill turns static PDF content into Markdown or JSON that is ready for downstream LLM analysis or database ingestion.
Installation
To integrate this skill into your environment, use the OpenClaw installation command:
clawhub install openclaw/skills/skills/kesslerio/mineru-pdf-parser-clawdbot-skill
Make sure the system dependencies required by MinerU are installed on your machine. Once installed, the skill exposes the mineru_parse.sh script, which is the primary interface for PDF processing tasks.
Use Cases
- Research Extraction: Parse dense academic PDFs into Markdown to summarize findings or populate knowledge bases.
- Data Digitization: Convert tabular data trapped in PDF reports into structured JSON for automated analysis or spreadsheet ingestion.
- Content Migration: Transform legacy PDF manuals into modern Markdown documents while preserving structural hierarchy.
- Local Data Privacy: Process sensitive documentation entirely on-device without uploading files to third-party cloud parsing services.
Example Prompts
- "Please parse the document located at /data/research/2023_report.pdf and output the result in Markdown format."
- "Extract the tables and images from the file /docs/manual.pdf using the mineru-pdf skill and save the output to the default directory."
- "Convert /finance/q4_statement.pdf into a structured JSON file so I can programmatically analyze the financial data."
Tips & Limitations
- Performance: Because this skill performs heavy computation locally on the CPU, large documents or high-resolution images may increase processing time. Ensure your environment has sufficient available RAM.
- Output Management: MinerU follows a strict directory structure. All output is generated under ./mineru-output/<basename>/ to prevent directory clutter. Always verify the file paths in your logs after execution.
- Reference Documentation: For complex configurations, such as tuning backend methods or multi-threading, consult the references/mineru-cli.md file included in the skill repository.
- Batch Processing: This skill is optimized for single-file processing. If you have large archives, consider writing a simple shell loop to iterate through your directory rather than forcing batch flags.
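The batch-processing and output-location tips can be combined into a small shell loop. The following is a minimal sketch, not the skill's documented interface: it assumes the parser is invoked as "<parser> <file.pdf>", that ./mineru_parse.sh is where the script lives (a hypothetical default path), and that <basename> means the file name without the .pdf extension. Check references/mineru-cli.md for the actual arguments before relying on it.

```shell
#!/usr/bin/env bash
# Sketch: run a single-file PDF parser over every PDF in a directory.
# ASSUMPTIONS (not confirmed by the skill docs): the parser is called as
# "<parser> <file.pdf>", and <basename> is the file name without ".pdf".

# Print the output folder we expect for one PDF.
expected_output_dir() {
  local pdf="$1"
  printf './mineru-output/%s/\n' "$(basename "$pdf" .pdf)"
}

# Parse each *.pdf in a directory, one file at a time.
# $1: directory to scan
# $2: parser command (hypothetical default: ./mineru_parse.sh)
parse_all_pdfs() {
  local dir="$1"
  local parser="${2:-./mineru_parse.sh}"
  local pdf failed=0
  for pdf in "$dir"/*.pdf; do
    [ -e "$pdf" ] || continue   # the glob matched nothing
    if "$parser" "$pdf"; then
      echo "ok: $pdf -> $(expected_output_dir "$pdf")"
    else
      echo "failed: $pdf" >&2
      failed=$((failed + 1))
    fi
  done
  [ "$failed" -eq 0 ]   # non-zero status if any file failed
}
```

After sourcing these functions, a call such as parse_all_pdfs /data/research would process the files sequentially, which keeps peak RAM usage close to that of a single document. The printed output folder is only the expected location under the assumptions above, so still confirm the real paths in the logs.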
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-kesslerio-mineru-pdf-parser-clawdbot-skill": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: file-write, file-read, code-execution