Official Verified file management Safety 3/5

pdf-parser

使用 MinerU API 将 PDF 解析为 Markdown，支持公式、表格、OCR。提供本地文件和在线 URL 两种解析方式。触发条件：(1) 用户说"解析 PDF [路径]"，(2) 用户说"将 PDF 转为 Markdown"，(3) 在 paper-workflow 中自动调用。使用场景：学术论文解析、文档提取、知识库构建。

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alex-zxyz/mineru-pdf-parser

Download Source Code (.zip)

What This Skill Does

The PDF Parser skill integrates the powerful MinerU API directly into your OpenClaw agent workflow, enabling high-fidelity conversion of complex PDF documents into structured Markdown. Unlike standard text extractors, this skill is specifically engineered to handle the nuances of technical and academic documents. It features advanced OCR for image-based PDFs, precise LaTeX formula extraction, and automatic structural conversion of tables into Markdown format. By leveraging the MinerU engine, the agent can transform static, unreadable PDF layouts into machine-actionable knowledge assets that can be further processed by LLMs or stored in your local knowledge base.

Installation

To begin, ensure you have the necessary dependencies installed by running pip install requests in your environment. You must obtain a valid API token from https://mineru.net/. Once you have your token, you must set it as an environment variable named MINERU_TOKEN. On Linux/macOS, use export MINERU_TOKEN=your-token-here, and on Windows PowerShell, use $env:MINERU_TOKEN = "your-token-here". Finally, install the skill using the OpenClaw command: clawhub install openclaw/skills/skills/alex-zxyz/mineru-pdf-parser.

Use Cases

Academic Research: Rapidly convert long research papers and journal articles into Markdown to summarize, query, or extract data points using AI.
Technical Documentation: Parse manuals and blueprints that include complex tables and mathematical notations for integration into documentation systems.
Knowledge Management: Convert local archives of reports and financial statements into a standardized format for building private RAG (Retrieval-Augmented Generation) knowledge bases.
Document Archiving: Streamline the process of digitizing legacy scanned files that are otherwise locked in static image formats.

Example Prompts

"解析 PDF ~/Documents/research_paper.pdf，并将内容转换成 Markdown 格式。"
"将 PDF 转为 Markdown，请使用 pipeline 引擎进行精细化解析。"
"解析当前的 PDF 文件，并提取其中的公式与表格数据到知识库。"

Tips & Limitations

Engine Selection: While the 'vlm' engine is default and highly versatile, switch to 'pipeline' if you require stricter adherence to document flow for complex layouts.
Privacy: Be aware that this skill sends your document content to the MinerU external API for processing. Avoid processing highly sensitive or classified documents.
File Management: Note that processed outputs are automatically stored in ~/.openclaw/MinerU_Results/. Ensure you have sufficient disk space when processing large batches of high-resolution PDFs.
Latency: The parsing process involves network requests and heavy computation on the server side; always use the --wait flag if you want the agent to confirm completion before proceeding to downstream tasks.

Read Full Documentation on GitHub

Metadata

Author@alex-zxyz

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alex-zxyz-mineru-pdf-parser": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#pdf#markdown#ocr#academic#parser

Safety Score: 3/5

Flags: file-read, file-write, external-api

Related Skills

minimax-usage

查询 MiniMax Token Plan 剩余用量。slash command。查询 MiniMax Token Plan 剩余次数和重置时间，支持 M2.7/Speech/视频/图片/音乐等模型的用量查询。 Query MiniMax Token Plan usage and reset time. Supports M2.7, Speech, Video, Image, and Music models.

alex-zxyz 4473