ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified data analysis Safety 4/5

paper-parse

Parse academic PDF papers into markdown with figure extraction.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/chen-li-17/paper-parse-figures
Or

What This Skill Does

The paper-parse skill is a specialized utility designed for researchers, students, and academics who frequently interact with scholarly PDF documents. By leveraging the power of PyMuPDF and pymupdf4llm, this agent skill automates the tedious process of converting rigid PDF layouts into structured, readable Markdown files. Beyond simple text extraction, the tool identifies and isolates figures and captions, creating a dedicated repository of visual assets. It essentially bridges the gap between static academic publishing formats and modern LLM-compatible text formats, allowing you to ingest complex research papers into your AI workflows without losing structural integrity.

Installation

To integrate this skill into your OpenClaw environment, use the following CLI command provided by the ClawHub registry:

clawhub install openclaw/skills/skills/chen-li-17/paper-parse-figures

This command automatically resolves the necessary dependencies, including PyMuPDF and the pymupdf4llm library, through uv. Ensure you have uv installed in your path for a seamless setup experience.

Use Cases

This skill is ideal for several professional scenarios:

  1. Literature Reviews: Quickly convert a folder of fifty PDF papers into Markdown to build a local vector database for Retrieval-Augmented Generation (RAG).
  2. Academic Summarization: Extract text and figures from new pre-prints to generate automated briefing documents or summaries.
  3. Knowledge Management: Convert dense research documents into Obsidian or Notion-compatible Markdown files, complete with referenced images.
  4. Data Extraction: Programmatically pull figure-based datasets from papers for further analysis.

Example Prompts

  1. "OpenClaw, please parse the PDF located at ~/downloads/transformer_paper.pdf and place the markdown output into my research folder."
  2. "Extract all figures from the paper in ./input/quantum-research.pdf and organize them into the /figures folder for my upcoming presentation."
  3. "Convert the academic paper at ./data/diffusion-models.pdf to markdown and identify the authors in the parsed metadata json."

Tips & Limitations

For optimal results, ensure the PDF files are not scanned images (OCR-based PDFs might require additional preprocessing). The skill works best with standard academic two-column layouts. If the output directory is not specified, it defaults to the script's local context, so it is recommended to always define an --output-dir to keep your project workspace clean. Note that highly complex figures with embedded sub-figures may occasionally require manual oversight to ensure captions are correctly mapped to their respective images.

Metadata

Stars3875
Views0
Updated2026-04-07
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-chen-li-17-paper-parse-figures": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#research#pdf-parsing#markdown#academic#data-extraction
Safety Score: 4/5

Flags: file-read, file-write, code-execution