Official | Verified | file management | Safety 4/5

Unidoc Parser

Skill by aaiccee


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/aaiccee/unidoc-parser

What This Skill Does

The Unidoc Parser skill converts unstructured documents into machine-readable data. Using the cloud-based UniDoc API, it transforms PDFs, Microsoft Word documents (.doc, .docx), and image formats such as PNG and JPG into either Markdown or structured JSON. It supports synchronous processing for quick tasks and asynchronous processing with status polling for larger, resource-intensive files, so the output is ready for downstream AI consumption, indexing, or archival without manual intervention.

Installation

To integrate this capability into your OpenClaw environment, execute the following command in your terminal:

clawhub install openclaw/skills/skills/aaiccee/unidoc-parser

Ensure that you have an active network connection, as the skill requires communication with the UniDoc API endpoint (http://unidoc.uat.hivoice.cn) to perform its processing tasks.

Use Cases

This skill suits workflows that ingest documents in high volume. Use cases include:

  1. Data Extraction: Converting scanned invoices or legal contracts into JSON format for direct ingestion into databases or CRM systems.
  2. Content Migration: Turning legacy Word documents into Markdown files to facilitate easier migration to static site generators or documentation platforms.
  3. Batch Processing: Automating the conversion of entire directories of research papers or reports into a readable, searchable format using asynchronous mode for background execution.
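The batch-processing case above can be sketched as a small planning step that walks a directory and decides which files to send in which mode. This is a minimal illustration, not part of the skill: the function name `plan_batch`, the `"sync"` mode label, and the 5 MB cutoff are assumptions made for the example, and the extension list simply mirrors the formats named in the description.

```python
from pathlib import Path

# Extensions named in the skill description; the real service may accept more or fewer.
SUPPORTED_EXTENSIONS = {".pdf", ".doc", ".docx", ".png", ".jpg"}


def plan_batch(directory, async_threshold_bytes=5 * 1024 * 1024):
    """Return (path, mode) pairs for every supported file under `directory`.

    `async_threshold_bytes` is a hypothetical cutoff chosen for illustration,
    not a documented UniDoc size limit. "sync" is a placeholder label for the
    default (non-async) mode.
    """
    plan = []
    for path in sorted(Path(directory).rglob("*")):
        # Skip directories and unsupported file types.
        if path.is_file() and path.suffix.lower() in SUPPORTED_EXTENSIONS:
            size = path.stat().st_size
            mode = "async" if size > async_threshold_bytes else "sync"
            plan.append((path, mode))
    return plan
```

Calling `plan_batch("./documents")` returns a sorted list that a wrapper script could iterate over, passing `--mode async` only for the entries marked `"async"`.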

Example Prompts

  1. "Unidoc, please parse the report at ./documents/annual_review.pdf into Markdown format and save the result to ./outputs/annual_review."
  2. "Convert the document at /data/receipts/march_invoices.docx into JSON so I can use the data in my spreadsheet generator. Use asynchronous mode since it is a large file."
  3. "Run the Unidoc Parser on /files/scanned_image.jpg to extract the text content into an md file."

Tips & Limitations

For large batches or files that exceed the standard size limits, prefer asynchronous mode (--mode async), which avoids connection timeouts. The tool polls for status every second on its own, so you do not need to intervene while a job runs. The skill performs file-write operations, so ensure your target directories have appropriate write permissions. If you run into connectivity problems, verify that your local environment can reach the UniDoc API URL; if errors persist, check the references/unidoc-notes.md file for API-specific troubleshooting steps.
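The once-per-second status polling described above follows a common pattern, sketched here in isolation. This is not the skill's actual implementation: `check_status` is a caller-supplied hypothetical function, the state names `"pending"`, `"done"`, and `"failed"` are placeholders rather than real UniDoc API values, and the timeout is an added safeguard that the source does not mention.

```python
import time


def poll_until_done(check_status, interval_seconds=1.0,
                    timeout_seconds=300.0, sleep=time.sleep):
    """Call `check_status()` repeatedly until it reports a terminal state.

    Returns the total number of seconds spent waiting between checks.
    `check_status` must return "pending", "done", or "failed"; these names
    are placeholders, not the UniDoc API's actual status values.
    """
    waited = 0.0
    while True:
        status = check_status()
        if status == "done":
            return waited
        if status == "failed":
            raise RuntimeError("conversion reported failure")
        if waited >= timeout_seconds:
            raise TimeoutError(f"no result after {waited:.0f} seconds")
        # Wait one interval before asking again.
        sleep(interval_seconds)
        waited += interval_seconds
```

The `sleep` parameter exists so the loop can be exercised in tests without real delays; in normal use the default `time.sleep` applies.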

Metadata

Author: @aaiccee
Stars: 4473
Views: 0
Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-aaiccee-unidoc-parser": {
      "enabled": true,
      "auto_update": true
    }
  }
}
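If clawhub.json already holds other plugins, pasting the snippet by hand risks overwriting them. A minimal sketch of merging the entry programmatically follows; the helper name `enable_plugin` is hypothetical, and the code assumes only the file layout shown above (a top-level "plugins" object keyed by plugin name).

```python
import json
from pathlib import Path

# Plugin key and settings copied from the configuration snippet above.
PLUGIN_KEY = "official-aaiccee-unidoc-parser"
PLUGIN_SETTINGS = {"enabled": True, "auto_update": True}


def enable_plugin(config_path):
    """Add the Unidoc Parser entry to a clawhub.json file, keeping other entries.

    Creates the file if it does not exist. Assumes the top-level "plugins"
    object shown in the snippet; any other keys in the file are preserved.
    """
    path = Path(config_path)
    if path.exists():
        config = json.loads(path.read_text(encoding="utf-8"))
    else:
        config = {}
    plugins = config.setdefault("plugins", {})
    plugins[PLUGIN_KEY] = dict(PLUGIN_SETTINGS)
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Running `enable_plugin("clawhub.json")` in the directory that holds the configuration file leaves existing plugin entries untouched and adds or updates only this one.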

Tags (AI)

#document-processing #ocr #conversion #pdf-tools #automation

Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api