U2 Doc Parser
Skill by aaiccee
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/aaiccee/u2-doc-parser
What This Skill Does
The U2 Doc Parser is a document conversion utility by aaiccee that integrates the UniDoc API into your OpenClaw workflow. It takes PDF, DOC, and DOCX files, as well as images such as PNG and JPG, and converts them into clean, structured Markdown or JSON. OCR and text extraction run on a cloud-based backend, which turns static documents into machine-readable data or editable text files. The skill supports two modes: synchronous processing, where the agent waits for the file to be processed, and asynchronous processing, where the agent polls for completion. Having both modes lets the skill cope with a range of file sizes and network conditions.
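The asynchronous mode described above amounts to a polling loop. The sketch below illustrates that general pattern only; it is not the skill's actual implementation. The `check_status` callable, the `"state"` field, and its values (`"pending"`, `"done"`, `"failed"`) are hypothetical stand-ins, since the UniDoc API's real response shape is not documented here.

```python
import time
from typing import Callable, Dict


def poll_until_done(
    check_status: Callable[[], Dict[str, str]],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, str]:
    """Call check_status() until it reports a terminal state or the timeout elapses.

    check_status is a hypothetical callable that returns a dict with a
    "state" key; a real client would wrap whatever status request the
    backend exposes.
    """
    deadline = clock() + timeout_s
    while True:
        status = check_status()
        state = status.get("state")
        if state == "done":
            return status
        if state == "failed":
            raise RuntimeError(f"conversion failed: {status.get('error', 'unknown error')}")
        if clock() >= deadline:
            raise TimeoutError("conversion did not finish before the timeout")
        # Wait before asking again so the backend is not flooded with requests.
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real delays. A synchronous call, by contrast, would simply block on a single request until the result arrives.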
Installation
To integrate this utility into your environment, use the OpenClaw package manager. Open your terminal and run the following command:
clawhub install openclaw/skills/skills/aaiccee/u2-doc-parser
The installer pulls the necessary scripts from the openclaw/skills repository, so make sure you have an active internet connection.
Use Cases
This skill suits developers and data analysts who need to automate the ingestion of legacy documents. For example, you can convert scanned PDF invoices into JSON objects for automated bookkeeping, or turn dense technical manuals into Markdown files for a static site generator or a knowledge base. It can also convert academic papers or image-based notes into accessible formats that other AI agents in your ecosystem can parse or summarize.
Example Prompts
- "Use the U2 Doc Parser to convert the file 'manual.pdf' in my downloads folder to a Markdown file named 'manual.md'."
- "Extract the data from 'report.docx' and save it as a JSON file, using the async mode to ensure it handles the conversion process reliably."
- "Convert my scanned image 'handwritten_notes.jpg' to Markdown and print the output directly to my terminal so I can verify the OCR quality."
Tips & Limitations
- Privacy First: Always remember that this skill uploads your files to https://unidoc.uat.hivoice.cn. Never process sensitive or confidential data.
- Performance: For large documents, always prefer --mode async to prevent timeouts.
- Monitoring: If you encounter issues, refer to references/unidoc-notes.md to troubleshoot API connectivity.
- Formatting: Default to Markdown for reading convenience and JSON for data integration workflows.
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-aaiccee-u2-doc-parser": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags: AI
Flags: network-access, file-read, file-write, external-api
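If clawhub.json already holds other settings, pasting the fragment by hand risks overwriting them. The sketch below merges the plugin entry into an existing file instead. It assumes clawhub.json is a plain JSON file with a top-level "plugins" object, as the fragment above suggests. The `enable_plugin` helper is hypothetical and not part of clawhub.

```python
import json
from pathlib import Path

# Plugin key and settings copied from the clawhub.json fragment above.
PLUGIN_KEY = "official-aaiccee-u2-doc-parser"
PLUGIN_SETTINGS = {"enabled": True, "auto_update": True}


def enable_plugin(config_path: Path) -> dict:
    """Merge the plugin entry into config_path, preserving other settings.

    Hypothetical helper: assumes the file, if present, contains a JSON object.
    """
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}
    # Create the "plugins" object and the plugin entry only if they are missing.
    plugins = config.setdefault("plugins", {})
    entry = plugins.setdefault(PLUGIN_KEY, {})
    entry.update(PLUGIN_SETTINGS)
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, `enable_plugin(Path("clawhub.json"))` updates a file in the current directory; the real location of clawhub.json depends on your setup.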