pdf-to-docx
Convert PDF files to editable Word documents using pdf2docx
Why use this skill?
Efficiently convert static PDF files into editable Word documents while preserving layout, tables, and text formatting using the OpenClaw pdf-to-docx skill.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/lijie420461340/pdf-to-docxWhat This Skill Does
The pdf-to-docx skill is an efficient utility for OpenClaw users to transform static PDF files into fully editable Microsoft Word (DOCX) documents. Utilizing the powerful pdf2docx Python library, this skill focuses on high-fidelity extraction that preserves the document's original layout, including tables, images, and complex text formatting. Unlike standard copy-paste or inferior OCR methods that often lose structural integrity, this tool reads the native metadata and layout objects of the PDF to reconstruct the document in a format suitable for immediate editing, restructuring, or archival.
Installation
To integrate this skill into your environment, run the following command in your terminal within your OpenClaw project workspace:
clawhub install openclaw/skills/skills/lijie420461340/pdf-to-docx
Ensure your Python environment meets the dependencies required by pdf2docx before running.
Use Cases
This skill is indispensable for professionals dealing with documentation workflows. Common use cases include:
- Transforming non-editable business reports or contracts into Word files for revision or collaboration.
- Extracting tables from annual reports to manipulate data in Word or Excel.
- Converting long research papers or manuals into a readable Word format for easier annotation.
- Automating the conversion of multiple PDF documents into standard business templates.
Example Prompts
- "Please convert the attached PDF report into an editable Word document for my team to review."
- "I only need pages 1 through 5 of this long PDF document; can you turn those into a Word file for me?"
- "Convert this contract to Word format and ensure all the tables are preserved correctly."
Tips & Limitations
- Best results are achieved with "native" PDFs (those generated directly from software like Word or LaTeX). If you are working with scanned PDFs containing images of text, consider using an OCR pre-processing step, as this skill prioritizes native document structure over pixel-based image analysis.
- For large documents, specify page ranges to optimize processing speed.
- Use the provided advanced configuration options if you encounter issues with margins or line-spacing in the resulting output.
- Ensure you have read access to the file location and write permissions for the target output directory when invoking the tool.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-lijie420461340-pdf-to-docx": {
"enabled": true,
"auto_update": true
}
}
}Tags
Flags: file-read, file-write
Related Skills
career-compass
职场罗盘 by Barry — 一站式求职辅助 Skill。整合简历解析优化、公司调研(就业向)、同城职位搜索、模拟面试四大模块。输入个人信息/简历,自动生成简历优化方向、公司调研报告、招聘表单,并可进行模拟面试。
wechat-article-export
微信公众号多功能导出工具。將公眾號文章導出為長截圖(PNG)、PDF 或 Markdown,支持任選一種或多種格式。觸發詞:「導出微信文章」、「公眾號截圖」、「文章轉PDF」、「文章轉Markdown」、「微信導出」。
DocPilot
智能文档处理专家,支持文档解析、信息抽取、文档分类
landing-page-angle-tester
针对同一产品生成多种 landing page 叙事角度,并标注适配人群和证据要求。;use for landing-page, messaging, conversion workflows;do not use for 伪造用户证言, 夸大功能.
accounting-assistant
Buchhaltungs-Automatisierung mit EÜR-Erstellung, DATEV-Export, PDF-Beleganalyse und Steuer-Vorbereitung. Ideal für Freelancer und KMU.