markdown-converter
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Why use this skill?
Learn how to use the OpenClaw markdown-converter skill to turn PDFs, Office files, images, and audio into structured Markdown for better LLM processing and analysis.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/steipete/markdown-converterWhat This Skill Does
The markdown-converter is a powerful utility designed to bridge the gap between unstructured or binary document formats and LLM-friendly text. By utilizing the markitdown engine, this skill serves as an automated pipeline that ingests complex files like PDFs, Office documents (Word, Excel, PowerPoint), and even multimedia streams like YouTube URLs or audio files, and serializes their content into clean, structured Markdown. This conversion is crucial for RAG (Retrieval-Augmented Generation) workflows, data analysis, and content synthesis, as it extracts semantic structure—such as tables, headers, and lists—that standard text extractors often fail to preserve. It supports advanced features like OCR for images and transcription for audio, ensuring that your AI agent can 'see' and 'hear' the contents of your workspace.
Installation
To integrate this skill into your environment, run the following command within your terminal or the OpenClaw management console:
clawhub install openclaw/skills/skills/steipete/markdown-converter
Once installed, the uvx package manager will handle the dependencies on the first run, creating a cached environment for efficient performance in subsequent execution.
Use Cases
This skill is ideal for developers and analysts dealing with diverse data silos. Common use cases include:
- Automated Reporting: Convert complex quarterly PDFs or Excel sheets into Markdown to summarize insights via an LLM.
- Content Transformation: Transcribe audio lectures or YouTube videos directly into structured blog posts or summary documents.
- Data Ingestion: Process legacy HTML, JSON, or XML archives into a unified format for bulk analysis.
- OCR Processing: Extract text from scanned physical documents or images of whiteboards to make them searchable and editable.
Example Prompts
- "Convert the quarterly-results.pdf in my current directory to Markdown and save it as summary.md."
- "Extract the text and table data from data_dump.xlsx so I can analyze the growth trends."
- "Summarize the content of this YouTube video link: [URL] by converting its transcript into a Markdown summary using the markdown-converter tool."
Tips & Limitations
For the best results with complex or handwritten PDF documents, always utilize the Azure Document Intelligence flag (-d) combined with your specific endpoint; this significantly improves structural accuracy compared to standard PDF parsing. Note that the initial execution may take a moment while the environment installs the necessary backend dependencies. For large ZIP files, be aware that the skill will iterate through all contained files, which may consume significant processing time. Always verify the charset if you are dealing with legacy files to prevent character encoding issues.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-steipete-markdown-converter": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, file-write, external-api
Related Skills
swiftui-liquid-glass
Implement, review, or improve SwiftUI features using the iOS 26+ Liquid Glass API. Use when asked to adopt Liquid Glass in new SwiftUI UI, refactor an existing feature to Liquid Glass, or review Liquid Glass usage for correctness, performance, and design alignment.
qmd
Local search/indexing CLI (BM25 + vectors + rerank) with MCP mode.
songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
bird
X/Twitter CLI for reading, searching, and posting via cookies or Sweetistics.