ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

doc-importer

Import external documents (PDF, DOCX, PPTX, XLSX, HTML) into editable markdown for rewriting or project integration

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/athola/nm-scribe-doc-importer
Or

Night Market Skill — ported from claude-night-market/scribe. For the full experience with agents, hooks, and commands, install the Claude Code plugin.

Document Importer

Import external documents into editable markdown.

When To Use

  • User provides a DOCX, PPTX, XLSX, PDF, or HTML file to convert into project documentation
  • User wants to extract content from a document for rewriting or remediation
  • User has a slide deck or spreadsheet to turn into markdown documentation

When NOT To Use

  • Academic paper analysis: use tome:papers
  • Web article knowledge intake: use memory-palace:knowledge-intake
  • Content already in markdown: use scribe:doc-generator remediation mode directly

Import Workflow

Step 1: Identify Source

Determine the source document:

  • Local file path: verify it exists with Read tool
  • URL: verify accessibility
  • User description: confirm format and location

Step 2: Convert to Markdown

Apply the leyline:document-conversion protocol:

  1. Construct URI from source (file path or URL)
  2. Try the markitdown MCP tool for best quality
  3. If unavailable, use native tool fallbacks
  4. If format unsupported, inform user

Step 3: Structural Cleanup

After conversion, normalize the markdown:

  • Ensure ATX headings (# style, not setext underlines)
  • Wrap prose lines at 80 characters per leyline:markdown-formatting
  • Fix broken tables (align columns, add headers)
  • Remove conversion artifacts (page numbers, headers/footers, watermarks, repeated logos)
  • Preserve all substantive content

Step 4: Sanitize External Content

Apply the leyline:content-sanitization checklist:

  • Size check (truncate sections over 2000 words)
  • Strip system/instruction tags
  • Wrap in external content boundary markers

Step 5: Write Draft

Write the converted markdown to the target location. Default: same directory as source, with .md extension. Ask the user for target path if ambiguous.

Step 6: Hand Off to Doc-Generator (Optional)

If the user wants polishing or rewriting:

  • Invoke Skill(scribe:doc-generator) in Remediation mode on the imported file
  • The doc-generator handles slop detection, style application, and quality gates

Offer this step; do not assume the user wants remediation.

Output Quality

The imported markdown should:

  • Have a top-level # Title from the document title
  • Preserve the original heading hierarchy
  • Convert tables to markdown tables
  • Convert images to ![alt](path) references (note: image files may need separate handling)
  • Convert lists faithfully
  • Mark unclear or garbled sections with <!-- REVIEW: conversion artifact -->

Exit Criteria

Metadata

Author@athola
Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-athola-nm-scribe-doc-importer": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.