Official Verified utilities Safety 5/5

ocr-local

Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/15914355527/ocr-local-v2

Download Source Code (.zip)

What This Skill Does

The ocr-local skill provides a robust, private, and efficient text recognition engine for the OpenClaw platform. By utilizing Tesseract.js, it performs Optical Character Recognition (OCR) entirely on your local machine. This means your images are never uploaded to a cloud server, ensuring complete data privacy and security. The tool supports high-accuracy detection of Simplified Chinese, Traditional Chinese, and English, making it ideal for extracting data from screenshots, PDFs, or photos of documents without needing expensive or rate-limited API keys.

Installation

To integrate this capability into your workflow, use the following OpenClaw command in your terminal:

clawhub install openclaw/skills/skills/15914355527/ocr-local-v2

Once installed, the skill resides within your local scripts directory. Note that the first time you execute an OCR task, the engine will automatically download the necessary language training data (approximately 20MB per language). These files are cached locally for future use, ensuring that subsequent requests remain extremely fast.

Use Cases

Screenshot Data Extraction: Convert text found in application UI, dashboards, or error messages into searchable/copyable text.
Document Digitization: Transform images of receipts, invoices, or manual notes into digital text for archiving or database entry.
Multilingual Support: Extract mixed-language data from documents containing both English and Chinese characters.
Automation Pipelines: Chain this skill with other OpenClaw tools to perform text-based analysis on image assets automatically.

Example Prompts

"OpenClaw, extract the text from the screenshot named invoice_001.png and save it as a text file for my records."
"Use the ocr-local skill on document.jpg and translate the content into English if it contains Traditional Chinese."
"Scan the dashboard image and give me the total value figures found in the text using the JSON output mode."

Tips & Limitations

For optimal results, ensure your input images are of high resolution and contrast. While the model is highly capable, it may struggle with low-resolution photos, complex background patterns, or heavily stylized handwriting. Because it runs locally, processing time is dependent on your machine's CPU power; large images may take a few seconds longer to process. Always ensure your image files are accessible to the script's path when calling it via the command line.

Read Full Documentation on GitHub

Metadata

Author@15914355527

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-15914355527-ocr-local-v2": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#ocr#tesseract#text-extraction#privacy#chinese-ocr

Safety Score: 5/5

Flags: file-read, code-execution

Related Skills

tavily-search

Web search using Tavily's LLM-optimized API. Returns relevant results with content snippets, scores, and metadata.

15914355527 4473

exa-web-search-free

Free AI search via Exa MCP. Web search for news/info, code search for docs/examples from GitHub/StackOverflow, company research for business intel. No API key needed.

15914355527 4473

baidu-search

Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.

15914355527 4473

nano-pdf

Edit PDFs with natural-language instructions using the nano-pdf CLI.

15914355527 4473