Official Verified utilities Safety 5/5

image-ocr

Extract text from images using Tesseract OCR

Why use this skill?

Use the OpenClaw image-ocr skill to quickly extract text from PNG, JPEG, and TIFF files. Automate document digitization using Tesseract OCR.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/xejrax/image-ocr

Download Source Code (.zip)

What This Skill Does

The image-ocr skill for OpenClaw provides a robust interface for performing Optical Character Recognition (OCR) on various image file formats. By leveraging the industry-standard Tesseract engine, this skill allows users to transform visual data into machine-readable text seamlessly. Whether you are dealing with screenshots, scanned documents, or photographic text, this skill processes the pixels and returns clean, extractable string data. It supports a wide array of formats including PNG, JPEG, TIFF, and BMP, and offers language-specific support to ensure accurate character recognition across different character sets.

Installation

To utilize this skill, ensure you have the base Tesseract engine installed on your system. On Fedora or RHEL-based systems, execute 'sudo dnf install tesseract'. Once the dependency is met, install the OpenClaw skill by running: 'clawhub install openclaw/skills/skills/xejrax/image-ocr'.

Use Cases

This tool is invaluable for digitizing physical paperwork, automating data entry from screenshots, or extracting text from legacy image-based document archives. Developers and power users will find it highly effective for automating workflows where information is trapped in non-textual formats.

Example Prompts

"OpenClaw, extract the text from the screenshot named invoice_001.png and save it to my notes."
"Please run image-ocr on receipt.jpg and tell me the total amount listed at the bottom."
"Use the image-ocr skill to read the document.png file using the French language pack."

Tips & Limitations

For best results, ensure images have high contrast and clear typography. Low-resolution images or those with heavy noise/blur may result in lower accuracy. While Tesseract is powerful, it is not perfect for handwritten notes; it performs best with printed, standard fonts. If you are getting inconsistent results, try preprocessing your images with brightness or contrast adjustments before running the OCR command. Always specify the language flag if you are working with documents that contain non-English characters to significantly improve the character mapping accuracy.

Read Full Documentation on GitHub

Metadata

Author@xejrax

Stars919

Updated2026-02-12

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-xejrax-image-ocr": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#ocr#tesseract#image-processing#digitization

Safety Score: 5/5

Flags: file-read

Related Skills

system-info

Quick system diagnostics: CPU, memory, disk, uptime

xejrax 919

calendar

Manage Google Calendar events using `gcalcli`. Create, list, and delete calendar events from the CLI.

xejrax 919

log-tail

Stream recent logs from systemd journal

xejrax 919

wifi-qr

Generate QR code for Wi-Fi credentials

xejrax 919

ping-beads

Verify the bead daemon is alive and responsive

xejrax 919