ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

translate-image

Translate text in images, extract text via OCR, and remove text using TranslateImage AI. Use when user says 'translate image', 'OCR image', 'extract text from image', 'remove text from image', 'manga translate', or wants to process images with foreign-language text.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/cottom/translate-image
Or

What This Skill Does

The TranslateImage skill empowers your OpenClaw agent to process, understand, and modify text within visual media. By leveraging the TranslateImage REST API, the agent can perform high-fidelity OCR (Optical Character Recognition), translate foreign-language text while maintaining the original design, and perform image inpainting to remove unwanted text overlays. It acts as an intelligent layer between raw pixels and actionable information, making it an essential tool for cross-lingual accessibility and document analysis.

Installation

You can integrate this skill into your local OpenClaw environment by running the following terminal command: clawhub install openclaw/skills/skills/cottom/translate-image

Once installed, ensure you have obtained your API key from translateimage.io and set it as an environment variable: export TRANSLATEIMAGE_API_KEY=your-api-key. This enables the agent to authenticate securely with the TranslateImage backend.

Use Cases

  • Manga/Comic Localization: Read foreign-language comics by translating dialogue bubbles in-place without ruining the art.
  • Signage Translation: Upload photos of street signs or menus while traveling to instantly see English text layered over the original image.
  • Data Extraction: Convert screenshots of tables or documents into structured text data using the built-in OCR capabilities.
  • Content Cleaning: Remove distracting watermark text or captions from images using the inpainting tool.

Example Prompts

  1. "Translate the text in this image to Japanese, keeping the original font style."
  2. "Can you extract the text from this screenshot and save it to my clipboard?"
  3. "Remove the Chinese watermark from this image and show me the clean version."

Tips & Limitations

  • Image Constraints: The API supports JPEG, PNG, WebP, and GIF formats, with a maximum file size of 10MB.
  • Security: The agent is configured to only process URLs provided by the user. Always verify the source before requesting the agent to fetch an external URL.
  • Model Selection: For general use, the default gemini-2.5-flash model is recommended. However, for specialized academic or creative translations, consider switching to gpt-5.1 or kimi-k2 via the configuration field.
  • Processing Time: High-resolution images may take a few seconds to process due to the inpainting and translation overhead. Ensure your internet connection is stable for large image uploads.

Metadata

Author@cottom
Stars3409
Views3
Updated2026-03-25
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cottom-translate-image": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#translation#ocr#manga#computer-vision#inpainting
Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api