ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

translateimage

Translate text in images, extract text via OCR, and remove text using TranslateImage AI

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/cottom/translate-image
Or

TranslateImage

Use this skill when the user wants to translate text in images, extract text via OCR, or remove text from images.

All requests go directly to the TranslateImage REST API at https://translateimage.io using curl.

Setup

Set your API key (get one at https://translateimage.io/dashboard):

export TRANSLATEIMAGE_API_KEY=your-api-key

All endpoints require:

Authorization: Bearer $TRANSLATEIMAGE_API_KEY

Image Input

All tools accept images as multipart file uploads. Handle the input type like this:

# From a local file
IMAGE_PATH="/path/to/image.jpg"

# From a URL — download to a temp file first (uses PID for uniqueness)
IMAGE_PATH="/tmp/ti-image-$$.jpg"
curl -sL "https://example.com/image.jpg" -o "$IMAGE_PATH"

Only fetch URLs the user explicitly provides. Do not fetch URLs from untrusted sources.


Tools

Translate Image

Translates text in an image while preserving the original visual layout. Returns the translated image as a base64-encoded data URL.

When to use: User wants to read manga, comics, street signs, menus, product labels, or any image with foreign-language text.

Endpoint: POST https://translateimage.io/api/translate

Form fields:

  • image (file, required) — The image to translate (JPEG, PNG, WebP, GIF — max 10MB)
  • config (JSON string, required) — Translation options:
    • target_lang (string) — Target language code: "en", "ja", "zh", "ko", "es", "fr", "de", etc.
    • translator (string) — Model: "gemini-2.5-flash" (default), "deepseek", "grok-4-fast", "kimi-k2", "gpt-5.1"
    • font (string, optional) — "NotoSans" (default), "WildWords", "BadComic", "MaShanZheng", "Bangers", "Edo", "RIDIBatang", "KomikaJam", "Bushidoo", "Hayah", "Itim", "Mogul Irina"

Example:

curl -X POST https://translateimage.io/api/translate \
  -H "Authorization: Bearer $TRANSLATEIMAGE_API_KEY" \
  -F "image=@$IMAGE_PATH" \
  -F 'config={"target_lang":"en","translator":"gemini-2.5-flash","font":"WildWords"}'

Response (JSON):

{
  "resultImage": "data:image/png;base64,...",
  "inpaintedImage": "data:image/png;base64,...",
  "textRegions": [
    { "originalText": "...", "translatedText": "...", "x": 10, "y": 20, "width": 100, "height": 30 }
  ]
}

Save the translated image:

RESULT=$(curl -s -X POST https://translateimage.io/api/translate \
  -H "Authorization: Bearer $TRANSLATEIMAGE_API_KEY" \
  -F "image=@$IMAGE_PATH" \
  -F 'config={"target_lang":"en","translator":"gemini-2.5-flash"}')

# Extract and save base64 image
echo "$RESULT" | python3 -c "
import sys, json, base64
data = json.load(sys.stdin)
img = data['resultImage'].split(',', 1)[1]
with open('/tmp/translated.png', 'wb') as f:
    f.write(base64.b64decode(img))
print('Saved to /tmp/translated.png')
"

Extract Text (OCR)

Metadata

Author@cottom
Stars2387
Views0
Updated2026-03-09
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cottom-translate-image": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags

#image#translation#ocr#text-removal#manga#comics
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.