translateimage
Translate text in images, extract text via OCR, and remove text using TranslateImage AI
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/cottom/translate-image
TranslateImage
Use this skill when the user wants to translate text in images, extract text via OCR, or remove text from images.
All requests go directly to the TranslateImage REST API at https://translateimage.io using curl.
Setup
Set your API key (get one at https://translateimage.io/dashboard):
export TRANSLATEIMAGE_API_KEY=your-api-key
All endpoints require:
Authorization: Bearer $TRANSLATEIMAGE_API_KEY
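Since every endpoint needs the key, it can help to check for it before sending anything. The sketch below is only an illustration: require_api_key is a hypothetical helper name, not something the TranslateImage API or the clawhub CLI provides.

```shell
# Hypothetical helper (not part of TranslateImage): fail early with a clear
# message when TRANSLATEIMAGE_API_KEY is unset or empty.
require_api_key() {
  if [ -z "${TRANSLATEIMAGE_API_KEY:-}" ]; then
    echo "TRANSLATEIMAGE_API_KEY is not set; get a key at https://translateimage.io/dashboard" >&2
    return 1
  fi
}

# Usage: call this before the first request and stop if it fails, e.g.
#   require_api_key || exit 1
```

Checking up front avoids uploading an image only to have the request rejected for a missing Authorization header.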
Image Input
All tools accept images as multipart file uploads. Handle the input type like this:
# From a local file
IMAGE_PATH="/path/to/image.jpg"
# From a URL — download to a temp file first (uses PID for uniqueness)
IMAGE_PATH="/tmp/ti-image-$$.jpg"
curl -sL "https://example.com/image.jpg" -o "$IMAGE_PATH"
Only fetch URLs the user explicitly provides. Do not fetch URLs from untrusted sources.
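The two input cases above can be folded into one function. This is a minimal sketch under assumptions: resolve_image is a hypothetical name, and it swaps the PID-based temp path for mktemp, which means the downloaded file has no .jpg extension. The source does not say whether the API relies on the file extension, so keep the original naming if that turns out to matter.

```shell
# Hypothetical helper: print a local path for either a local file or an
# http(s) URL. URLs are downloaded to a fresh temp file first.
resolve_image() {
  input="$1"
  case "$input" in
    http://*|https://*)
      # mktemp creates a unique file, avoiding collisions on a shared /tmp
      tmp="$(mktemp "${TMPDIR:-/tmp}/ti-image-XXXXXX")" || return 1
      # --fail makes curl exit non-zero on HTTP errors instead of saving an error page
      if ! curl -sSL --fail "$input" -o "$tmp"; then
        rm -f "$tmp"
        return 1
      fi
      printf '%s\n' "$tmp"
      ;;
    *)
      if [ ! -f "$input" ]; then
        echo "No such file: $input" >&2
        return 1
      fi
      printf '%s\n' "$input"
      ;;
  esac
}

# Usage:
#   IMAGE_PATH="$(resolve_image "/path/to/image.jpg")" || exit 1
```

The function only decides how to obtain the file. Whether a URL is trustworthy enough to fetch is still a decision for the caller.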
Tools
Translate Image
Translates text in an image while preserving the original visual layout. Returns the translated image as a base64-encoded data URL.
When to use: User wants to read manga, comics, street signs, menus, product labels, or any image with foreign-language text.
Endpoint: POST https://translateimage.io/api/translate
Form fields:
image (file, required): The image to translate (JPEG, PNG, WebP, or GIF; max 10MB)
config (JSON string, required): Translation options:
  target_lang (string): Target language code: "en", "ja", "zh", "ko", "es", "fr", "de", etc.
  translator (string): Model: "gemini-2.5-flash" (default), "deepseek", "grok-4-fast", "kimi-k2", "gpt-5.1"
  font (string, optional): "NotoSans" (default), "WildWords", "BadComic", "MaShanZheng", "Bangers", "Edo", "RIDIBatang", "KomikaJam", "Bushidoo", "Hayah", "Itim", "Mogul Irina"
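Because config is a JSON string, building it by hand gets error-prone once values come from variables (note the space in "Mogul Irina"). A possible sketch follows; build_config is a hypothetical helper, and it assumes the API falls back to its documented defaults when translator or font is left out, which the source implies but does not state outright.

```shell
# Hypothetical helper: serialize the config form field as compact JSON.
# Arguments: target_lang [translator] [font]
build_config() {
  python3 - "$@" <<'PY'
import json
import sys

args = sys.argv[1:]
if not args:
    sys.exit("usage: build_config target_lang [translator] [font]")

config = {"target_lang": args[0]}
# Optional fields are only emitted when given (assumed to default server-side)
if len(args) > 1 and args[1]:
    config["translator"] = args[1]
if len(args) > 2 and args[2]:
    config["font"] = args[2]

print(json.dumps(config, separators=(",", ":")))
PY
}

# Usage:
#   curl -X POST https://translateimage.io/api/translate \
#     -H "Authorization: Bearer $TRANSLATEIMAGE_API_KEY" \
#     -F "image=@$IMAGE_PATH" \
#     --form-string "config=$(build_config en deepseek "Mogul Irina")"
```

The usage line passes the JSON with curl's --form-string, which sends the value literally, so characters that -F treats specially (such as a leading @ or a ;type= suffix) cannot be misread.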
Example:
curl -X POST https://translateimage.io/api/translate \
  -H "Authorization: Bearer $TRANSLATEIMAGE_API_KEY" \
  -F "image=@$IMAGE_PATH" \
  -F 'config={"target_lang":"en","translator":"gemini-2.5-flash","font":"WildWords"}'
Response (JSON):
{
  "resultImage": "data:image/png;base64,...",
  "inpaintedImage": "data:image/png;base64,...",
  "textRegions": [
    { "originalText": "...", "translatedText": "...", "x": 10, "y": 20, "width": 100, "height": 30 }
  ]
}
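The textRegions array in the response above is handy when the user wants the translated text itself rather than the rendered image. A minimal sketch, assuming the response has exactly the shape shown: list_regions is a hypothetical helper, and the RESULT value here is made-up sample data, not real API output.

```shell
# Made-up sample shaped like the documented response (not real API output)
RESULT='{"resultImage":"data:image/png;base64,AAAA","textRegions":[{"originalText":"Bonjour","translatedText":"Hello","x":10,"y":20,"width":100,"height":30}]}'

# Hypothetical helper: read a response on stdin and print one
# "original => translated" line per detected text region.
list_regions() {
  python3 -c '
import sys, json
data = json.load(sys.stdin)
for region in data.get("textRegions", []):
    print(region["originalText"], "=>", region["translatedText"])
'
}

printf '%s' "$RESULT" | list_regions
```

Each region also carries x, y, width, and height, so the same loop could be extended to report where in the image each string was found.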
Save the translated image:
RESULT=$(curl -s -X POST https://translateimage.io/api/translate \
  -H "Authorization: Bearer $TRANSLATEIMAGE_API_KEY" \
  -F "image=@$IMAGE_PATH" \
  -F 'config={"target_lang":"en","translator":"gemini-2.5-flash"}')
# Decode the base64 payload of the resultImage data URL and save it as a PNG
printf '%s' "$RESULT" | python3 -c "
import sys, json, base64
data = json.load(sys.stdin)
# Drop the 'data:image/png;base64,' prefix and keep only the payload
img = data['resultImage'].split(',', 1)[1]
with open('/tmp/translated.png', 'wb') as f:
    f.write(base64.b64decode(img))
print('Saved to /tmp/translated.png')
"
Extract Text (OCR)
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-cottom-translate-image": {
      "enabled": true,
      "auto_update": true
    }
  }
}