tg-media-resolve
Resolve Telegram <media:image>, <media:document>, <media:video> and other media placeholders into actual files for vision/analysis. Use when a Telegram message contains a media placeholder (e.g. <media:image>) that you cannot see — typically in quoted/replied-to messages or group chat history. Downloads the media via Telegram Bot API and returns a local file path for the image tool or further processing.
Why use this skill?
Easily download and analyze Telegram images, videos, and documents. Automate your OpenClaw workflow by converting media placeholders into usable files for AI vision and analysis.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/kurinzo/tg-media-resolveWhat This Skill Does
The tg-media-resolve skill acts as a bridge between the Telegram Bot API and your local environment, enabling OpenClaw to process media files embedded in chat history. Often, Telegram API messages only provide text placeholders like <media:image> or <media:video> for attachments. This skill automates the extraction, temporary forwarding, and local download of these files, converting them into standard file paths that OpenClaw can then feed into vision models or other analysis tools. It handles authentication via your existing bot tokens and manages cleanup of temporary files to keep your workspace tidy.
Installation
To integrate this skill, run the following command in your terminal:
clawhub install openclaw/skills/skills/kurinzo/tg-media-resolve
Ensure that your OpenClaw configuration file is populated with a valid telegram.botToken under the channels section, as the skill relies on this token to authorize API requests for file retrieval.
Use Cases
- Visual Context Analysis: When a user shares a screenshot or document in a group chat history, use this to download and analyze the file content using vision models.
- Content Moderation/Management: Resolve and save documents or images from Telegram channels for external storage or automated archival processes.
- Voice-to-Text Pipelines: Extract voice messages and video notes from Telegram to feed into speech-to-text models for transcription.
- Automated Summarization: Retrieve visual evidence or shared documents referenced in long-running group threads to provide a comprehensive summary of a discussion.
Example Prompts
- "I see a media:image placeholder in the last message from User_A. Can you download it and tell me what is in the image?"
- "Download the document attached to message [id:12345] in the current group and summarize its contents for me."
- "That video note in the reply-to message looks important. Please resolve the media and let me know the transcript of what was said."
Tips & Limitations
- Forwarding Privacy: By default, the skill may perform a temporary forward. Use the
--forward-toflag pointing to your own DM chat ID to prevent the forward from becoming visible in public group chats. - File Limits: Telegram Bot API has a hard cap of 20MB for files downloaded this way. Larger files will fail to process.
- Bot Permissions: Your bot must be a member of the target chat to fetch the message media; ensure the bot has read access to the relevant channels.
- Cleanup: The script includes a cleanup phase, but ensure your
/tmpdirectory has adequate storage capacity for batch processing high-resolution media.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-kurinzo-tg-media-resolve": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, external-api