Deepdub TTS
Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).
Why use this skill?
Easily convert text to natural-sounding speech in OpenClaw with the Deepdub TTS skill. Perfect for Telegram audio integration and accessibility.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/yuval-deepdub/deepdub-tts
What This Skill Does
The Deepdub TTS skill lets OpenClaw turn plain text into high-quality, natural-sounding speech. It calls the Deepdub API and formats the generated audio files as MEDIA attachments automatically, so messaging platforms like Telegram can deliver them as audio. This suits conversational agents that reply by voice as well as agents that produce audio content from text.
Installation
To integrate this skill into your environment, ensure you have Python 3.9 or higher installed. Use the recommended package manager 'uv' to handle dependencies: uv pip install deepdub. Once dependencies are met, install the skill via the OpenClaw hub using the command: clawhub install openclaw/skills/skills/yuval-deepdub/deepdub-tts.
Before launching, you must configure the following environment variables: DEEPDUB_API_KEY and DEEPDUB_VOICE_PROMPT_ID. Optional configurations include setting DEEPDUB_LOCALE (defaulting to en-US) and OPENCLAW_MEDIA_DIR if you prefer a custom storage location for your generated audio files.
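The configuration rules above can be checked before the agent starts. The following is a minimal sketch, not part of the skill itself: the names DeepdubConfig and load_config are hypothetical, and only the four environment variable names and the en-US default come from the text above.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


@dataclass(frozen=True)
class DeepdubConfig:
    """Hypothetical container for the settings described above."""
    api_key: str
    voice_prompt_id: str
    locale: str
    media_dir: Optional[Path]  # None means "leave the storage location at its default"


def load_config(env: Mapping[str, str] = os.environ) -> DeepdubConfig:
    """Read the documented variables and fail early if a required one is missing."""
    required = ("DEEPDUB_API_KEY", "DEEPDUB_VOICE_PROMPT_ID")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    media_dir = env.get("OPENCLAW_MEDIA_DIR")
    return DeepdubConfig(
        api_key=env["DEEPDUB_API_KEY"],
        voice_prompt_id=env["DEEPDUB_VOICE_PROMPT_ID"],
        # DEEPDUB_LOCALE is optional and falls back to en-US.
        locale=env.get("DEEPDUB_LOCALE") or "en-US",
        media_dir=Path(media_dir) if media_dir else None,
    )
```

Calling load_config() with no arguments reads the real process environment. Passing a plain dict instead makes the check easy to exercise without touching your shell.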
Use Cases
- Automated Customer Support: Send personalized audio greetings or troubleshooting steps to users via Telegram.
- Content Creation: Automatically convert research summaries or news articles into podcasts or audio clips.
- Accessibility: Ensure your agent-based services remain inclusive by providing audio versions of text-based information for visually impaired users.
- Interactive Storytelling: Build immersive roleplay scenarios where the agent speaks responses rather than simply typing them.
Example Prompts
- "Deepdub, please convert this message to audio: 'The system update is complete and all services are back online.'"
- "Read the following text using the default voice prompt: [Paste long text here]."
- "Send a voice note to the Telegram channel saying: 'Don't forget to review the project roadmap before our 3 PM meeting.'"
Tips & Limitations
- Voice Quality: Performance heavily depends on the DEEPDUB_VOICE_PROMPT_ID. Experiment with different prompts to find the tone that best fits your agent's personality.
- Cost: Be aware that frequent API calls to Deepdub may incur costs based on your subscription tier.
- File Size: Audio files can become large; ensure your OPENCLAW_MEDIA_DIR has sufficient storage and implement cleanup scripts if your agent generates content at high volume.
- Performance: Generation is dependent on network latency to the Deepdub API. For real-time applications, consider pre-generating static responses.
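The cleanup script mentioned under File Size could look roughly like the sketch below. It is an illustration under stated assumptions: prune_old_audio is a hypothetical helper, the seven-day retention period is arbitrary, and the file extensions are guesses, since the listing does not say which audio format the skill writes.

```python
import os
import time
from pathlib import Path
from typing import List, Optional, Tuple


def prune_old_audio(
    media_dir: Path,
    max_age_days: float = 7.0,
    suffixes: Tuple[str, ...] = (".mp3", ".wav", ".ogg"),  # assumed extensions
    now: Optional[float] = None,
) -> List[Path]:
    """Delete audio files older than max_age_days; return the paths removed."""
    current = time.time() if now is None else now
    cutoff = current - max_age_days * 86400  # 86400 seconds per day
    removed = []
    for path in media_dir.iterdir():
        # Only touch regular files with an audio suffix in the top-level directory.
        if not path.is_file() or path.suffix.lower() not in suffixes:
            continue
        if path.stat().st_mtime < cutoff:
            path.unlink()
            removed.append(path)
    return removed
```

A scheduled job could call prune_old_audio(Path(os.environ["OPENCLAW_MEDIA_DIR"])) once a day. Adjust the suffix list after checking what the skill actually writes to that directory.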
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-yuval-deepdub-deepdub-tts": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags
Flags: external-api, file-write, network-access
Related Skills
narrator-ai-cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.
Lead Radar
Every morning, scans Reddit, Hacker News, Indie Hackers, Stack Overflow, Quora, Hashnode, Dev.to, GitHub, and Lobsters for people actively asking for what you sell. Delivers the top 10 buying-intent leads to your Telegram with a pre-drafted reply. Powered by Gemini 2.5 Flash.
podcast-agent
Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.
agent3-hub
Universal AI resource registry — search and invoke agents, MCP servers, and APIs through a single MCP endpoint. Includes Telegram content search, Google search, X/Twitter search, and more.
ym-mediatoolkit
Streaming video processing toolkit: compression, cover extraction, and audio conversion, without downloading the full video.