doubao-open-tts
Text-to-Speech service using Doubao (Volcano Engine) API with 200+ voices, interactive voice selection, and multilingual support
Why use this skill?
Integrate 200+ natural-sounding voices into your AI agents with the Doubao Open TTS skill. Support for multiple formats, speed control, and secure API handling.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/xdrshjr/doubao-api-open-ttsWhat This Skill Does
The doubao-open-tts skill is a sophisticated Text-to-Speech (TTS) integration designed for the OpenClaw AI agent ecosystem. Leveraging the robust Volcano Engine (Doubao) API, this skill allows users to convert plain text into high-quality, natural-sounding audio. It provides access to over 200 distinct voice profiles, catering to various personas, emotional tones, and language requirements. The skill is built with a developer-first approach, offering both a command-line interface and a clean Python API, enabling seamless integration into custom agent workflows. Key features include adjustable speech speed, custom volume controls, and support for multiple audio formats (mp3, pcm, wav).
Installation
To integrate this skill into your environment, use the OpenClaw CLI tool. Execute the following command in your terminal:
clawhub install openclaw/skills/skills/xdrshjr/doubao-api-open-tts
Ensure that you have an active internet connection and that your OpenClaw environment is properly initialized. After installation, the skill will be available for use within your agent scripts, following the standard module import structure.
Use Cases
- Content Creation: Convert written blog posts or articles into audiobooks or podcast episodes using professional-grade voices.
- Accessible Interfaces: Add speech capabilities to your agent to provide voice-based responses for users with visual impairments or those who prefer audio interaction.
- Multilingual Support: Generate speech in different languages for global applications, utilizing the vast voice library provided by the Volcano Engine.
- Automated Alerts: Use the TTS service to generate dynamic, personalized audio notifications for system status updates.
Example Prompts
- "OpenClaw, please convert the following text into an audio file using a professional-sounding female voice: 'Welcome to the system update summary.' Save it as report.mp3."
- "I need to generate a voice message for my project. Can you list the available voice categories provided by the Doubao TTS skill?"
- "Convert my latest document to speech using the Shiny voice profile. If my API keys aren't set up yet, let's go through the configuration process together."
Tips & Limitations
- API Security: This skill enforces strict security protocols. If API credentials are missing, the agent is hardcoded to halt execution. Do not attempt to bypass this by hardcoding keys directly into your scripts.
- Voice Selection: With 200+ voices, it is recommended to test several samples to find the one that best matches your target audience's demographic.
- Cost Considerations: Be aware that excessive API usage may incur costs from the Volcano Engine; monitor your usage within their console.
- Credential Management: Always store your App ID, Access Token, and Secret Key in the provided .env file, and never commit this file to public repositories.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-xdrshjr-doubao-api-open-tts": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api