ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

doubao-open-tts

Text-to-Speech service using Doubao (Volcano Engine) API with 200+ voices, interactive voice selection, and multilingual support

Why use this skill?

Integrate 200+ natural-sounding voices into your AI agents with the Doubao Open TTS skill. Support for multiple formats, speed control, and secure API handling.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/xdrshjr/doubao-api-open-tts
Or

What This Skill Does

The doubao-open-tts skill is a sophisticated Text-to-Speech (TTS) integration designed for the OpenClaw AI agent ecosystem. Leveraging the robust Volcano Engine (Doubao) API, this skill allows users to convert plain text into high-quality, natural-sounding audio. It provides access to over 200 distinct voice profiles, catering to various personas, emotional tones, and language requirements. The skill is built with a developer-first approach, offering both a command-line interface and a clean Python API, enabling seamless integration into custom agent workflows. Key features include adjustable speech speed, custom volume controls, and support for multiple audio formats (mp3, pcm, wav).

Installation

To integrate this skill into your environment, use the OpenClaw CLI tool. Execute the following command in your terminal:

clawhub install openclaw/skills/skills/xdrshjr/doubao-api-open-tts

Ensure that you have an active internet connection and that your OpenClaw environment is properly initialized. After installation, the skill will be available for use within your agent scripts, following the standard module import structure.

Use Cases

  1. Content Creation: Convert written blog posts or articles into audiobooks or podcast episodes using professional-grade voices.
  2. Accessible Interfaces: Add speech capabilities to your agent to provide voice-based responses for users with visual impairments or those who prefer audio interaction.
  3. Multilingual Support: Generate speech in different languages for global applications, utilizing the vast voice library provided by the Volcano Engine.
  4. Automated Alerts: Use the TTS service to generate dynamic, personalized audio notifications for system status updates.

Example Prompts

  1. "OpenClaw, please convert the following text into an audio file using a professional-sounding female voice: 'Welcome to the system update summary.' Save it as report.mp3."
  2. "I need to generate a voice message for my project. Can you list the available voice categories provided by the Doubao TTS skill?"
  3. "Convert my latest document to speech using the Shiny voice profile. If my API keys aren't set up yet, let's go through the configuration process together."

Tips & Limitations

  • API Security: This skill enforces strict security protocols. If API credentials are missing, the agent is hardcoded to halt execution. Do not attempt to bypass this by hardcoding keys directly into your scripts.
  • Voice Selection: With 200+ voices, it is recommended to test several samples to find the one that best matches your target audience's demographic.
  • Cost Considerations: Be aware that excessive API usage may incur costs from the Volcano Engine; monitor your usage within their console.
  • Credential Management: Always store your App ID, Access Token, and Secret Key in the provided .env file, and never commit this file to public repositories.

Metadata

Author@xdrshjr
Stars919
Views0
Updated2026-02-12
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-xdrshjr-doubao-api-open-tts": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#tts#audio#voice-synthesis#volcano-engine#accessibility
Safety Score: 4/5

Flags: file-write, file-read, external-api