siliconflow-tts-gen
Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/lilei0311/siliconflow-tts-gen
SiliconFlow TTS Generation Skill
Text-to-Speech using SiliconFlow API with CosyVoice2 model. Supports 8 preset voices, multiple languages, and Chinese dialects.
Features
- 🎙️ 8 Preset Voices: 4 male + 4 female voices
- 🌍 Multilingual: Chinese, English, Japanese, Korean
- 🗣️ Chinese Dialects: Cantonese, Sichuan, Shanghai, Tianjin, Wuhan
- ⚡ Ultra-Low Latency: 150 ms first-packet delay
- 🎵 Voice Cloning: 3-second rapid voice cloning
- 💾 Auto Download: Saves audio files locally
Requirements
- Environment Variable: SILICONFLOW_API_KEY
- Optional Config File: ~/.openclaw/openclaw.json (for auto-detect)
Installation
npx clawhub install siliconflow-tts-gen
Configuration
Set your SiliconFlow API key:
export SILICONFLOW_API_KEY="your-api-key"
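The lookup described under Requirements might work roughly as sketched below. This is not the actual logic of scripts/generate.py. The helper names `find_key` and `resolve_api_key` are hypothetical, the precedence (environment variable first, config file second) is an assumption, and so is the idea that the key sits somewhere in openclaw.json under the name SILICONFLOW_API_KEY, since the layout of that file is not documented here.

```python
import json
import os
from pathlib import Path
from typing import Mapping, Optional


def find_key(node: object, name: str) -> Optional[str]:
    """Depth-first search for a non-empty string stored under `name`.

    Assumption: the real config layout is unknown, so this simply walks
    every nested dict and list until it finds a matching key.
    """
    if isinstance(node, dict):
        value = node.get(name)
        if isinstance(value, str) and value:
            return value
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, name)
        if found:
            return found
    return None


def resolve_api_key(
    env: Mapping[str, str] = os.environ,
    config_path: Path = Path.home() / ".openclaw" / "openclaw.json",
) -> Optional[str]:
    """Return the API key from the environment, else from the config file."""
    key = env.get("SILICONFLOW_API_KEY")
    if key:
        return key
    try:
        config = json.loads(config_path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        # Missing or unreadable config: treat as "no key configured".
        return None
    return find_key(config, "SILICONFLOW_API_KEY")
```

Passing `env` and `config_path` as parameters keeps the lookup easy to exercise without touching the real environment or home directory.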
Usage
List Available Voices
python3 scripts/generate.py --list-voices
Generate Speech
# Basic usage (default voice: alex)
python3 scripts/generate.py "你好,世界"
# Specify voice
python3 scripts/generate.py "Hello World" --voice bella
# Adjust speed
python3 scripts/generate.py "你好" --voice claire --speed 0.9
# Save to file
python3 scripts/generate.py "欢迎收听" --output welcome.mp3
# Change format
python3 scripts/generate.py "Hello" --format wav
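For readers who want to see what a command like the ones above might translate to, here is a minimal sketch of the HTTP request. It is not taken from scripts/generate.py. The endpoint path, the model identifier, the `model:voice` naming scheme, and the JSON field names are assumptions based on SiliconFlow's OpenAI-style speech endpoint and should be checked against the current API reference. `build_request` and `synthesize` are hypothetical helper names.

```python
import json
import urllib.request

# Assumptions: endpoint path and model identifier are not stated in this README.
API_URL = "https://api.siliconflow.cn/v1/audio/speech"
MODEL = "FunAudioLLM/CosyVoice2-0.5B"


def build_request(text, api_key, voice="alex", speed=1.0, audio_format="mp3"):
    """Build (but do not send) a POST request for one synthesis call."""
    payload = {
        "model": MODEL,
        "input": text,
        "voice": f"{MODEL}:{voice}",  # assumed "model:voice" naming scheme
        "response_format": audio_format,
        "speed": speed,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, api_key, output="output.mp3", **options):
    """Send the request and save the returned audio bytes to `output`."""
    request = build_request(text, api_key, **options)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(output, "wb") as fh:
        fh.write(audio)
    return output
```

Keeping request construction separate from the network call makes it possible to inspect the payload without an API key or a live connection.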
Available Voices
Male Voices
| ID | Name | Characteristic |
|---|---|---|
| alex | 沉稳男声 | Mature and steady |
| benjamin | 低沉男声 | Deep and low |
| charles | 磁性男声 | Magnetic |
| david | 欢快男声 | Cheerful |
Female Voices
| ID | Name | Characteristic |
|---|---|---|
| anna | 沉稳女声 | Mature and elegant |
| bella | 激情女声 | Passionate |
| claire | 温柔女声 | Gentle and kind |
| diana | 欢快女声 | Sweet and happy |
Parameters
| Parameter | Type | Default | Range | Description |
|---|---|---|---|---|
| --voice | string | alex | - | Voice ID |
| --speed | float | 1.0 | 0.25-4.0 | Speech speed |
| --format | string | mp3 | mp3/opus/wav/pcm | Output format |
| --output | string | output.mp3 | - | Output file path |
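The table above maps naturally onto an argparse definition. The sketch below shows one way to enforce the documented defaults and ranges. It is an illustration, not the parser in scripts/generate.py. `speed_value` and `build_parser` are hypothetical names, and rejecting out-of-range speeds at parse time is an assumption about the intended behavior.

```python
import argparse

FORMATS = ("mp3", "opus", "wav", "pcm")


def speed_value(raw: str) -> float:
    """Parse --speed and enforce the documented 0.25-4.0 range."""
    value = float(raw)
    if not 0.25 <= value <= 4.0:
        raise argparse.ArgumentTypeError("speed must be between 0.25 and 4.0")
    return value


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="SiliconFlow TTS (sketch)")
    # Text is optional so that --list-voices can be used on its own.
    parser.add_argument("text", nargs="?", help="Text to synthesize")
    parser.add_argument("--list-voices", action="store_true",
                        help="Print the available voice IDs and exit")
    parser.add_argument("--voice", default="alex", help="Voice ID")
    parser.add_argument("--speed", type=speed_value, default=1.0,
                        help="Speech speed (0.25-4.0)")
    parser.add_argument("--format", choices=FORMATS, default="mp3",
                        help="Output format")
    parser.add_argument("--output", default="output.mp3",
                        help="Output file path")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

With this definition, an unsupported format or an out-of-range speed makes argparse print a usage error and exit before any request is sent.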
Security Notes
- This skill requires an API key to call SiliconFlow services
- The script reads ~/.openclaw/openclaw.json only to auto-detect API keys
- No sensitive data is transmitted except to api.siliconflow.cn
- Review the code at scripts/generate.py before providing credentials
Author
MaxStorm Team
License
MIT
Paste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-lilei0311-siliconflow-tts-gen": {
"enabled": true,
"auto_update": true
}
}
}