senseaudio-voice
SenseAudio Voice - 语音合成 (TTS) + 语音识别 (ASR),支持语言自动切换
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/cowboy231/kid-point-voice-componentWhat This Skill Does
The senseaudio-voice skill provides a robust, multi-modal voice processing engine for the OpenClaw AI agent. It seamlessly integrates both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) capabilities into a single, cohesive workflow. By intelligently switching between the high-quality SenseAudio API (best for Chinese synthesis) and Microsoft's Edge TTS (ideal for English and Japanese), this skill ensures a natural-sounding experience regardless of the target language.
Key features include:
- Automated Language Intelligence: The skill automatically detects the input language and chooses the optimal engine, reducing the need for manual configuration.
- WAV/MP3 Export: Provides flexible file output, with audio files automatically organized by date in your workspace directory.
- Flexible Synthesis: Support for multiple voices, adjustable speed, and variable volume settings, allowing for personalized AI persona behavior.
- Robust Playback: Built-in support for automatic playback after generation, ensuring immediate feedback for user queries.
Installation
To install this skill, run the following command in your terminal:
clawhub install openclaw/skills/skills/cowboy231/kid-point-voice-component
Once installed, ensure your ~/.openclaw/openclaw.json configuration file is updated with your SENSE_API_KEY if you intend to use the premium SenseAudio features for Chinese language synthesis.
Use Cases
This skill is designed for interactive scenarios that require human-like response. Common use cases include:
- Educational Tools: Using the default child-voice to create engaging study assistants for children.
- Internationalized Agents: Building agents that communicate fluently across Chinese, English, and Japanese markets without needing separate deployments.
- Accessibility: Converting textual AI outputs into spoken audio for users who prefer auditory information.
- Workflow Automation: Creating automated reminders or status notifications that can be played on a local speaker via system commands.
Example Prompts
- "Speak the following in a friendly voice: 'Good morning, please review your dashboard for today's tasks.'"
- "Convert this text to Japanese using the female voice: 'おはようございます、今日の予定を確認しましょう。'"
- "Summarize the last report and play the audio using the male_0004_a voice profile."
Tips & Limitations
- Chinese Language: For optimal results, ensure your SenseAudio credentials are valid. This provides access to natural-sounding Chinese voices that are superior to the fallback options.
- Edge TTS: For English and Japanese, no API key is required as it connects directly to Microsoft's reliable edge service.
- Storage Management: Files are saved in
{workspace}/audio/YYYY-MM-DD/. Periodically clear this folder to save disk space if the agent is used for high-volume tasks. - Dependency: The skill relies on the
requestslibrary. Ensure your environment has basic internet access to reach the external TTS endpoints.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-cowboy231-kid-point-voice-component": {
"enabled": true,
"auto_update": true
}
}
}Tags
Flags: network-access, file-write, file-read, external-api
Related Skills
narrator-ai-cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.
narrator-ai-cli
AI电影解说视频自动生成技能(AI解说大师 CLI Skill)。当用户需要创建电影解说视频、短剧解说、影视二创、AI配音旁白视频、film commentary、video narration、drama dubbing、movie narration时触发。内置93部电影素材、146首BGM、63种配音音色(11种语言)、90+解说模板。通过narrator-ai-cli命令行工具实现:搜片选片→选择模板→选BGM→选配音→生成文案→合成视频的全流程自动化。CLI client for Narrator AI (AI解说大师) video narration API. Use when user needs to create AI narration videos, manage narration tasks, browse dubbing/BGM/material resources, or automate video production.
podcast-agent
Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.
ym-mediatoolkit
流式视频处理工具集 - 压缩、封面提取、音频转换,无需下载完整视频
content-rewriter
Cross-platform content repurposer. Takes one piece of content and rewrites it for multiple Chinese social media platforms, adapting tone, format, length, and style.