senseaudio-voice
SenseAudio Voice - Text-to-Speech (TTS) + Automatic Speech Recognition (ASR), with automatic language switching
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/cowboy231/senseaudio-voice-cn
What This Skill Does
SenseAudio Voice is a dual-engine voice interaction skill for OpenClaw that combines Text-to-Speech (TTS) and Automatic Speech Recognition (ASR). It routes each request to SenseAudio for Chinese voice generation or to Microsoft Edge TTS for English and Japanese, so the engine matches the user's target language. It also provides automatic language detection, selectable voices, and output audio organized into per-date folders.
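The routing described above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual detector: the function names `detect_language` and `choose_engine` are hypothetical, and the character-range heuristic is an assumption chosen for brevity.

```python
import re

# Unicode ranges for this heuristic (an assumption, not the skill's real detector).
KANA = re.compile(r"[\u3040-\u30ff]")  # Hiragana and Katakana
HAN = re.compile(r"[\u4e00-\u9fff]")   # CJK Unified Ideographs


def detect_language(text: str) -> str:
    """Return 'ja', 'zh', or 'en' using a simple character-range heuristic."""
    if KANA.search(text):
        return "ja"  # kana only occurs in Japanese, so check it before Han
    if HAN.search(text):
        return "zh"
    return "en"


def choose_engine(text: str, engine: str = "auto") -> str:
    """Pick a backend name; an explicit engine overrides detection."""
    if engine != "auto":
        return engine
    # Chinese goes to SenseAudio; English and Japanese go to Edge TTS.
    return "senseaudio" if detect_language(text) == "zh" else "edge"
```

Checking for kana first matters because Japanese text usually mixes kana with Han characters, so a Han-only test would misclassify it as Chinese.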
Installation
To install this skill, run the following command in your terminal:
clawhub install openclaw/skills/skills/cowboy231/senseaudio-voice-cn
Ensure the Python requests library is installed. To generate Chinese audio, obtain an API key from the SenseAudio platform and save it in ~/.openclaw/openclaw.json under env.SENSE_API_KEY.
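One way to store the key without disturbing other settings is sketched below. It assumes that openclaw.json is plain JSON and that "env.SENSE_API_KEY" means a SENSE_API_KEY entry inside a top-level "env" object; the helper name `set_sense_api_key` is hypothetical.

```python
import json
from pathlib import Path


def set_sense_api_key(config_path: Path, api_key: str) -> dict:
    """Store api_key under env.SENSE_API_KEY, keeping other settings intact."""
    config = {}
    if config_path.exists():
        # Assumes the file is plain JSON (no comments or trailing commas).
        config = json.loads(config_path.read_text(encoding="utf-8"))
    config.setdefault("env", {})["SENSE_API_KEY"] = api_key
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config
```

A call such as set_sense_api_key(Path.home() / ".openclaw" / "openclaw.json", "YOUR_API_KEY") would then write the key, with the placeholder replaced by the value issued by the SenseAudio platform.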
Use Cases
- Educational Tools: Perfect for children's learning scenarios, utilizing the child_0001_a voice to provide interactive feedback or homework reminders.
- Multilingual Assistant: Acts as a bridge for AI agents that need to communicate effectively in Chinese, English, or Japanese without switching backend configurations manually.
- Accessibility Services: Provides a voice output layer for automation scripts, making AI agents more interactive and approachable for non-technical users.
- Content Creation: Quickly batch-generate voice-over files for multimedia projects, saved with intuitive timestamped naming conventions.
Example Prompts
- "OpenClaw, please read this message aloud using the standard Chinese child voice: 'It is time to start your math practice now.'"
- "Convert the following text to an English audio file: 'Welcome to the system, how can I assist you today?' and play it back for me."
- "Summarize the latest weather report and narrate it using a professional male voice, then save the output to my workspace."
Tips & Limitations
- Engine Selection: While the auto mode works well for most requests, you can force a specific engine with --engine senseaudio or --engine edge if you prefer a particular sound profile.
- Storage: Audio files are stored in {workspace}/audio/YYYY-MM-DD/. If you generate large volumes of audio, clear this directory regularly to keep the workspace clean.
- Dependencies: This skill is optimized for simplicity; it relies primarily on the requests library and avoids heavy local decoding dependencies, making it fast and portable. Note that the Chinese engine requires a verified mainland mobile number, whereas the Edge TTS engine is globally accessible without authentication.
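The storage layout and the cleanup advice above can be sketched as follows. This is an illustration under assumptions, not the skill's own code: the helper names `dated_audio_dir` and `prune_audio` and the seven-day retention default are hypothetical.

```python
from datetime import date, timedelta
from pathlib import Path
import shutil


def dated_audio_dir(workspace: Path, day: date) -> Path:
    """Return (and create) {workspace}/audio/YYYY-MM-DD/ for the given day."""
    out_dir = workspace / "audio" / day.isoformat()
    out_dir.mkdir(parents=True, exist_ok=True)
    return out_dir


def prune_audio(workspace: Path, today: date, keep_days: int = 7) -> list[str]:
    """Delete dated folders older than keep_days; return the removed names."""
    cutoff = today - timedelta(days=keep_days)
    removed = []
    audio_root = workspace / "audio"
    if not audio_root.is_dir():
        return removed
    for child in sorted(audio_root.iterdir()):
        try:
            folder_day = date.fromisoformat(child.name)
        except ValueError:
            continue  # skip anything that is not a date-named folder
        if child.is_dir() and folder_day < cutoff:
            shutil.rmtree(child)
            removed.append(child.name)
    return removed
```

Passing the date in explicitly, rather than calling date.today() inside the helpers, keeps the behavior easy to test and avoids surprises around midnight.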
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-cowboy231-senseaudio-voice-cn": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags
Flags: network-access, file-write, file-read, external-api
Related Skills
narrator-ai-cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.
narrator-ai-cli
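Skill for automatically generating AI movie narration videos (AI解说大师 CLI Skill). Triggered when the user needs to create movie narration videos, short-drama narration, derivative film and TV edits, AI-dubbed voice-over videos, film commentary, video narration, drama dubbing, or movie narration. Includes 93 built-in movie assets, 146 BGM tracks, 63 dubbing voices (11 languages), and 90+ narration templates. Uses the narrator-ai-cli command-line tool to automate the full pipeline: search for and pick a film → choose a template → choose BGM → choose a dubbing voice → generate the script → compose the video. CLI client for Narrator AI (AI解说大师) video narration API. Use when user needs to create AI narration videos, manage narration tasks, browse dubbing/BGM/material resources, or automate video production.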
podcast-agent
Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.
ym-mediatoolkit
Streaming video processing toolkit - compression, cover extraction, and audio conversion, without downloading the full video
content-rewriter
Cross-platform content repurposer. Takes one piece of content and rewrites it for multiple Chinese social media platforms, adapting tone, format, length, and style.