Official Verified communication Safety 4/5

senseaudio-voice

SenseAudio Voice - Text-to-speech (TTS) + speech recognition (ASR), with automatic language switching

What This Skill Does

The senseaudio-voice skill adds voice processing to the OpenClaw AI agent, combining Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) in a single workflow. It switches between the SenseAudio API (best for Chinese synthesis) and Microsoft's Edge TTS (used for English and Japanese) according to the target language, so output sounds natural in each of the three.

Key features include:

Automated Language Intelligence: The skill automatically detects the input language and chooses the optimal engine, reducing the need for manual configuration.
WAV/MP3 Export: Provides flexible file output, with audio files automatically organized by date in your workspace directory.
Flexible Synthesis: Supports multiple voices, adjustable speed, and adjustable volume, so each agent persona can have its own voice settings.
Robust Playback: Built-in support for automatic playback after generation, ensuring immediate feedback for user queries.
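
The language-based engine choice described above can be sketched as follows. This is a minimal illustration and not the skill's actual detection logic: the function names, the Unicode-range heuristic, and the engine labels "senseaudio" and "edge-tts" are all assumptions made for the example.

```python
def detect_language(text: str) -> str:
    """Return 'ja', 'zh', or 'en' using a simple Unicode-range heuristic.

    Hypothetical helper: the real skill may detect language differently.
    """
    has_han = False
    for ch in text:
        code = ord(ch)
        # Hiragana (U+3040-U+309F) or Katakana (U+30A0-U+30FF) implies Japanese.
        if 0x3040 <= code <= 0x30FF:
            return "ja"
        # CJK Unified Ideographs are shared by Chinese and Japanese text.
        if 0x4E00 <= code <= 0x9FFF:
            has_han = True
    # Ideographs without any kana are treated as Chinese; everything else as English.
    return "zh" if has_han else "en"


def choose_engine(text: str) -> str:
    """Map the detected language to an engine label (labels are illustrative)."""
    return "senseaudio" if detect_language(text) == "zh" else "edge-tts"


if __name__ == "__main__":
    for sample in ("你好，世界", "おはようございます", "Good morning"):
        print(sample, detect_language(sample), choose_engine(sample))
```

Checking for kana first matters because Japanese text usually mixes kana with the same ideographs used in Chinese; a Japanese string written only in kanji would still be classified as Chinese by this heuristic.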

Installation

To install this skill, run the following command in your terminal:

clawhub install openclaw/skills/skills/cowboy231/kid-point-voice-component

After installation, add your SENSE_API_KEY to the ~/.openclaw/openclaw.json configuration file if you intend to use the premium SenseAudio features for Chinese synthesis.
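
To confirm the key is present before running the skill, a small check along these lines may help. It is a sketch under two assumptions: that openclaw.json is plain JSON, and that SENSE_API_KEY may sit at any nesting level (the page does not say where in the file it belongs, so the code searches the whole structure). The helper names find_key and load_sense_api_key are hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Optional


def find_key(node: Any, key: str) -> Optional[str]:
    """Depth-first search for `key` in nested dicts/lists; return its non-empty string value."""
    if isinstance(node, dict):
        value = node.get(key)
        if isinstance(value, str) and value:
            return value
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found:
            return found
    return None


def load_sense_api_key(config_path: Path) -> Optional[str]:
    """Return SENSE_API_KEY from the config file, or None if it is missing or unreadable."""
    try:
        data = json.loads(config_path.read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return None
    return find_key(data, "SENSE_API_KEY")


if __name__ == "__main__":
    key = load_sense_api_key(Path.home() / ".openclaw" / "openclaw.json")
    print("SENSE_API_KEY found" if key else "SENSE_API_KEY not set")
```

The script only reports whether a key exists; it never prints the key itself, which keeps the credential out of terminal logs.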

Use Cases

This skill is designed for interactive scenarios that require human-like spoken responses. Common use cases include:

Educational Tools: Using the default child voice to create engaging study assistants for children.
Internationalized Agents: Building agents that communicate fluently across Chinese, English, and Japanese markets without needing separate deployments.
Accessibility: Converting textual AI outputs into spoken audio for users who prefer auditory information.
Workflow Automation: Creating automated reminders or status notifications that can be played on a local speaker via system commands.

Example Prompts

"Speak the following in a friendly voice: 'Good morning, please review your dashboard for today's tasks.'"
"Speak this Japanese text using the female voice: 'おはようございます、今日の予定を確認しましょう。'"
"Summarize the last report and play the audio using the male_0004_a voice profile."

Tips & Limitations

Chinese Language: For optimal results, ensure your SenseAudio credentials are valid. This provides access to natural-sounding Chinese voices that are superior to the fallback options.
Edge TTS: English and Japanese synthesis needs no API key, because the skill connects directly to Microsoft's Edge TTS service.
Storage Management: Files are saved in {workspace}/audio/YYYY-MM-DD/. Periodically clear this folder to save disk space if the agent is used for high-volume tasks.
Dependency: The skill relies on the requests library. Ensure your environment has basic internet access to reach the external TTS endpoints.
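
The dated storage layout mentioned under Storage Management lends itself to a simple cleanup routine. The sketch below assumes the {workspace}/audio/YYYY-MM-DD/ layout stated above; the function names, the keep_days parameter, and the retention policy itself are hypothetical and not part of the skill.

```python
import shutil
from datetime import date, datetime, timedelta
from pathlib import Path
from typing import List


def audio_dir_for(workspace: Path, day: date) -> Path:
    """Build the dated folder path, e.g. <workspace>/audio/2026-03-25."""
    return workspace / "audio" / day.isoformat()


def prune_audio(workspace: Path, keep_days: int, today: date) -> List[Path]:
    """Delete dated audio folders older than `keep_days` days; return the removed paths."""
    cutoff = today - timedelta(days=keep_days)
    removed: List[Path] = []
    root = workspace / "audio"
    if not root.is_dir():
        return removed
    for entry in sorted(root.iterdir()):
        if not entry.is_dir():
            continue
        try:
            folder_day = datetime.strptime(entry.name, "%Y-%m-%d").date()
        except ValueError:
            # Leave folders that do not follow the date naming scheme untouched.
            continue
        if folder_day < cutoff:
            shutil.rmtree(entry)
            removed.append(entry)
    return removed
```

A call such as prune_audio(Path("/path/to/workspace"), keep_days=7, today=date.today()) would remove folders more than a week old; the workspace path here is a placeholder. Passing today explicitly keeps the function easy to test.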

Metadata

Author: @cowboy231

Stars: 3409

Updated: 2026-03-25

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cowboy231-kid-point-voice-component": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Related Skills

narrator-ai-cli

Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.

4myhime 4473

narrator-ai-cli

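Skill for automatically generating AI film commentary videos (AI解说大师 CLI Skill). Triggers when the user needs to create film commentary videos, short-drama commentary, derivative film and TV edits, AI-dubbed narration videos, film commentary, video narration, drama dubbing, or movie narration. Includes 93 built-in movie assets, 146 BGM tracks, 63 dubbing voices (11 languages), and 90+ narration templates. The narrator-ai-cli command-line tool automates the full pipeline: search and pick a film → choose a template → choose BGM → choose a dubbing voice → generate the script → render the video. CLI client for Narrator AI (AI解说大师) video narration API. Use when user needs to create AI narration videos, manage narration tasks, browse dubbing/BGM/material resources, or automate video production.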

4myhime 4473

podcast-agent

Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.

besty0121 4473

ym-mediatoolkit

Streaming video processing toolkit: compression, cover extraction, and audio conversion, without downloading the full video

370299455cx-web 4473

content-rewriter

Cross-platform content repurposer. Takes one piece of content and rewrites it for multiple Chinese social media platforms, adapting tone, format, length, and style.

alexandrali1989-cell 4473
