ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified communication Safety 4/5

senseaudio-voice

SenseAudio Voice - 语音合成 (TTS) + 语音识别 (ASR),支持语言自动切换

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/cowboy231/kid-point-voice-component
Or

What This Skill Does

The senseaudio-voice skill provides a robust, multi-modal voice processing engine for the OpenClaw AI agent. It seamlessly integrates both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) capabilities into a single, cohesive workflow. By intelligently switching between the high-quality SenseAudio API (best for Chinese synthesis) and Microsoft's Edge TTS (ideal for English and Japanese), this skill ensures a natural-sounding experience regardless of the target language.

Key features include:

  • Automated Language Intelligence: The skill automatically detects the input language and chooses the optimal engine, reducing the need for manual configuration.
  • WAV/MP3 Export: Provides flexible file output, with audio files automatically organized by date in your workspace directory.
  • Flexible Synthesis: Support for multiple voices, adjustable speed, and variable volume settings, allowing for personalized AI persona behavior.
  • Robust Playback: Built-in support for automatic playback after generation, ensuring immediate feedback for user queries.

Installation

To install this skill, run the following command in your terminal:

clawhub install openclaw/skills/skills/cowboy231/kid-point-voice-component

Once installed, ensure your ~/.openclaw/openclaw.json configuration file is updated with your SENSE_API_KEY if you intend to use the premium SenseAudio features for Chinese language synthesis.

Use Cases

This skill is designed for interactive scenarios that require human-like response. Common use cases include:

  • Educational Tools: Using the default child-voice to create engaging study assistants for children.
  • Internationalized Agents: Building agents that communicate fluently across Chinese, English, and Japanese markets without needing separate deployments.
  • Accessibility: Converting textual AI outputs into spoken audio for users who prefer auditory information.
  • Workflow Automation: Creating automated reminders or status notifications that can be played on a local speaker via system commands.

Example Prompts

  1. "Speak the following in a friendly voice: 'Good morning, please review your dashboard for today's tasks.'"
  2. "Convert this text to Japanese using the female voice: 'おはようございます、今日の予定を確認しましょう。'"
  3. "Summarize the last report and play the audio using the male_0004_a voice profile."

Tips & Limitations

  • Chinese Language: For optimal results, ensure your SenseAudio credentials are valid. This provides access to natural-sounding Chinese voices that are superior to the fallback options.
  • Edge TTS: For English and Japanese, no API key is required as it connects directly to Microsoft's reliable edge service.
  • Storage Management: Files are saved in {workspace}/audio/YYYY-MM-DD/. Periodically clear this folder to save disk space if the agent is used for high-volume tasks.
  • Dependency: The skill relies on the requests library. Ensure your environment has basic internet access to reach the external TTS endpoints.

Metadata

Author@cowboy231
Stars3409
Views1
Updated2026-03-25
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cowboy231-kid-point-voice-component": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags

#tts#asr#voice#senseaudio#edge#speech#audio#chinese#english#japanese#http
Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api

Related Skills

narrator-ai-cli

Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.

4myhime 4473

narrator-ai-cli

AI电影解说视频自动生成技能(AI解说大师 CLI Skill)。当用户需要创建电影解说视频、短剧解说、影视二创、AI配音旁白视频、film commentary、video narration、drama dubbing、movie narration时触发。内置93部电影素材、146首BGM、63种配音音色(11种语言)、90+解说模板。通过narrator-ai-cli命令行工具实现:搜片选片→选择模板→选BGM→选配音→生成文案→合成视频的全流程自动化。CLI client for Narrator AI (AI解说大师) video narration API. Use when user needs to create AI narration videos, manage narration tasks, browse dubbing/BGM/material resources, or automate video production.

4myhime 4473

podcast-agent

Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.

besty0121 4473

ym-mediatoolkit

流式视频处理工具集 - 压缩、封面提取、音频转换,无需下载完整视频

370299455cx-web 4473

content-rewriter

Cross-platform content repurposer. Takes one piece of content and rewrites it for multiple Chinese social media platforms, adapting tone, format, length, and style.

alexandrali1989-cell 4473