voice-ai-tts
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
Why use this skill?
Integrate Voice.ai into OpenClaw for professional text-to-speech. Features 9 personas, 11 languages, and real-time streaming audio capabilities.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/gizmogremlin/openclaw-skill-voice-ai-voicesWhat This Skill Does
The voice-ai-tts skill for OpenClaw provides a robust, high-quality text-to-speech synthesis engine powered by the Voice.ai API. It enables users to convert text into lifelike speech directly within their terminal or OpenClaw environment. By leveraging a library of 9 distinct, carefully curated personas and support for 11 different languages, the skill is designed for versatility. It supports both standard file-based synthesis and real-time streaming, allowing for lower latency output when generating longer passages of text. Because it is pre-integrated into the OpenClaw framework, users can interact with the synthesis engine using simple chat-based commands, removing the need for complex API handling or external audio manipulation tools.
Installation
Installation is streamlined and does not require external NPM dependencies or heavy configuration. Because the skill is bundled with its own Node.js SDK and CLI tools, it is ready to use immediately upon installation via the ClawHub repository. Users only need to set a single environment variable, VOICE_AI_API_KEY, which is obtained from the official Voice.ai dashboard. Once the key is configured, the skill automatically registers its commands with OpenClaw, making the /tts and /voices commands available for immediate use.
Use Cases
This skill is ideal for developers, content creators, and accessibility-focused users. You can use it to generate voice-overs for video projects, provide auditory feedback for automated scripts, or create conversational AI agents that need a human-like voice. It is particularly effective for real-time applications where streaming output allows for immediate auditory feedback, such as reading back search results or providing summaries during a long-running task.
Example Prompts
- "/tts --voice ellie Good evening, the system update is complete and all services are running normally."
- "/tts --stream This is an experiment to see how quickly the audio can start playing while the remainder of the long text is being processed by the backend."
- "/voices"
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-gizmogremlin-openclaw-skill-voice-ai-voices": {
"enabled": true,
"auto_update": true
}
}
}Tags
Flags: network-access, file-read, file-write, external-api
Related Skills
narrator-ai-cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.
narrator-ai-cli
AI电影解说视频自动生成技能(AI解说大师 CLI Skill)。当用户需要创建电影解说视频、短剧解说、影视二创、AI配音旁白视频、film commentary、video narration、drama dubbing、movie narration时触发。内置93部电影素材、146首BGM、63种配音音色(11种语言)、90+解说模板。通过narrator-ai-cli命令行工具实现:搜片选片→选择模板→选BGM→选配音→生成文案→合成视频的全流程自动化。CLI client for Narrator AI (AI解说大师) video narration API. Use when user needs to create AI narration videos, manage narration tasks, browse dubbing/BGM/material resources, or automate video production.
podcast-agent
Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.
ym-mediatoolkit
流式视频处理工具集 - 压缩、封面提取、音频转换,无需下载完整视频
video-producer
短视频一键生成技能 v2.2。调用video-director进行画面规划,然后生成AI素材、TTS配音、视频渲染,输出完整MP4。