sapi-tts
Windows SAPI5 text-to-speech with Neural voices. Lightweight alternative to GPU-heavy TTS - zero GPU usage, instant generation. Auto-detects best available voice for your language. Works on Windows 10/11.
Why use this skill?
Add instant, zero-GPU text-to-speech to your OpenClaw agent using Windows SAPI5. Supports Neural voices and works on Windows 10 and 11.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/dexiaong/sapi-ttslWhat This Skill Does
The sapi-tts skill is a highly optimized, lightweight text-to-speech solution designed for Windows 10 and 11 environments. By leveraging the built-in SAPI5 (Speech API version 5) interface, this skill allows OpenClaw agents to generate human-like speech output without consuming any GPU resources. It is engineered for instant, real-time generation, making it an ideal choice for low-latency voice notifications, automated reading tasks, or accessible UI components. The skill intelligently auto-detects the best available voice based on the user's system language, supporting both high-quality Neural voices (on Windows 11) and stable legacy voices (on Windows 10).
Installation
To install the sapi-tts skill, follow these steps:
- Open your terminal in the designated OpenClaw skills directory.
- Execute the command:
clawhub install openclaw/skills/skills/dexiaong/sapi-tts. - Navigate to the installed directory and ensure the provided
tts.ps1PowerShell script is present. - Verify your environment has execution policy settings that allow local script execution by running
Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser. - You can test the installation by listing available voices with the command
./tts.ps1 -ListVoices.
Use Cases
- Real-time Voice Feedback: Providing instant audible alerts or confirmations for long-running agent tasks.
- Accessibility: Converting textual data, logs, or chat history into audio for visually impaired users.
- Agent Interaction: Adding a natural human interface to your agent without the hardware overhead of running cloud-based or local GPU-intensive TTS models.
- Scripted Automation: Integrating voice synthesis directly into local Windows workflows where network-dependent cloud TTS APIs might be too slow or unreliable.
Example Prompts
- "Speak the following text aloud using the best available neural voice: 'Task completed successfully.'"
- "List all my installed Windows voices to see if I have a high-quality French neural voice available."
- "Convert this status report into speech and play it back to me immediately."
Tips & Limitations
- Neural vs Legacy: Neural voices are only available natively on Windows 11. On Windows 10, the output will default to legacy SAPI5 voices which may sound more synthetic. Ensure your Windows speech recognition settings are updated.
- Performance: Because this uses local system calls, generation speed is nearly instantaneous regardless of text length.
- Zero GPU Usage: Perfect for systems where your GPU is already fully utilized by LLMs or image generation models.
- Security: The script runs locally via PowerShell; ensure it remains in a trusted directory.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-dexiaong-sapi-ttsl": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, code-execution
Related Skills
omnicog
Universal service integration for OpenClaw — connect Reddit, Steam, Spotify, GitHub, Discord, and more with a single API.
chaterimo
AI Customer Service for Shopify & E-commerce - Query conversations, analyze chatbot performance, and manage your Chaterimo AI assistant
lulu-monitor
AI-powered LuLu Firewall companion for macOS. Monitors firewall alerts, analyzes connections with AI, sends Telegram notifications with Allow/Block buttons. Use when setting up LuLu integration, handling firewall callbacks, or troubleshooting LuLu Monitor issues.
gmail-client-PM
Read and send emails via Gmail. Use to list unread messages, read specific emails by ID, or send new emails.
arb-injection
BYOCB ArbInjectionSkill: Scan EVM smart contracts for arbitrary call injection vulnerabilities. Monitor chains in real-time or scan specific addresses.