Openclaw Voice
Skill by frank-bot07
Why use this skill?
Enhance your OpenClaw CLI with voice capabilities. Use Whisper STT and ElevenLabs TTS to record, store, and listen to AI-generated transcripts locally.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/frank-bot07/openclaw-voiceWhat This Skill Does
OpenClaw Voice is a robust bridge between natural language and your local machine, integrating Whisper for accurate Speech-to-Text (STT) and ElevenLabs for high-quality Text-to-Speech (TTS). Designed as a modular component within the OpenClaw ecosystem, this skill enables users to record voice inputs, process them into actionable text, and generate expressive auditory feedback. By leveraging SQLite (in WAL mode) for local persistence, it ensures that your conversations are cataloged, timestamped, and searchable for later retrieval. It effectively turns command-line workflows into interactive, voice-augmented experiences.
Installation
To install this skill, use the clawhub CLI tool within your terminal. Ensure your environment is set up with Node.js ESM support and that the required system dependencies like sox are installed on your host machine:
clawhub install openclaw/skills/skills/frank-bot07/openclaw-voice
Use Cases
- Voice Logging: Effortlessly record meeting summaries or fleeting thoughts directly into your searchable local database.
- Auditory Feedback: Receive spoken responses from your AI agents, allowing you to stay productive away from the screen.
- Offline History Management: Maintain a searchable repository of voice-based interactions for project documentation.
- Hands-Free CLI Interaction: Use voice commands to trigger system tasks without needing to type syntax into your terminal.
Example Prompts
- "OpenClaw, start recording my meeting summary and save it to the history log."
- "OpenClaw, generate a spoken summary of the last three transcripts using the ElevenLabs TTS engine."
- "OpenClaw, find and read back the transcript tagged with 'Project Alpha' from yesterday."
Tips & Limitations
Currently, this skill operates as a CLI-based tool, meaning it is optimized for discrete command executions rather than real-time, low-latency conversational streams—a feature slated for v1.1. Ensure your system's audio input/output devices are correctly configured for sox and ffplay. Since this skill accesses external APIs for STT/TTS services, you must provide valid credentials for Whisper and ElevenLabs. Regularly vacuum your SQLite database to maintain optimal search performance for large transcript volumes.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-frank-bot07-openclaw-voice": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api
Related Skills
Model Switchboard
Skill by frank-bot07
nano-banana-2
Gemini image generation, editing, and search-grounded image creation via gemini-3.1-flash-image-preview (Nano Banana 2). USE FOR: - Generating images from text prompts (text-to-image) - Editing or transforming an existing image with text instructions - Generating images grounded in live web/image search results Requires GEMINI_API_KEY environment variable. See rules/setup.md for configuration and rules/security.md for output handling guidelines.
openclaw-ecommerce
E-commerce price monitoring, order tracking, and margin analysis for OpenClaw agents. Track product prices, get alerts on drops, manage orders, and calculate real margins after fees.
Openclaw Crm
Skill by frank-bot07
Openclaw Monitor
Skill by frank-bot07