ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified communication Safety 4/5

Openclaw Voice

Skill by frank-bot07

Why use this skill?

Enhance your OpenClaw CLI with voice capabilities. Use Whisper STT and ElevenLabs TTS to record, store, and listen to AI-generated transcripts locally.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/frank-bot07/openclaw-voice
Or

What This Skill Does

OpenClaw Voice is a robust bridge between natural language and your local machine, integrating Whisper for accurate Speech-to-Text (STT) and ElevenLabs for high-quality Text-to-Speech (TTS). Designed as a modular component within the OpenClaw ecosystem, this skill enables users to record voice inputs, process them into actionable text, and generate expressive auditory feedback. By leveraging SQLite (in WAL mode) for local persistence, it ensures that your conversations are cataloged, timestamped, and searchable for later retrieval. It effectively turns command-line workflows into interactive, voice-augmented experiences.

Installation

To install this skill, use the clawhub CLI tool within your terminal. Ensure your environment is set up with Node.js ESM support and that the required system dependencies like sox are installed on your host machine: clawhub install openclaw/skills/skills/frank-bot07/openclaw-voice

Use Cases

  • Voice Logging: Effortlessly record meeting summaries or fleeting thoughts directly into your searchable local database.
  • Auditory Feedback: Receive spoken responses from your AI agents, allowing you to stay productive away from the screen.
  • Offline History Management: Maintain a searchable repository of voice-based interactions for project documentation.
  • Hands-Free CLI Interaction: Use voice commands to trigger system tasks without needing to type syntax into your terminal.

Example Prompts

  1. "OpenClaw, start recording my meeting summary and save it to the history log."
  2. "OpenClaw, generate a spoken summary of the last three transcripts using the ElevenLabs TTS engine."
  3. "OpenClaw, find and read back the transcript tagged with 'Project Alpha' from yesterday."

Tips & Limitations

Currently, this skill operates as a CLI-based tool, meaning it is optimized for discrete command executions rather than real-time, low-latency conversational streams—a feature slated for v1.1. Ensure your system's audio input/output devices are correctly configured for sox and ffplay. Since this skill accesses external APIs for STT/TTS services, you must provide valid credentials for Whisper and ElevenLabs. Regularly vacuum your SQLite database to maintain optimal search performance for large transcript volumes.

Metadata

Stars2387
Views1
Updated2026-03-09
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-frank-bot07-openclaw-voice": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#voice#stt#tts#productivity#cli
Safety Score: 4/5

Flags: file-write, file-read, external-api