local-stt
Local STT with selectable backends - Parakeet (best accuracy) or Whisper (fastest, multilingual).
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/araa47/local-sttWhat This Skill Does
The local-stt skill provides OpenClaw with high-performance, private, and offline speech-to-text capabilities. By leveraging ONNX Runtime with int8 quantization, it delivers efficient transcription without needing a cloud-based API connection. Users can choose between two specialized backends: Parakeet, which is optimized for high-accuracy English transcription and understands filler words, or Whisper, which offers broad support for 99 languages. The skill is designed for speed and reliability, allowing for fast processing of audio files while keeping sensitive voice data entirely on the host machine.
Installation
To install this skill, use the clawhub CLI utility. Run the following command in your terminal:
clawhub install openclaw/skills/skills/araa47/local-stt
Once installed, ensure that your environment meets the dependency requirements for ONNX Runtime. You can verify the installation by running the script directly via ~/.openclaw/skills/local-stt/scripts/local-stt.py --help to confirm that the backend switches and model configurations are accessible.
Use Cases
This skill is perfect for users who prioritize privacy and low latency. Common use cases include:
- Transcribing private meeting audio or voice memos locally without uploading files to third-party servers.
- Building automated audio processing pipelines where speed is critical, such as monitoring incoming voice messages in Matrix rooms.
- Creating local documentation from recorded interviews, leveraging the high accuracy of Parakeet v2.
- Multilingual environments where Whisper can quickly switch between different languages for transcription needs.
Example Prompts
- "OpenClaw, transcribe the latest audio file in my downloads folder using the Parakeet v2 engine."
- "Please process the voice note in the project room using Whisper base to get a quick summary."
- "Transcribe my last voice recording, but use the large-v3-turbo model for the best possible accuracy."
Tips & Limitations
While the local-stt skill is powerful, there are trade-offs to consider. The Parakeet engine is highly recommended for English-only tasks where accuracy is the primary goal, such as transcribing technical notes or dictated commands. If your content includes multiple languages or needs a smaller footprint, the Whisper backend is superior, specifically the 'tiny' model if system resources are constrained. Keep in mind that while int8 quantization provides excellent performance, it may marginally affect precision compared to full-precision models. Always check the --room-id flag if you intend to pipe results directly to a specific communication channel.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-araa47-local-stt": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, code-execution
Related Skills
local-whisper
Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.
ez-unifi
Use when asked to manage UniFi network - list/restart/upgrade devices, block/unblock clients, manage WiFi networks, control PoE ports, manage traffic rules, create guest vouchers, or any UniFi controller task. Works with UDM Pro/SE, Dream Machine, Cloud Key Gen2+, or self-hosted controllers.
ez-google
Use when asked to send email, check inbox, read emails, check calendar, schedule meetings, create events, search Google Drive, create Google Docs, read or write spreadsheets, find contacts, or any task involving Gmail, Google Calendar, Drive, Docs, Sheets, Slides, or Contacts. Agent-friendly with hosted OAuth - no API keys needed.
gemini-stt
Transcribe audio files using Google's Gemini API or Vertex AI
md-to-pdf
Convert markdown files to clean, formatted PDFs using reportlab