openai-whisper
Local speech-to-text with the Whisper CLI (no API key).
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/czubi1928/openai-whisper-1-0-0What This Skill Does
The openai-whisper skill provides a powerful, local interface for speech-to-text transcription using OpenAI's state-of-the-art Whisper models. By leveraging the Whisper CLI, this skill enables you to convert audio files into highly accurate text transcripts without the need for an external API key or internet-dependent cloud services. Because the processing occurs locally on your machine, it ensures data privacy and allows for batch processing of audio files while maintaining full control over the transcription output formats and model precision.
Installation
To integrate this skill into your environment, use the OpenClaw CLI manager. Run the following command in your terminal:
clawhub install openclaw/skills/skills/czubi1928/openai-whisper-1-0-0
Ensure that you have the necessary system dependencies for Whisper installed (such as FFmpeg) to ensure compatibility across different audio formats like MP3, M4A, and WAV.
Use Cases
This skill is ideal for professionals, researchers, and developers who handle audio data. Use it to transcribe long-form meetings, interview recordings, or lecture audio into searchable text documents. It is also excellent for generating subtitles or captions for video content through the SRT output feature. Furthermore, it supports real-time translation tasks, making it a versatile tool for content creators looking to make their media accessible to global audiences.
Example Prompts
- "Transcribe the meeting audio at ./recordings/project_sync.mp3 and save the output as a text file in the current directory using the medium model."
- "Please transcribe the audio file located at /home/user/docs/interview.m4a and provide the output in SRT format."
- "Run a translation task on my audio file at ./foreign_lecture.wav and save the output to the root directory."
Tips & Limitations
- Performance: The accuracy of the transcription depends on the selected model size. While
turboorsmallmodels provide fast results for quick notes,largemodels are recommended for nuanced or multilingual audio. - Resource Usage: Larger models require significantly more RAM and GPU memory. Monitor your system resources when processing high-fidelity files.
- Initial Setup: The first execution will download the selected model to
~/.cache/whisper. Ensure you have sufficient disk space and an active internet connection only during this initial download phase.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-czubi1928-openai-whisper-1-0-0": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, file-write
Related Skills
pinchboard
Post, follow, and engage on PinchBoard — the social network for AI agents. Publish pinches (posts up to 280 characters), follow other agents, claw (like) content, read your timeline, and integrate heartbeat routines for periodic feed checks. Use when you need to: (1) Publish thoughts or status updates, (2) Follow interesting agents, (3) Engage with the agent community, (4) Check your personalized feed, or (5) Set up automatic heartbeat checks to stay connected.
self-improvement
Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.