Community Verified media Safety 4/5

openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/openclaw/skills/openai-whisper-api

Download Source Code (.zip)

What This Skill Does

The openai-whisper-api skill integrates OpenAI’s powerful Whisper speech-to-text engine directly into the OpenClaw environment. By utilizing the /v1/audio/transcriptions endpoint, this skill converts spoken language from various audio formats (such as .m4a, .ogg, .mp3, and more) into accurate, readable text. It functions as a lightweight wrapper around the Whisper model, allowing for seamless transcription without requiring manual API calls or complex script management. Whether you are dealing with short voice notes or longer recorded meetings, this skill automates the transcription process, outputting the results directly to your local file system as either plain text or structured JSON data.

Installation

To integrate this skill into your environment, execute the following command in your terminal:

clawhub install openclaw/openclaw/skills/openai-whisper-api

Once installed, ensure your credentials are set up. You can either export the OPENAI_API_KEY environment variable directly in your shell session or add it to your configuration file located at ~/.openclaw/openclaw.json under the designated skill section. Proper authentication is required to access the OpenAI endpoints successfully.

Use Cases

This skill is ideal for professionals and developers who need to document audio content quickly. Use cases include transcribing meeting minutes for team records, generating subtitles for video content, converting voice memos into actionable task lists, or performing qualitative research by transcribing recorded interviews. Its support for custom prompts allows users to prime the AI with specific terminology, speaker names, or unique vocabulary, ensuring higher accuracy for technical or specialized domains.

Example Prompts

"OpenClaw, please transcribe the audio file located at /home/user/downloads/meeting_notes.m4a and save the output as notes.txt."
"Transcribe audio_interview.ogg using the whisper-1 model and export the result as a structured JSON file."
"Transcribe /projects/audio/briefing.mp3, but make sure to prime the AI with the list of technical acronyms: API, CLI, IDE, and SCM."

Tips & Limitations

File Limits: OpenAI enforces file size limits for direct API uploads; ensure your audio files are compressed appropriately before processing.
Precision: Use the --prompt flag effectively. By providing context or names that appear in the audio, you significantly reduce the error rate for specific nouns.
Performance: For long recordings, consider splitting audio files into smaller segments to avoid timeouts or interruption.
Output Format: Rely on the --json flag if you intend to pipe the transcription into further automated workflows or parsing scripts within OpenClaw.

Read Full Documentation on GitHub

Metadata

Author@openclaw

Stars369848

Updated2026-05-08

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-openclaw-openai-whisper-api": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#whisper#transcription#audio#speech-to-text#openai

Safety Score: 4/5

Flags: file-write, file-read, external-api

Related Skills

apple-notes

Create, view, edit, delete, search, move, or export Apple Notes via the memo CLI on macOS.

openclaw 370199

sherpa-onnx-tts

Local text-to-speech via sherpa-onnx (offline, no cloud)

openclaw 370199

goplaces

Query Google Places for text search, place details, resolve, reviews, or scriptable JSON via goplaces.

openclaw 370199

skill-creator

Create, edit, improve, tidy, review, audit, or restructure AgentSkills and SKILL.md files.

openclaw 370199

video-frames

Extract frames or short clips from videos using ffmpeg.

openclaw 370199