ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified productivity Safety 4/5

voice-transcriber

Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves both audio files and text transcripts. Perfect for voice-first AI workflows, founder journaling, and meeting notes.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/aiwithabidi/voice-transcriber-pro
Or

What This Skill Does

The voice-transcriber skill for OpenClaw is a robust utility designed to turn unstructured audio into actionable text documentation. Leveraging the power of Deepgram Nova-3, this skill provides industry-leading transcription accuracy while maintaining a streamlined workflow for saving both the original audio files and their corresponding text transcripts. Whether you are capturing a fleeting idea, archiving a long-form meeting, or creating a voice-first journal, the voice-transcriber handles the heavy lifting of audio ingestion, processing, and archival storage.

Installation

To add this skill to your OpenClaw environment, execute the following command in your terminal:

clawhub install openclaw/skills/skills/aiwithabidi/voice-transcriber-pro

Ensure that you have sufficient permissions for file system access, as the skill requires read access to audio files and write access for generating text output files. No complex configuration files are required, but verify your environment supports Python 3.x and standard shell execution.

Use Cases

  • Journaling: Founders and professionals can record quick thoughts on the go and have them automatically transcribed into a searchable archive.
  • Meeting Recaps: Upload recordings of team synchronization calls to generate searchable, timestamped text logs.
  • Research: Convert interviews, podcast snippets, or lecture recordings from various audio formats (OGG, MP3, WAV, etc.) into accessible text documents.
  • Voice-First AI Workflows: Integrate this into automated pipelines where mobile audio uploads trigger downstream AI analysis or project management task creation.

Example Prompts

  1. "OpenClaw, transcribe the file located at /data/meetings/team_sync_v1.wav and save the results to the project documentation folder."
  2. "I just uploaded a voice note to the inbox. Please use the voice-transcriber to process it and add the summary to my daily journal."
  3. "Transcribe my latest audio note titled 'product-ideas.ogg' and provide me with the main action items identified in the text."

Tips & Limitations

For the best results, ensure your audio files are clear and recorded with minimal background noise. While Deepgram Nova-3 is highly resilient, extremely muffled audio may degrade accuracy. Currently, this skill supports major audio formats including OGG, MP3, WAV, M4A, FLAC, and WEBM. Note that file processing time is proportional to the audio duration; for very long recordings, allow a moment for the script to finalize the transcript archival. Keep your file paths organized to ensure the archival system remains clean and searchable.

Metadata

Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-aiwithabidi-voice-transcriber-pro": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#transcription#productivity#voice-notes#audio-processing#journaling
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution

Related Skills

freshsales

Freshsales CRM integration — manage contacts, leads, deals, accounts, tasks, and sales sequences via the Freshsales API. Track deal pipelines, automate lead assignments, log activities, and generate sales reports. Built for AI agents — Python stdlib only, no dependencies. Use for sales CRM, contact management, deal tracking, pipeline reporting, and sales automation.

aiwithabidi 4473

gemini-video-analyzer

Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe speech, identify objects and actions. Use when: (1) User sends a video file and wants it analyzed, (2) Video summarization or description needed, (3) Extracting text, UI elements, or information from screen recordings, (4) Answering questions about video content, (5) Comparing multiple videos, (6) Analyzing tutorials, demos, or walkthroughs.

aiwithabidi 4473

agent-memory

Full AI agent memory stack — Mem0 unified memory engine with vector search (Qdrant) and knowledge graph (Neo4j), plus SQLite for structured data. Complete setup script and tools. Give your OpenClaw agent a real brain with semantic recall, entity relationships, and structured storage.

aiwithabidi 4473

neon

Neon serverless Postgres — manage projects, branches, databases, roles, endpoints, and compute via the Neon API. Create database branches for development, manage connection endpoints, scale compute, and monitor usage. Built for AI agents — Python stdlib only, zero dependencies. Use for serverless Postgres, database branching, database management, development workflows, and cloud database automation.

aiwithabidi 4473

onepassword

1Password Connect — vaults, items, secrets management for server-side applications.

aiwithabidi 4473