ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

Audio Summary

Skill by alanoo7

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alanoo7/audio-summary
Or

What This Skill Does

The Audio Summary skill by alanoo7 is an advanced utility designed for OpenClaw users who need to extract insights from multimedia content quickly. It acts as an automated transcription and summarization assistant that bridges the gap between raw video/audio files and structured information. By leveraging the power of ffmpeg and the high-performance qwen3-asr-flash model, this skill performs a sophisticated pipeline: it automatically extracts audio from video files (such as MP4s), transcodes it to 16k mono to meet input constraints, transcribes the speech to text, and generates a structured summary of the content.

Installation

To integrate this skill into your OpenClaw environment, ensure you have ffmpeg installed and accessible in your system's PATH. Once prepared, run the following command in your terminal: clawhub install openclaw/skills/skills/alanoo7/audio-summary

Use Cases

This skill is perfect for professionals and students who deal with large volumes of recorded media. Common use cases include: summarizing recorded meetings or lectures into actionable minutes, quickly scanning long educational YouTube videos for key concepts, or processing interview recordings to extract specific viewpoints without needing to listen to the entire duration manually.

Example Prompts

  1. "Summarize the content of the video file located at C:\Meetings\ProjectUpdate.mp4 and save the output to my workspace."
  2. "Extract the audio from my video recording and give me a bulleted summary of all the key decisions made during the call."
  3. "Transcribe and summarize this MP4 file: D:\Classes\Lecture01.mp4, focusing specifically on the discussion about machine learning."

Tips & Limitations

  • File Duration: Due to Base64 encoding limitations (6MB ceiling), the skill is optimized for clips between 5-8 minutes. For longer media, manually split the file before processing.
  • Performance: The skill uses ffmpeg for transcoding; ensure the file path provided is correct and the video is not encrypted or protected.
  • Costs: This skill interfaces with the Bailian API. Be aware of the qwen3-asr-flash billing structure and manage your usage accordingly.
  • Output: The summary is saved automatically in the same directory as the source file for easy reference.

Metadata

Author@alanoo7
Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alanoo7-audio-summary": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#audio#transcription#video-summarization#productivity#nlp
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution