Audio Summary
Skill by alanoo7
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alanoo7/audio-summary
What This Skill Does
The Audio Summary skill by alanoo7 is a utility for OpenClaw users who need to extract insights from multimedia content quickly. It acts as an automated transcription and summarization assistant that turns raw video and audio files into structured information. Using ffmpeg and the qwen3-asr-flash model, the skill runs a four-step pipeline: it extracts the audio from video files (such as MP4s), transcodes it to 16 kHz mono to meet the model's input constraints, transcribes the speech to text, and generates a structured summary of the content.
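The extraction and transcoding steps described above can be sketched as a single ffmpeg call. This is a minimal illustration, not the skill's actual implementation: the function name `build_extract_command` and the `_16k.wav` output naming are hypothetical, and only the ffmpeg flags (`-vn`, `-ac`, `-ar`) are standard.

```python
from pathlib import Path


def build_extract_command(source: str, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command that extracts mono audio at the given sample rate."""
    src = Path(source)
    # Hypothetical naming: write the audio file next to the source file.
    target = src.with_name(src.stem + "_16k.wav")
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it already exists
        "-i", str(src),           # input video or audio file
        "-vn",                    # drop the video stream, keep audio only
        "-ac", "1",               # downmix to a single channel (mono)
        "-ar", str(sample_rate),  # resample, 16000 Hz by default
        str(target),
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without ffmpeg.
    print(" ".join(build_extract_command("ProjectUpdate.mp4")))
```

Passing the returned list to `subprocess.run(cmd, check=True)` would perform the conversion on a machine where ffmpeg is installed.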
Installation
To integrate this skill into your OpenClaw environment, ensure you have ffmpeg installed and accessible in your system's PATH. Once prepared, run the following command in your terminal:
clawhub install openclaw/skills/skills/alanoo7/audio-summary
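To confirm the ffmpeg prerequisite before installing, a small check like the following may help. It is a sketch that relies only on Python's standard library; the helper name `ffmpeg_available` is made up for illustration.

```python
import shutil


def ffmpeg_available(executable: str = "ffmpeg") -> bool:
    """Return True if the executable can be found on the system PATH."""
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found on PATH")
    else:
        print("ffmpeg not found; install it before running the skill")
```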
Use Cases
This skill is perfect for professionals and students who deal with large volumes of recorded media. Common use cases include: summarizing recorded meetings or lectures into actionable minutes, quickly scanning long educational YouTube videos for key concepts, or processing interview recordings to extract specific viewpoints without needing to listen to the entire duration manually.
Example Prompts
- "Summarize the content of the video file located at C:\Meetings\ProjectUpdate.mp4 and save the output to my workspace."
- "Extract the audio from my video recording and give me a bulleted summary of all the key decisions made during the call."
- "Transcribe and summarize this MP4 file: D:\Classes\Lecture01.mp4, focusing specifically on the discussion about machine learning."
Tips & Limitations
- File Duration: Due to a Base64 encoding limitation (6 MB ceiling), the skill is optimized for clips of roughly 5 to 8 minutes. For longer media, manually split the file before processing.
- Performance: The skill uses ffmpeg for transcoding; ensure the file path provided is correct and the video is not encrypted or protected.
- Costs: This skill interfaces with the Bailian API. Be aware of the qwen3-asr-flash billing structure and manage your usage accordingly.
- Output: The summary is saved automatically in the same directory as the source file for easy reference.
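The file-duration tip can be made concrete with a rough size check and a split command. The sketch below rests on several assumptions: that the 6 MB ceiling applies to the Base64-encoded payload, that "6 MB" means 6 × 1024 × 1024 bytes, and that a 300-second segment length suits your media. The function names are hypothetical; the ffmpeg segment options (`-f segment`, `-segment_time`, `-reset_timestamps`) are standard.

```python
from pathlib import Path

# Assumption: the ceiling is 6 MiB and applies to the encoded payload.
BASE64_LIMIT_BYTES = 6 * 1024 * 1024


def base64_size(raw_bytes: int) -> int:
    """Size in bytes of padded Base64 output for raw_bytes of input."""
    # Base64 emits 4 output characters for every 3 input bytes, rounded up.
    return 4 * ((raw_bytes + 2) // 3)


def fits_limit(raw_bytes: int, limit: int = BASE64_LIMIT_BYTES) -> bool:
    """Return True if the encoded form of raw_bytes stays within the limit."""
    return base64_size(raw_bytes) <= limit


def build_split_command(source: str, segment_seconds: int = 300) -> list[str]:
    """Build an ffmpeg command that cuts a file into fixed-length segments."""
    src = Path(source)
    # Hypothetical naming: Lecture01_part000.mp4, Lecture01_part001.mp4, ...
    pattern = src.with_name(f"{src.stem}_part%03d{src.suffix}")
    return [
        "ffmpeg",
        "-i", str(src),
        "-c", "copy",                 # copy streams without re-encoding
        "-f", "segment",              # use ffmpeg's segment muxer
        "-segment_time", str(segment_seconds),
        "-reset_timestamps", "1",     # start each segment at timestamp zero
        str(pattern),
    ]


if __name__ == "__main__":
    print(fits_limit(4 * 1024 * 1024))   # 4 MiB of raw audio
    print(" ".join(build_split_command("Lecture01.mp4")))
```

Because Base64 inflates data by about one third, roughly 4.5 MiB of raw audio is the most that fits under a 6 MiB encoded ceiling under these assumptions.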
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-alanoo7-audio-summary": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags(AI)
Flags: file-write, file-read, external-api, code-execution