ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

youtube-anycaption-summarizer

Turn YouTube videos into dependable markdown transcripts and polished summaries — even when caption coverage is messy. This skill works with manual closed captions (CC), auto-generated subtitles, or no usable subtitles at all by using subtitle-first extraction with local Whisper fallback. Supports private/restricted videos via cookies, batch processing, transcript cleanup, language backfill, source-language or user-selected summary language, and end-to-end completion reporting. Ideal for YouTube research, technical walkthroughs, founder content, tutorials, private/internal uploads, and batch video summarization workflows.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/arthurli202602-commits/youtube-anycaption-summarizer
Or

What This Skill Does

The YouTube AnyCaption Summarizer is a robust, self-contained OpenClaw agent skill designed to bridge the gap between inaccessible video content and actionable documentation. Unlike standard transcription tools that rely solely on YouTube's provided metadata or captions, this skill implements a multi-tiered extraction strategy. It first attempts to fetch manual closed captions (CC), proceeds to auto-generated subtitles, and finally falls back to local Whisper transcription if no usable text is available. The skill outputs clean, formatted markdown transcripts alongside polished, session-ready summaries, ensuring consistent documentation across diverse video sources.

Installation

Setting up the skill requires a few system dependencies to ensure the local audio processing engine is functional. Open your terminal and execute the following commands to install the necessary binaries and download the Whisper model:

brew install yt-dlp ffmpeg whisper-cpp
MODELS_DIR="$HOME/.openclaw/workspace"
MODEL_PATH="$MODELS_DIR/ggml-medium.bin"
mkdir -p "$MODELS_DIR"
if [ ! -f "$MODEL_PATH" ]; then
  curl -L https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-medium.bin \
    -o "$MODEL_PATH.part" && mv "$MODEL_PATH.part" "$MODEL_PATH"
fi

After installation, you can install the skill via the OpenClaw command interface: clawhub install openclaw/skills/skills/arthurli202602-commits/youtube-anycaption-summarizer. This process is non-destructive and verifies your system environment before finalizing.

Use Cases

This skill is highly effective for technical research and professional content management. Users can process founder videos and operator walkthroughs into searchable knowledge bases. It excels in batch-processing long-form technical explainers that require precise transcriptions and high-level summaries. Furthermore, it supports authenticated access for private or internal-only YouTube videos via cookie configuration, making it indispensable for private tutorial libraries and corporate training materials that are not accessible to public-facing scrapers.

Example Prompts

  1. "Summarize this video: https://youtube.com/watch?v=EXAMPLE_ID and provide a detailed markdown transcript for my technical documentation."
  2. "Process this playlist of three tutorial videos; generate a single summary document and separate transcripts for each."
  3. "Summarize this private company update video using my browser cookies for authentication: [URL]"

Tips & Limitations

To ensure optimal performance, ensure that your yt-dlp is updated regularly, as YouTube frequently changes its playback architecture. While the local Whisper fallback is highly reliable, it is resource-intensive; for long videos, ensure you have sufficient disk space and memory. If you require multi-language support, verify that your selected Whisper model variant supports the target source language. Always use the --models-dir flag if you prefer to keep your library outside of the default OpenClaw workspace.

Metadata

Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-arthurli202602-commits-youtube-anycaption-summarizer": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#youtube#transcription#summarization#whisper#automation
Safety Score: 4/5

Flags: network-access, file-write, file-read, code-execution