Official, Verified, media, Safety 4/5

yt-assemblyai-monitor

YouTube channel monitor and video transcription skill built on the AssemblyAI cloud API. Pure Python with requests only: no ffmpeg, no Whisper, no extra tools. It monitors YouTube channels for new videos, extracts audio URLs via the innertube API, submits them to AssemblyAI for cloud transcription, and returns the text plus an AI summary. Works on macOS, Linux, and Windows. The only dependency is requests (usually pre-installed). Use when the user asks to monitor YouTube channels, transcribe YouTube videos, summarize video content, or set up YouTube content monitoring.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/azazlf09/yt-assemblyai-monitor

What This Skill Does

yt-assemblyai-monitor is an OpenClaw skill that automates monitoring YouTube channels and transcribing their videos. Instead of relying on heavy local dependencies such as FFmpeg or Whisper, it uses the AssemblyAI cloud API. The skill calls the YouTube innertube API through Python's requests library to extract audio URLs directly, then passes those URLs to AssemblyAI for cloud-side transcription. Because transcription runs remotely, local compute stays free and there is no extra software to install. The skill fetches the full transcription, provides an AI-generated summary, and logs detailed utterance segments with speaker labels, which makes it useful for research, accessibility, and content tracking.
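The AssemblyAI half of this pipeline can be sketched as follows. This is a minimal illustration under stated assumptions, not the skill's actual code: the endpoint paths and the authorization header follow AssemblyAI's public REST API, while the function names, the polling interval, and the choice of summary options are assumptions made for the example. The innertube audio-URL extraction step is left out because the listing does not describe it in detail.

```python
import time
from typing import Callable, Dict, Tuple

API_BASE = "https://api.assemblyai.com/v2"


def build_transcript_request(audio_url: str, api_key: str) -> Tuple[str, Dict[str, str], Dict[str, object]]:
    """Return (url, headers, json_body) for submitting one audio URL."""
    headers = {"authorization": api_key}
    body = {
        "audio_url": audio_url,
        "speaker_labels": True,          # ask for utterances with speaker labels
        "summarization": True,           # assumption: summary requested in the same job
        "summary_model": "informative",  # assumption: one of the documented models
        "summary_type": "bullets",       # assumption: one of the documented types
    }
    return f"{API_BASE}/transcript", headers, body


def http_get_json(url: str, headers: Dict[str, str]) -> dict:
    """Default fetcher. requests is imported lazily so the sketch loads without it."""
    import requests

    response = requests.get(url, headers=headers, timeout=30)
    response.raise_for_status()
    return response.json()


def wait_for_transcript(
    transcript_id: str,
    api_key: str,
    get_json: Callable[[str, Dict[str, str]], dict] = http_get_json,
    poll_seconds: float = 5.0,
    max_polls: int = 720,
) -> dict:
    """Poll the transcript resource until it is completed or has failed."""
    url = f"{API_BASE}/transcript/{transcript_id}"
    headers = {"authorization": api_key}
    for _ in range(max_polls):
        job = get_json(url, headers)
        if job["status"] == "completed":
            return job
        if job["status"] == "error":
            raise RuntimeError(job.get("error", "transcription failed"))
        time.sleep(poll_seconds)
    raise TimeoutError(f"transcript {transcript_id} not finished after {max_polls} polls")
```

To submit a job, pass the three values from build_transcript_request to requests.post(url, headers=headers, json=body) and read the "id" field of the response; that id is what wait_for_transcript polls. The fetcher is injectable so the polling loop can be exercised without network access.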

Installation

To install this skill, run clawhub install openclaw/skills/skills/azazlf09/yt-assemblyai-monitor. After installation, you need an AssemblyAI API key. Provide it either by setting the ASSEMBLYAI_API_KEY environment variable or by placing a config.json file inside the data/ directory. No compilation or further dependency management is required, because the script runs as pure Python.
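The two ways of supplying the key could be resolved along these lines. This is a sketch only: the listing does not say which source takes precedence or what the field inside config.json is called, so the environment-first order and the field name assemblyai_api_key are both assumptions, as is the function name load_api_key.

```python
import json
import os
from pathlib import Path
from typing import Optional


def load_api_key(data_dir: str = "data", config_field: str = "assemblyai_api_key") -> Optional[str]:
    """Return the AssemblyAI key from the environment or data/config.json, else None."""
    # Assumption: the environment variable wins over the config file.
    env_key = os.environ.get("ASSEMBLYAI_API_KEY", "").strip()
    if env_key:
        return env_key

    config_path = Path(data_dir) / "config.json"
    if not config_path.is_file():
        return None

    with config_path.open(encoding="utf-8") as handle:
        config = json.load(handle)

    # Hypothetical field name; check the skill's own documentation for the real one.
    value = config.get(config_field) if isinstance(config, dict) else None
    if isinstance(value, str) and value.strip():
        return value.strip()
    return None
```

Returning None rather than raising leaves it to the caller to decide how to report a missing key.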

Use Cases

This skill suits creators, researchers, and developers who need to keep up with video content without manual checking. Common use cases include transcribing long-form educational videos so they can be searched, monitoring competitor channels for updates, building searchable archives of channels of personal interest, and generating quick summaries of videos for faster consumption.

Example Prompts

  1. "Monitor the 'TechExplained' YouTube channel and summarize every new video they post."
  2. "Transcribe this video URL [link] and let me know the main points discussed by the speaker."
  3. "List all the channels I am currently monitoring and tell me which ones have new videos today."

Tips & Limitations

This tool works best on residential IPs; excessive requests from datacenter IPs may trigger YouTube's bot protection. Transcription generally takes about 70-80% of the video's duration. Check AssemblyAI's current free-tier limits before heavy use; this listing cites 100 hours of processing per month. Results are saved automatically in the local data/summaries/ directory, so you can browse or index past transcriptions later.
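Browsing the saved results could look like the sketch below. It assumes only that data/summaries/ contains one file per result; the listing does not specify file names or formats, so the function (a hypothetical helper named list_saved_results) reports names and modification times without parsing any contents.

```python
from datetime import datetime
from pathlib import Path
from typing import List, Tuple


def list_saved_results(summaries_dir: str = "data/summaries") -> List[Tuple[str, datetime]]:
    """Return (file name, last modified) pairs, newest first."""
    root = Path(summaries_dir)
    if not root.is_dir():
        return []  # nothing has been saved yet
    files = [path for path in root.iterdir() if path.is_file()]
    files.sort(key=lambda path: path.stat().st_mtime, reverse=True)
    return [(path.name, datetime.fromtimestamp(path.stat().st_mtime)) for path in files]


if __name__ == "__main__":
    for name, modified in list_saved_results():
        print(f"{modified:%Y-%m-%d %H:%M}  {name}")
```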

Metadata

Author: @azazlf09
Stars: 4473
Views: 0
Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-azazlf09-yt-assemblyai-monitor": {
      "enabled": true,
      "auto_update": true
    }
  }
}
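If you would rather not edit the file by hand, the same entry can be merged in with a short script. This is a sketch under assumptions: it treats clawhub.json as a plain JSON file in the current directory, and the function name enable_plugin is made up for the example. Existing plugin entries are left as they are.

```python
import json
from pathlib import Path

PLUGIN_ID = "official-azazlf09-yt-assemblyai-monitor"


def enable_plugin(config_path: str = "clawhub.json", plugin_id: str = PLUGIN_ID) -> dict:
    """Add or overwrite one plugin entry, preserving the rest of the file."""
    path = Path(config_path)
    # Start from the existing configuration if there is one.
    config = json.loads(path.read_text(encoding="utf-8")) if path.is_file() else {}
    plugins = config.setdefault("plugins", {})
    plugins[plugin_id] = {"enabled": True, "auto_update": True}
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```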

Tags (AI)

#youtube #transcription #automation #summary #productivity
Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api