yt-assemblyai-monitor
YouTube channel monitor and video transcription using AssemblyAI cloud API. Pure Python + requests only — no ffmpeg, no Whisper, no extra tools needed. Monitors YouTube channels for new videos, extracts audio URLs via innertube API, submits to AssemblyAI for cloud transcription, and returns text + AI summary. Works on Mac, Linux, Windows. Only dependency: requests (usually pre-installed). Use when: user asks to monitor YouTube channels, transcribe YouTube videos, summarize video content, or set up YouTube content monitoring.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/azazlf09/yt-assemblyai-monitorWhat This Skill Does
The yt-assemblyai-monitor is a specialized OpenClaw skill designed to automate the process of monitoring YouTube channels and transcribing video content. Unlike traditional methods that rely on heavy local dependencies like FFmpeg or Whisper, this agent leverages the power of AssemblyAI cloud APIs. It uses the YouTube innertube API via Python's requests library to extract audio URLs directly, passing them to AssemblyAI for cloud-side transcription. This architecture ensures that local compute resources remain free and eliminates complex software installation hurdles. The skill automatically fetches full transcriptions, provides AI-generated summaries, and logs detailed utterance segments with speaker labels, making it an efficient tool for research, accessibility, or content tracking.
Installation
To integrate this skill, use the command clawhub install openclaw/skills/skills/azazlf09/yt-assemblyai-monitor. After installation, ensure you have an AssemblyAI API key. Configure it by either setting the ASSEMBLYAI_API_KEY environment variable or placing a config.json file inside the data/ directory. No further compilation or dependency management is required, as the script is designed for pure Python execution.
Use Cases
This skill is perfect for creators, researchers, and developers who need to stay updated with video content without manual intervention. Common use cases include: transcribing long-form educational videos for searchability, monitoring competitor channels for updates, creating searchable archives of personal interest channels, and generating quick summaries of video content for faster consumption.
Example Prompts
- "Monitor the 'TechExplained' YouTube channel and summarize every new video they post."
- "Transcribe this video URL [link] and let me know the main points discussed by the speaker."
- "List all the channels I am currently monitoring and tell me which ones have new videos today."
Tips & Limitations
Note that this tool works best on residential IPs; excessive requests from datacenter IPs may trigger YouTube's bot protection. Transcription speed is generally 70-80% of the video duration. Always check the free tier limits on AssemblyAI, as you get 100 hours of processing per month. Results are automatically saved locally in the data/summaries/ directory, allowing you to easily browse or index your past transcriptions whenever you need.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-azazlf09-yt-assemblyai-monitor": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read, external-api