youtube-transcript-pipeline
Generate, clean, correct, translate, and package YouTube interview transcripts with speaker-attributed timestamps into reusable deliverables. Use for workflows involving Deepgram transcription, diarization correction, bilingual output, and structured folder packaging for handoff.
Why use this skill?
Turn YouTube links into structured, speaker-labeled, and translated transcript packages. Features Deepgram Nova-3 integration and automated folder workflows.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bluebirdback/youtube-transcript-pipelineWhat This Skill Does
The youtube-transcript-pipeline is a sophisticated automation workflow designed to transform raw YouTube audio into high-fidelity, structured transcript deliverables. It handles the complete lifecycle of audio processing, from initial download and chunked transcription via Deepgram's Nova-3 model to multi-stage post-processing. Key capabilities include speaker diarization correction, optional bilingual translation (English-to-Chinese), and professional-grade packaging into standardized project directories. By maintaining strict adherence to timestamp and line-format integrity, the skill ensures that all intermediate artifacts—raw transcripts, cleaned versions, and translated snippets—remain auditable and consistent for downstream editorial or research use.
Installation
To install this skill, use the OpenClaw CLI in your terminal:
clawhub install openclaw/skills/skills/bluebirdback/youtube-transcript-pipeline
Ensure you have your Deepgram API key configured in your environment variables, as the pipeline relies on the nova-3 model for high-accuracy diarization and word-level timing.
Use Cases
- Journalistic Interviews: Transcribing long-form interviews with clear speaker separation between the journalist and the guest.
- Content Repurposing: Automatically generating cleaned, structured transcript files for blog posts, social media, or archival purposes.
- Cross-Lingual Research: Translating technical interview transcripts into Chinese while preserving original timestamps and speaker tags to maintain a clear link back to the source audio.
- Content Auditing: Maintaining a repository of raw, cleaned, and translated transcripts for complex editorial projects where version control is critical.
Example Prompts
- "Use the youtube-transcript-pipeline to transcribe this interview: [URL]. Please focus on cleaning up the speaker attribution so the interviewer is identified as 'Host' and the interviewee as 'Guest'."
- "Download the audio for [URL], generate a transcript with Deepgram, and then provide a Chinese translation of the resulting text while keeping the exact timestamp format."
- "Package the latest transcription project into a standard workbench folder. Ensure the MANIFEST.txt is updated and include both the raw and the speaker-corrected versions."
Tips & Limitations
- Speaker Attribution: Always perform a conservative correction pass first. The pipeline is designed to be cautious to prevent over-attributing short interjections to the wrong speaker.
- Audio Quality: While Nova-3 is powerful, ensure the input audio is relatively clean; heavy background noise may degrade speaker identification accuracy.
- Versioning: Use the
scripts/folder provided to handle workflow helpers. It is highly recommended to run thecreate_youtube_transcript_workbench.shscript initially to establish the correct directory hierarchy, as the pipeline expects this structure for automated packaging and manifest generation.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bluebirdback-youtube-transcript-pipeline": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read, external-api, code-execution
Related Skills
claude-to-free
Migrate OpenClaw from Claude subscription OAuth to a free or cheap model provider (OpenRouter, Gemini, Ollama). Use when the user says Claude stopped working, gets an auth error, mentions the Anthropic April 2026 subscription ban, or asks to switch models without paying Anthropic more.
youtube-transcript-pipeline-lite
Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcript handoff tasks when you need a concise, auditable process over custom automation.
clawhub-publish-doctor
Diagnose and mitigate ClawHub/ClawDHUB publish failures (auth, browser-login, missing dependencies, pending security-scan visibility errors, and wrong profile/skill URLs). Use when publishing skills to ClawHub fails, inspect reports temporary errors, or you need a safer publish+verify workflow with retries.
exec-clawhub-publish-doctor
Diagnose and mitigate exec-related tooling failures around ClawHub publishing and GitHub CLI queries (auth, browser-login, missing dependencies, pending security-scan visibility errors, wrong profile/skill URLs, and gh JSON-field mismatch errors like Unknown JSON field). Use when publishing skills to ClawHub fails, inspect reports temporary errors, or GitHub CLI search commands fail due to field schema differences.
claw-history
Provide a chronological history of all actions the agent has taken from the beginning (birth) until now. Use when the user asks for full lifetime timeline/accountability, "from birth until now," "everything you've done so far," "full action log," or equivalent chronological-history requests.