Official Verified media Safety 3/5

youtube-transcript-pipeline

Generate, clean, correct, translate, and package YouTube interview transcripts with speaker-attributed timestamps into reusable deliverables. Use for workflows involving Deepgram transcription, diarization correction, bilingual output, and structured folder packaging for handoff.

Why use this skill?

Turn YouTube links into structured, speaker-labeled, and translated transcript packages. Features Deepgram Nova-3 integration and automated folder workflows.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/bluebirdback/youtube-transcript-pipeline

Download Source Code (.zip)

What This Skill Does

The youtube-transcript-pipeline is a sophisticated automation workflow designed to transform raw YouTube audio into high-fidelity, structured transcript deliverables. It handles the complete lifecycle of audio processing, from initial download and chunked transcription via Deepgram's Nova-3 model to multi-stage post-processing. Key capabilities include speaker diarization correction, optional bilingual translation (English-to-Chinese), and professional-grade packaging into standardized project directories. By maintaining strict adherence to timestamp and line-format integrity, the skill ensures that all intermediate artifacts—raw transcripts, cleaned versions, and translated snippets—remain auditable and consistent for downstream editorial or research use.

Installation

To install this skill, use the OpenClaw CLI in your terminal: clawhub install openclaw/skills/skills/bluebirdback/youtube-transcript-pipeline Ensure you have your Deepgram API key configured in your environment variables, as the pipeline relies on the nova-3 model for high-accuracy diarization and word-level timing.

Use Cases

Journalistic Interviews: Transcribing long-form interviews with clear speaker separation between the journalist and the guest.
Content Repurposing: Automatically generating cleaned, structured transcript files for blog posts, social media, or archival purposes.
Cross-Lingual Research: Translating technical interview transcripts into Chinese while preserving original timestamps and speaker tags to maintain a clear link back to the source audio.
Content Auditing: Maintaining a repository of raw, cleaned, and translated transcripts for complex editorial projects where version control is critical.

Example Prompts

"Use the youtube-transcript-pipeline to transcribe this interview: [URL]. Please focus on cleaning up the speaker attribution so the interviewer is identified as 'Host' and the interviewee as 'Guest'."
"Download the audio for [URL], generate a transcript with Deepgram, and then provide a Chinese translation of the resulting text while keeping the exact timestamp format."
"Package the latest transcription project into a standard workbench folder. Ensure the MANIFEST.txt is updated and include both the raw and the speaker-corrected versions."

Tips & Limitations

Speaker Attribution: Always perform a conservative correction pass first. The pipeline is designed to be cautious to prevent over-attributing short interjections to the wrong speaker.
Audio Quality: While Nova-3 is powerful, ensure the input audio is relatively clean; heavy background noise may degrade speaker identification accuracy.
Versioning: Use the scripts/ folder provided to handle workflow helpers. It is highly recommended to run the create_youtube_transcript_workbench.sh script initially to establish the correct directory hierarchy, as the pipeline expects this structure for automated packaging and manifest generation.

Read Full Documentation on GitHub

Metadata

Author@bluebirdback

Stars1776

Updated2026-03-02

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-bluebirdback-youtube-transcript-pipeline": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#transcription#ai-audio#workflow#youtube-automation

Safety Score: 3/5

Flags: network-access, file-write, file-read, external-api, code-execution

Related Skills

claude-to-free

Migrate OpenClaw from Claude subscription OAuth to a free or cheap model provider (OpenRouter, Gemini, Ollama). Use when the user says Claude stopped working, gets an auth error, mentions the Anthropic April 2026 subscription ban, or asks to switch models without paying Anthropic more.

bluebirdback 4473

youtube-transcript-pipeline-lite

Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcript handoff tasks when you need a concise, auditable process over custom automation.

bluebirdback 4473

clawhub-publish-doctor

Diagnose and mitigate ClawHub/ClawDHUB publish failures (auth, browser-login, missing dependencies, pending security-scan visibility errors, and wrong profile/skill URLs). Use when publishing skills to ClawHub fails, inspect reports temporary errors, or you need a safer publish+verify workflow with retries.

bluebirdback 4473

exec-clawhub-publish-doctor

Diagnose and mitigate exec-related tooling failures around ClawHub publishing and GitHub CLI queries (auth, browser-login, missing dependencies, pending security-scan visibility errors, wrong profile/skill URLs, and gh JSON-field mismatch errors like Unknown JSON field). Use when publishing skills to ClawHub fails, inspect reports temporary errors, or GitHub CLI search commands fail due to field schema differences.

bluebirdback 4473

claw-history

Provide a chronological history of all actions the agent has taken from the beginning (birth) until now. Use when the user asks for full lifetime timeline/accountability, "from birth until now," "everything you've done so far," "full action log," or equivalent chronological-history requests.

bluebirdback 4473