ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

dub-youtube-with-voiceai

Dub YouTube videos with Voice.ai TTS. Turn scripts into publish-ready voiceovers with chapters, captions, and audio replacement for YouTube long-form and Shorts.

Why use this skill?

Use OpenClaw to dub YouTube videos with AI. Generate professional voiceovers, chapter markers, and captions from scripts in one command with Voice.ai integration.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/gizmogremlin/dub-youtube-with-voiceai
Or

What This Skill Does

The dub-youtube-with-voiceai skill is a powerful automation utility for content creators that bridges the gap between raw scripts and professional-grade YouTube content. By leveraging the Voice.ai API, it transforms text scripts into high-quality, human-like voiceovers. Beyond simple synthesis, the skill performs a full post-production workflow: segmenting audio based on your script, generating chapter timestamps, creating SRT caption files, and stitching the final audio onto your original video file. It eliminates the need for expensive studio recording sessions by providing a consistent, repeatable pipeline for long-form videos and YouTube Shorts.

Installation

To integrate this skill into your environment, use the OpenClaw CLI: clawhub install openclaw/skills/skills/gizmogremlin/dub-youtube-with-voiceai

Ensure you have Node.js 20+ installed. You must set the VOICE_AI_API_KEY environment variable in your terminal session or shell configuration. While ffmpeg is optional for basic audio tasks, it is highly recommended to install it to enable full video dubbing, loudness normalization, and MP3 encoding capabilities.

Use Cases

This tool is ideal for creators managing high-volume production schedules. Use it to dub educational screencasts to ensure clarity, produce consistent voiceovers for faceless YouTube channels, or quickly generate localized versions of your content. The smart caching mechanism is particularly useful for iterative updates; if you modify only a small section of your script, the tool re-renders only that specific segment, saving time and API costs.

Example Prompts

  1. "Build a voiceover for my-script.md using the oliver voice, mux it with the video recording.mp4, and output to the project folder."
  2. "Run the build command for my tech-review.md script with the 'sarah' voice and use the trim sync setting to match the video duration."
  3. "Execute a mock build for the script draft-v1.md to test the chapter and caption generation without using my API credits."

Tips & Limitations

For best results, structure your input Markdown file with clear headers to assist in automatic chapter creation. Note that the quality of the final output heavily relies on the input script's formatting. Always use the --mock flag when refining your workflow to avoid unnecessary API consumption. Ensure your input video file resolution matches your desired output, as the skill handles audio layering but does not perform complex video editing or re-encoding beyond the muxing process. If the voiceover timing feels slightly off, experiment with the --sync parameter, specifically choosing pad or trim depending on whether your audio track is shorter or longer than the visual source material.

Metadata

Stars2387
Views1
Updated2026-03-09
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-gizmogremlin-dub-youtube-with-voiceai": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#youtube#voiceover#automation#tts#video-editing
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution