Official, Verified, media, Safety 4/5

voiceai-creator-voiceover-pipeline

Turn scripts into publishable voiceovers with Voice.ai TTS, including segments, chapters, captions, and video muxing.

Why use this skill?

Automate your production workflow with the Voice.ai Creator Voiceover Pipeline: generate voiceovers, YouTube chapters, and captions from a script, then mux the audio into your video.


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/gizmogremlin/voiceai-voiceover-creator

What This Skill Does

The voiceai-creator-voiceover-pipeline is a CLI tool that turns text scripts into voiceovers using Voice.ai's TTS engine. It automates several post-production tasks: splitting long-form scripts into segments, generating YouTube-ready chapter timestamps, writing SRT caption files, and producing a web-based review page for auditing individual segments. For video creators, it can also mux the generated voiceover into an existing video, replacing the original audio track, with sync options that keep audio and picture aligned.
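As a rough illustration of how chapter timestamps and SRT captions can be derived from rendered segments, the sketch below accumulates segment durations into running timestamps. The `Segment` class and all function names are hypothetical and are not taken from the skill's source; the skill's own implementation may work differently.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    title: str       # hypothetical: the script heading this segment falls under
    text: str        # spoken text, reused here as the caption
    duration: float  # length of the rendered audio, in seconds


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in SRT files."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chapter_timestamp(seconds: float) -> str:
    """Format seconds as M:SS or H:MM:SS, as used in YouTube descriptions."""
    total = int(seconds)
    hours, rest = divmod(total, 3600)
    minutes, secs = divmod(rest, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def build_outputs(segments: list[Segment]) -> tuple[str, str]:
    """Return (srt_text, chapters_text) for segments played back to back."""
    srt_blocks = []
    chapters = []
    start = 0.0
    last_title = None
    for index, seg in enumerate(segments, start=1):
        end = start + seg.duration
        srt_blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{seg.text}\n"
        )
        # Emit a chapter line only when the heading changes.
        if seg.title != last_title:
            chapters.append(f"{chapter_timestamp(start)} {seg.title}")
            last_title = seg.title
        start = end
    return "\n".join(srt_blocks), "\n".join(chapters)
```

Because each segment starts where the previous one ends, the first chapter always lands at 0:00, which is what YouTube expects for a chapter list.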

Installation

To begin, make sure Node.js 20 or higher is installed on your system. Install the skill by running: clawhub install openclaw/skills/skills/gizmogremlin/voiceai-voiceover-creator. Once installed, configure authentication by creating a .env file in the skill root containing VOICE_AI_API_KEY=your-key-here. Features such as master audio stitching and video muxing use ffmpeg, so make sure it is installed and available on your system PATH.

Use Cases

This tool suits creators producing long-form educational content or documentation videos, where it automates the tedious parts of audio editing. It also fits podcasters who need consistent voice branding across episodes, YouTube creators who need voiceovers for Shorts or long-form videos quickly, and corporate trainers building course libraries. Smart caching keeps iteration efficient: if a single sentence in a lengthy script changes, only the segment containing it is re-rendered.
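One common way to get this kind of selective re-rendering is content-addressed caching: each segment's audio is stored under a hash of its text and voice, so an unchanged segment is found in the cache and skipped. The sketch below illustrates that general technique only; the function names are hypothetical, and the skill's actual cache key and storage layout are not documented here.

```python
import hashlib


def segment_key(text: str, voice: str) -> str:
    """Stable cache key that changes whenever the text or the voice changes."""
    payload = f"{voice}\n{text.strip()}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def plan_renders(segments: list[str], voice: str, cache: set[str]) -> list[int]:
    """Return the indexes of segments whose audio is not in the cache yet."""
    return [
        i for i, text in enumerate(segments)
        if segment_key(text, voice) not in cache
    ]


# First build: every segment gets rendered and its key recorded.
script_v1 = ["Welcome to the course.", "Today we cover setup.", "Thanks for watching."]
cache = {segment_key(text, "oliver") for text in script_v1}

# Second build: only the edited middle segment is missing from the cache.
script_v2 = [script_v1[0], "Today we cover installation.", script_v1[2]]
print(plan_renders(script_v2, "oliver", cache))  # [1]
```

Including the voice in the key means that switching voices invalidates every segment, which is the behavior you would want when re-voicing a whole script.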

Example Prompts

  1. "Build a voiceover for my-script.md using the 'oliver' voice, save it to the output folder, and include YouTube chapters."
  2. "Update my project with the new audio, mux it into my-recording.mp4 using the 'pad' sync setting."
  3. "Run a test build using the shortform template for my YouTube hook script using mock audio."

Tips & Limitations

Before running large batch jobs, check that the account behind your VOICE_AI_API_KEY has sufficient credits. The skill handles audio-to-video integration only; complex video editing, such as visual cuts or transitions, must be done in external software. Use the --mock flag during initial configuration to verify your pipeline flow without consuming API credits.
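For context, replacing a video's audio track is a standard ffmpeg operation. The sketch below builds such a command by hand; it is an illustration of the general approach, not the skill's own code. The `build_mux_command` helper is hypothetical, and the meaning given to the 'pad' and 'shortest' sync values here is an assumption, since this page does not define them.

```python
def build_mux_command(video: str, audio: str, output: str, sync: str = "shortest") -> list[str]:
    """Build an ffmpeg argument list that swaps a video's audio for a new track."""
    cmd = [
        "ffmpeg", "-y",
        "-i", video,       # input 0: the original video
        "-i", audio,       # input 1: the generated voiceover
        "-map", "0:v:0",   # take the video stream from input 0
        "-map", "1:a:0",   # take the audio stream from input 1
        "-c:v", "copy",    # copy the video stream without re-encoding
        "-c:a", "aac",
    ]
    if sync == "shortest":
        # Assumed meaning: stop at whichever stream ends first.
        cmd.append("-shortest")
    elif sync == "pad":
        # Assumed meaning: pad the audio with silence so output runs to the video's end.
        cmd += ["-af", "apad", "-shortest"]
    else:
        raise ValueError(f"unknown sync mode: {sync}")
    cmd.append(output)
    return cmd


print(" ".join(build_mux_command("my-recording.mp4", "master.wav", "out.mp4", sync="pad")))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a machine where ffmpeg is installed. The file names `master.wav` and `out.mp4` are placeholders.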

Metadata

Stars: 2387
Views: 3
Updated: 2026-03-09

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-gizmogremlin-voiceai-voiceover-creator": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags (AI)

#audio #voiceover #automation #video #tts
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution