voiceai-creator-voiceover-pipeline
Turn scripts into publishable voiceovers with Voice.ai TTS, including segments, chapters, captions, and video muxing.
Why use this skill?
Automate your production workflow with the Voice.ai Creator Voiceover Pipeline. Generate professional voiceovers, YouTube chapters, and captions, then mux the audio into your video.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/gizmogremlin/voiceai-voiceover-creator
What This Skill Does
The voiceai-creator-voiceover-pipeline is a CLI tool that turns text scripts into voiceovers using Voice.ai's TTS engine. It automates the post-production steps around the audio: splitting long-form scripts into segments, generating YouTube-ready chapter timestamps, creating SRT caption files, and producing a web-based review page for auditing individual segments. For video creators, it can also mux the result into a video, replacing the existing audio track with the generated voiceover; sync options control how the audio is aligned with the video.
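To make the chapter and caption outputs concrete, here is a minimal sketch of how cumulative segment durations could be turned into YouTube chapter lines and SRT cues. It is not the skill's actual implementation; the `Segment` type, its fields, and the function names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    title: str       # hypothetical: chapter title, e.g. taken from a script heading
    text: str        # caption text for the segment
    duration: float  # rendered audio length in seconds


def chapter_stamp(seconds: float) -> str:
    """Format a YouTube chapter timestamp (M:SS, or H:MM:SS past one hour)."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def srt_stamp(seconds: float) -> str:
    """Format an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, rem = divmod(millis, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_outputs(segments: list) -> tuple:
    """Return (chapters_text, srt_text) built from cumulative durations."""
    chapters, cues = [], []
    start = 0.0
    for index, seg in enumerate(segments, start=1):
        end = start + seg.duration
        chapters.append(f"{chapter_stamp(start)} {seg.title}")
        cues.append(f"{index}\n{srt_stamp(start)} --> {srt_stamp(end)}\n{seg.text}\n")
        start = end
    return "\n".join(chapters), "\n".join(cues)
```

Each segment's start time is the sum of the durations before it, so the first chapter always lands on 0:00, which YouTube expects of a chapter list.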
Installation
To begin, ensure you have Node.js version 20 or higher installed on your system. You can install the skill by executing: clawhub install openclaw/skills/skills/gizmogremlin/voiceai-voiceover-creator. Once installed, configure authentication by creating a .env file in the skill root containing VOICE_AI_API_KEY=your-key-here. Features such as master audio stitching and video muxing rely on ffmpeg, so make sure it is installed and accessible in your system PATH.
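The prerequisites above can be checked before a first run. The following is a rough preflight sketch, not part of the skill itself: `parse_env`, `node_major`, and `preflight` are hypothetical helpers, and the caller is assumed to pass in the output of `node --version` as a string.

```python
import re
import shutil
from pathlib import Path
from typing import Optional


def parse_env(text: str) -> dict:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def node_major(version_output: str) -> Optional[int]:
    """Extract the major version from output such as 'v20.11.0'."""
    match = re.match(r"v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def preflight(env_path: Path, node_version_output: str) -> list:
    """Return a list of problems; an empty list means the setup looks ready."""
    problems = []
    major = node_major(node_version_output)
    if major is None or major < 20:
        problems.append("Node.js 20 or higher is required")
    if not env_path.is_file():
        problems.append(f"{env_path} not found")
    elif not parse_env(env_path.read_text()).get("VOICE_AI_API_KEY"):
        problems.append("VOICE_AI_API_KEY is missing or empty")
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    return problems
```

A caller might obtain the version string with `subprocess.run(["node", "--version"], capture_output=True, text=True).stdout` and print whatever `preflight` returns.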
Use Cases
This tool is ideal for creators producing long-form educational content or documentation videos, as it automates the tedious aspects of audio editing. It is perfectly suited for podcasters requiring consistent voice branding across episodes, YouTube creators needing to generate high-quality voiceovers for shorts or long-form videos quickly, and corporate trainers building out course libraries. The smart-caching mechanism allows for efficient iteration, meaning if a single sentence in a lengthy script needs a change, only that specific segment is re-rendered.
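The listing does not describe how the smart cache decides what to re-render. One common approach, sketched below purely as an assumption, is to key each segment by a hash of its text and voice, so that an edit to one sentence invalidates only that segment. The names `segment_key` and `plan_render` are hypothetical.

```python
import hashlib


def segment_key(text: str, voice: str) -> str:
    """Hash the inputs that determine a segment's audio (assumed: text and voice)."""
    payload = f"{voice}\n{text.strip()}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def plan_render(segments: list, voice: str, cache: dict) -> list:
    """Return the indexes of segments whose key is not yet in the cache."""
    return [
        index
        for index, text in enumerate(segments)
        if segment_key(text, voice) not in cache
    ]


# Example: everything is cached, then one sentence changes.
script = ["Welcome to the show.", "Today we cover caching.", "Thanks for watching."]
cache = {segment_key(text, "oliver"): f"seg{i}.wav" for i, text in enumerate(script)}
script[1] = "Today we cover smart caching."
to_render = plan_render(script, "oliver", cache)  # only index 1 needs new audio
```

Including the voice in the key means switching voices re-renders every segment, which is usually the desired behavior.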
Example Prompts
- "Build a voiceover for my-script.md using the 'oliver' voice, save it to the output folder, and include YouTube chapters."
- "Update my project with the new audio and mux it into my-recording.mp4 using the 'pad' sync setting."
- "Run a test build of my YouTube hook script with the shortform template and mock audio."
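For readers curious what a mux with a 'pad' sync setting might involve, the sketch below builds an ffmpeg command line that replaces a video's audio track. This is an illustration, not the skill's code: it assumes 'pad' means extending a shorter voiceover with silence until the video ends, and the function name and the 'shortest' mode are hypothetical. The ffmpeg options used (`-map`, `-c:v copy`, `-af apad`, `-shortest`) are standard.

```python
def build_mux_command(video: str, voiceover: str, output: str, sync: str = "shortest") -> list:
    """Build an ffmpeg argv that swaps the video's audio for the voiceover."""
    cmd = [
        "ffmpeg", "-y",
        "-i", video,
        "-i", voiceover,
        "-map", "0:v:0",  # video stream from the first input
        "-map", "1:a:0",  # audio stream from the second input
        "-c:v", "copy",   # copy the video stream without re-encoding
        "-c:a", "aac",
    ]
    if sync == "pad":
        # Assumed meaning of 'pad': append silence, then stop when the video ends.
        cmd += ["-af", "apad", "-shortest"]
    elif sync == "shortest":
        # Hypothetical mode: stop at whichever stream ends first.
        cmd += ["-shortest"]
    else:
        raise ValueError(f"unknown sync mode: {sync}")
    cmd.append(output)
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)`; building the argv as a list avoids shell quoting problems with file names that contain spaces.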
Tips & Limitations
Before running large batch jobs, check that the account behind your VOICE_AI_API_KEY has sufficient credits. The skill covers audio generation and audio-to-video integration only; visual editing such as cuts or transitions must be done in external software. Use the --mock flag during initial configuration to verify your pipeline flow without consuming API credits.
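The listing does not say what the mock audio contains. As a rough idea of how a dry run can exercise the rest of a pipeline without calling the API, the sketch below generates a silent WAV whose length approximates the spoken duration of a segment. The `mock_wav` helper and the 150 words-per-minute estimate are assumptions made for illustration.

```python
import io
import wave


def mock_wav(text: str, words_per_minute: int = 150, sample_rate: int = 16000) -> bytes:
    """Return a silent mono 16-bit WAV roughly as long as speaking `text` would take."""
    words = max(1, len(text.split()))
    seconds = words * 60 / words_per_minute
    frames = int(seconds * sample_rate)
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)            # mono
        wav.setsampwidth(2)            # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * frames)  # all-zero samples are silence
    return buffer.getvalue()
```

Placeholder files like this have realistic durations, so downstream steps such as chapter timing, captions, and muxing can be checked end to end before any credits are spent.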
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-gizmogremlin-voiceai-voiceover-creator": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: file-write, file-read, external-api, code-execution
Related Skills
voice-ai-tts
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
voice.ai-voice-agents
Create, manage, and deploy Voice.ai conversational AI agents. Use when the user wants to work with voice agents, list their agents, create new ones, or manage agent configurations.
dub-youtube-with-voiceai
Dub YouTube videos with Voice.ai TTS. Turn scripts into publish-ready voiceovers with chapters, captions, and audio replacement for YouTube long-form and Shorts.
voice-ai-agents
Create, manage, and deploy Voice.ai conversational AI agents. Use when the user wants to work with voice agents, list their agents, create new ones, or manage agent configurations.