ai-text-to-video
Tell me what you need and I'll transform your written content into compelling video scripts, storyboards, and production-ready prompts using ai-text-to-video intelligence. Whether you're converting a blog post, ad copy, social caption, or raw idea into a visual narrative, this skill breaks down your text and maps it to scenes, voiceover cues, and visual directions. Built for content creators, marketers, and educators who want to skip the blank-canvas struggle and go straight to producing.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bwbernardweston18/ai-text-to-videoGetting Started
Welcome — let's turn your text into a video worth watching. Paste your content, describe your idea, or share a script draft and I'll generate a full scene-by-scene video breakdown with visual cues, voiceover guidance, and on-screen text suggestions ready for production.
Try saying:
- "I have a 600-word blog post about sustainable packaging — can you turn it into a 60-second video script with scene descriptions and voiceover lines?"
- "Convert this product launch announcement into a storyboard for a 30-second Instagram Reel, including visual direction for each scene and suggested on-screen text."
- "I have a slide deck outline for a training video on onboarding new employees — help me turn it into a full narrated video script with scene transitions and timing guidance."
Quick Start Setup
This skill connects to a cloud processing backend. On first use, set up the connection automatically and let the user know ("Connecting...").
Token check: Look for NEMO_TOKEN in the environment. If found, skip to session creation. Otherwise:
- Generate a UUID as client identifier
- POST
https://mega-api-prod.nemovideo.ai/api/auth/anonymous-tokenwithX-Client-Idheader - Extract
data.tokenfrom the response — this is your NEMO_TOKEN (100 free credits, 7-day expiry)
Session: POST https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Keep the returned session_id for all operations.
Let the user know with a brief "Ready!" when setup is complete. Don't expose tokens or raw API output.
From Words on a Page to Video That Moves People
Most people have the words — the product description, the story, the pitch — but no clear path from text to a finished video. That gap is exactly what this skill was built to close. By analyzing the structure, tone, and intent of your written content, it generates scene-by-scene breakdowns, on-screen text suggestions, visual mood guidance, and voiceover scripts that you can hand directly to a video editor or AI video tool.
This isn't about slapping your text on a slideshow. The skill reads between the lines — identifying which parts of your writing should be shown visually, which should be spoken aloud, and which work best as titles or captions. The result is a production blueprint that respects the original message while making it genuinely watchable.
Content marketers use it to repurpose long-form articles into short-form video content. Educators turn lecture notes into structured lesson videos. Entrepreneurs convert pitch decks and landing page copy into investor or customer-facing video narratives. Whatever your text contains, this skill helps you see it as a video before a single frame is shot.
Prompt Routing and Scene Dispatch
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bwbernardweston18-ai-text-to-video": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
free-video-generation-api
Skip the learning curve of professional editing software. Describe what you want — generate a short video clip from a text description using the free API tier — and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, GIF files up to 200MB, and the AI handles AI video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.
google-video
search video clips into indexed video clips with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. marketers use it for searching and retrieving specific moments inside video files — processing takes 20-40 seconds on cloud GPUs and you get 1080p MP4 files.
ai-video-maker-jobs
create raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. recruiters and job seekers use it for creating professional job showcase or recruitment videos using AI — processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.
ai-image-to-video-extender
convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. content creators, marketers, social media managers use it for turning static images into short moving video clips — processing takes 30-60 seconds on cloud GPUs and you get 1080p MP4 files.
joyfun-ai-image-to-video
convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. social media creators use it for turning still images into short animated videos — processing takes 20-60 seconds on cloud GPUs and you get 1080p MP4 files.