text-to-video-local-ai
Turn a two-sentence scene description into a 1080p AI-generated video just by typing what you need. Whether you're generating videos from written descriptions without cloud upload or producing quick social content, drop in your text prompts and describe the result you want. No timeline dragging, no export settings: 1-3 minutes from upload to download.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bwbernardweston18/text-to-video-local-ai
Getting Started
Send me your text prompts and I'll handle the local AI video generation. Or just describe what you're after.
Try saying:
- "turn a two-sentence scene description into a 1080p MP4"
- "generate a 10-second video clip of a sunset over mountains from this text description"
- "generate a video from a written description without cloud upload, for privacy-conscious creators"
Automatic Setup
On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".
Token: If the NEMO_TOKEN environment variable is already set, use it and skip to Session below.
Free token: Generate a UUID as a client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).
Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.
Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
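The setup steps above can be sketched in Python using only the standard library. This is a minimal sketch, not the skill's actual client: the helper names (`build_token_request`, `build_session_request`, `extract_session_id`, `post_json`, `connect`) are hypothetical, the `Content-Type` header and the 30-second timeout are assumptions, and the location of `session_id` in the reply is not stated in the text, so the sketch checks both the top level and `data`.

```python
import json
import os
import urllib.request
import uuid

BASE_URL = "https://mega-api-prod.nemovideo.ai"


def build_token_request(client_id: str) -> urllib.request.Request:
    """Build the POST that asks for an anonymous token for this client id."""
    return urllib.request.Request(
        f"{BASE_URL}/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )


def build_session_request(token: str) -> urllib.request.Request:
    """Build the POST that opens a session, using Bearer auth."""
    body = json.dumps({"task_name": "project"}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/api/tasks/me/with-session/nemo_agent",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            # Assumption: the API expects a JSON content type for this body.
            "Content-Type": "application/json",
        },
    )


def extract_session_id(reply: dict) -> str:
    """Pull session_id out of the session reply."""
    # Assumption: the text does not say where session_id sits, so check the
    # top level first and fall back to the "data" object.
    if "session_id" in reply:
        return reply["session_id"]
    return reply["data"]["session_id"]


def post_json(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON reply."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def connect() -> tuple:
    """Return (token, session_id), reusing NEMO_TOKEN when it is set."""
    token = os.environ.get("NEMO_TOKEN")
    if not token:
        reply = post_json(build_token_request(str(uuid.uuid4())))
        token = reply["data"]["token"]
    session_id = extract_session_id(post_json(build_session_request(token)))
    return token, session_id
```

Calling `connect()` performs the network requests; the builder functions can be inspected on their own without contacting the API. In line with the instructions above, a caller would show only a short status message and keep the token and raw JSON out of anything printed to the user.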
Text to Video Local AI — Generate Videos From Text Locally
This tool takes your text prompts and runs local AI video generation through a cloud rendering pipeline. You upload, describe what you want, and download the result.
Say you have a two-sentence scene description and want a 10-second video clip of a sunset over mountains: the backend processes it in about 1-3 minutes and hands you a 1080p MP4.
Tip: short, specific prompts produce more accurate video results than long, vague descriptions.
Matching Input to Actions
User prompts referencing text to video local ai, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.
| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
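The routing table above can be approximated with a small lookup. This is a rough sketch only: the function name `route`, the action labels, and the `has_file` flag are hypothetical, and plain substring matching stands in for the keyword and intent classification the skill actually uses.

```python
# Keyword groups in the same order as the table rows; the first match wins.
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export"),
    (("credits", "积分", "balance", "余额"), "credits"),
    (("status", "状态", "show tracks"), "state"),
    (("upload", "上传"), "upload"),
]


def route(message: str, has_file: bool = False) -> tuple[str, bool]:
    """Return (action, skip_sse) for a user message."""
    # A sent file counts as an upload regardless of the message text.
    if has_file:
        return ("upload", True)
    text = message.lower()
    for keywords, action in ROUTES:
        if any(keyword in text for keyword in keywords):
            return (action, True)
    # Everything else (generate, edit, add BGM...) goes through SSE.
    return ("sse", False)
```

For example, `route("send me the video")` yields `("export", True)`, while a generation request such as `route("add BGM to the sunset clip")` falls through to `("sse", False)`. Substring matching will misfire on prompts that merely mention a keyword, which is why a real implementation would also need intent classification.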
Cloud Render Pipeline Details
Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.
Base URL: https://mega-api-prod.nemovideo.ai
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-bwbernardweston18-text-to-video-local-ai": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Related Skills
free-video-generation-api
Skip the learning curve of professional editing software. Describe what you want, such as a short video clip generated from a text description on the free API tier, and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, or GIF files up to 200MB, and the AI handles video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.
google-video
Turn video clips into indexed, searchable clips with this skill. Works with MP4, MOV, AVI, and WebM files up to 500MB. Marketers use it to search for and retrieve specific moments inside video files. Processing takes 20-40 seconds on cloud GPUs and returns 1080p MP4 files.
ai-video-maker-jobs
Turn raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, and WebM files up to 500MB. Recruiters and job seekers use it to create professional job showcase or recruitment videos with AI. Processing takes 1-2 minutes on cloud GPUs and returns 1080p MP4 files.
ai-image-to-video-extender
Convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, and HEIC files up to 200MB. Content creators, marketers, and social media managers use it to turn static images into short moving video clips. Processing takes 30-60 seconds on cloud GPUs and returns 1080p MP4 files.
joyfun-ai-image-to-video
Convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, and HEIC files up to 200MB. Social media creators use it to turn still images into short animated videos. Processing takes 20-60 seconds on cloud GPUs and returns 1080p MP4 files.