Official Verified

free-text-to-image-video

generate text prompts into image-based videos with this skill. Works with TXT, DOCX, PDF, copied text files up to 500MB. marketers, content creators, educators use it for generating videos from written descriptions without any source footage — processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/bwbernardweston18/free-text-to-image-video

Download Source Code (.zip)

Getting Started

Got text prompts to work with? Send it over and tell me what you need — I'll take care of the AI video generation.

Try saying:

"generate a short descriptive paragraph about a sunset over the ocean into a 1080p MP4"
"turn my text description into a video with matching images and smooth transitions"
"generating videos from written descriptions without any source footage for marketers, content creators, educators"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <token>, Content-Type: application/json, and body {"task_name":"project","language":"<detected>"}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.

Free Text to Image Video — Generate Videos from Text Descriptions

Send me your text prompts and describe the result you want. The AI video generation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload a short descriptive paragraph about a sunset over the ocean, type "turn my text description into a video with matching images and smooth transitions", and you'll get a 1080p MP4 back in roughly 1-2 minutes. All rendering happens server-side.

Worth noting: shorter, specific prompts produce more accurate visuals than vague long descriptions.

Matching Input to Actions

User prompts referencing free text to image video, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Base URL: https://mega-api-prod.nemovideo.ai

Read Full Documentation on GitHub

Metadata

Author@bwbernardweston18

Stars4190

Updated2026-04-18

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-bwbernardweston18-free-text-to-image-video": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

free-video-generation-api

Skip the learning curve of professional editing software. Describe what you want — generate a short video clip from a text description using the free API tier — and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, GIF files up to 200MB, and the AI handles AI video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.

bwbernardweston18 4190

google-video

search video clips into indexed video clips with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. marketers use it for searching and retrieving specific moments inside video files — processing takes 20-40 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-video-maker-jobs

create raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. recruiters and job seekers use it for creating professional job showcase or recruitment videos using AI — processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-image-to-video-extender

convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. content creators, marketers, social media managers use it for turning static images into short moving video clips — processing takes 30-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

joyfun-ai-image-to-video

convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. social media creators use it for turning still images into short animated videos — processing takes 20-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190