
photo-to-video-ai-free

A cloud-based tool that turns still photos into shareable videos. Upload JPG, PNG, WEBP, or HEIC files (up to 200MB), describe what you need, and get 1080p MP4 output in 30-60 seconds. Built for social media creators who work with images.


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/bwbernardweston18/photo-to-video-ai-free


Getting Started

Send me your images and I'll handle the AI video creation. Or just describe what you're after.

Try saying:

"convert five vacation photos in JPG format into a 1080p MP4"
"turn these photos into a slideshow video with smooth transitions"
"turn my still photos into a shareable video for social media"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If the NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
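The setup steps above can be sketched in Python with only the standard library. This is a minimal sketch, not the skill's actual implementation: the helper names (`build_token_request`, `build_session_request`, `connect`) are hypothetical, the empty POST body for the token call is an assumption, and the lookup of `session_id` at either the top level or under `data` is a guess, since the text only says to save `session_id` from the response. The `extra_headers` parameter is a hypothetical hook for any additional headers a request needs.

```python
import json
import os
import urllib.request
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_token_request(client_id):
    """Build the anonymous-token request: POST with an X-Client-Id header."""
    return urllib.request.Request(
        f"{API_BASE}/api/auth/anonymous-token",
        data=b"",  # assumption: the endpoint accepts an empty body
        headers={"X-Client-Id": client_id},
        method="POST",
    )


def build_session_request(token, extra_headers=None):
    """Build the session request: Bearer auth plus a JSON body with task_name."""
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    headers.update(extra_headers or {})
    return urllib.request.Request(
        f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        data=json.dumps({"task_name": "project"}).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def _post_json(request):
    """Send a prepared request and decode the JSON reply."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def connect(extra_headers=None):
    """Return (token, session_id). Performs network calls when invoked."""
    token = os.environ.get("NEMO_TOKEN")
    if not token:
        # No token in the environment: request a free anonymous one.
        client_id = str(uuid.uuid4())
        token = _post_json(build_token_request(client_id))["data"]["token"]
    reply = _post_json(build_session_request(token, extra_headers))
    # Assumption: session_id sits at the top level or under "data".
    session_id = reply.get("session_id") or (reply.get("data") or {}).get("session_id")
    return token, session_id
```

Nothing here prints the token or the raw JSON reply, in line with the instruction to show the user only a short confirmation.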

Photo to Video AI Free — Convert Photos Into Shareable Videos

Send me your images and describe the result you want. The AI video creation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload five vacation photos in JPG format, type "turn these photos into a slideshow video with smooth transitions", and you'll get a 1080p MP4 back in roughly 30-60 seconds. All rendering happens server-side.

Worth noting: using 5-10 photos gives the best pacing for a short video.
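The stated input limits can be checked locally before anything is uploaded. The following is a sketch under assumptions: `check_photos` is a hypothetical helper, the 200MB limit is read as 200 MiB per file, and `.jpeg` is accepted alongside `.jpg` even though the text lists only JPG.

```python
from pathlib import Path

# Formats named in the description; ".jpeg" is an assumed alias for JPG.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".heic"}
MAX_BYTES = 200 * 1024 * 1024  # assumption: 200MB means 200 MiB, per file
RECOMMENDED_COUNT = range(5, 11)  # 5-10 photos


def check_photos(files):
    """Take (name, size_in_bytes) pairs and return a list of warning strings."""
    files = list(files)
    problems = []
    for name, size in files:
        ext = Path(name).suffix.lower()
        if ext not in ALLOWED_EXTENSIONS:
            problems.append(f"{name}: unsupported format {ext or '(none)'}")
        if size > MAX_BYTES:
            problems.append(f"{name}: larger than 200MB")
    if len(files) not in RECOMMENDED_COUNT:
        problems.append(f"{len(files)} photo(s): 5-10 gives the best pacing")
    return problems
```

An empty list means the batch fits the documented formats, size limit, and recommended photo count.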

Matching Input to Actions

Prompts that mention photo-to-video conversion, aspect ratio, text overlays, or audio tracks are routed to the matching action by keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌
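The routing table above can be approximated with plain keyword matching. This sketch covers only the keyword half; it does not attempt intent classification. The `route` function, the action labels, and the choice to check rows in table order are illustrative assumptions.

```python
# Rows in the same order as the table; each maps trigger phrases to an action.
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export"),
    (("credits", "积分", "balance", "余额"), "credits"),
    (("status", "状态", "show tracks"), "state"),
    (("upload", "上传"), "upload"),
]


def route(message, has_file=False):
    """Return (action, skip_sse) for a user message."""
    text = message.lower()
    for keywords, action in ROUTES:
        # A sent file counts as an upload trigger even without keywords.
        if action == "upload" and has_file:
            return (action, True)
        if any(keyword in text for keyword in keywords):
            return (action, True)
    # Everything else (generate, edit, add BGM) goes through SSE.
    return ("sse", False)
```

For example, `route("Please export the video")` yields `("export", True)`, while `route("add BGM")` falls through to `("sse", False)`.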

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-specific compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Every API call needs Authorization: Bearer <NEMO_TOKEN> plus the three attribution headers listed below. If any header is missing, exports return 402.

The attribution headers are required on every request, and their values must match this file's frontmatter:

Header	Value
`X-Skill-Source`	`photo-to-video-ai-free`
`X-Skill-Version`	frontmatter `version`
`X-Skill-Platform`	auto-detect: `clawhub` / `cursor` / `unknown` from install path
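A small helper can assemble these headers for each request. This is a sketch under assumptions: `detect_platform` and `build_headers` are hypothetical names, and matching the platform by substring of the install path is a guess at what "auto-detect from install path" means. The version string must come from the file's frontmatter and is passed in by the caller.

```python
def detect_platform(install_path):
    """Guess the platform from the install path (assumed substring match)."""
    path = install_path.lower().replace("\\", "/")
    for name in ("clawhub", "cursor"):
        if name in path:
            return name
    return "unknown"


def build_headers(token, version, install_path):
    """Return the Authorization header plus the three attribution headers."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "photo-to-video-ai-free",
        "X-Skill-Version": version,  # taken from the frontmatter `version`
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the full set in one place makes it harder to send a request with a header missing, which is the case the text says leads to a 402 on export.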


Metadata

Author: @bwbernardweston18

Stars: 4190

Updated: 2026-04-18



Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-bwbernardweston18-photo-to-video-ai-free": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety Note: ClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

free-video-generation-api

Skip the learning curve of professional editing software. Describe what you want (for example, a short video clip generated from a text description using the free API tier) and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, or GIF files up to 200MB, and the AI handles video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.

bwbernardweston18 4190

google-video

Search and index video clips with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. Marketers use it for searching and retrieving specific moments inside video files; processing takes 20-40 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-video-maker-jobs

Turn raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. Recruiters and job seekers use it for creating professional job showcase or recruitment videos using AI; processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-image-to-video-extender

Convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. Content creators, marketers, and social media managers use it for turning static images into short moving video clips; processing takes 30-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

joyfun-ai-image-to-video

Convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. Social media creators use it for turning still images into short animated videos; processing takes 20-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190