ai-music-video-generator
Get synced music videos ready to post, without touching a single slider. Upload your audio files (MP3, WAV, AAC, FLAC, up to 200MB), say something like "create a music video for my song with visuals that match the beat and mood", and download 1080p MP4 when it's done. Built for musicians and content creators who move fast and want professional music videos without hiring a videographer.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bwbernardweston18/ai-music-video-generatorGetting Started
Ready when you are. Drop your audio files here or describe what you want to make.
Try saying:
- "generate a 3-minute MP3 song file into a 1080p MP4"
- "create a music video for my song with visuals that match the beat and mood"
- "generating visual music videos from audio tracks for musicians and content creators"
Quick Start Setup
This skill connects to a cloud processing backend. On first use, set up the connection automatically and let the user know ("Connecting...").
Token check: Look for NEMO_TOKEN in the environment. If found, skip to session creation. Otherwise:
- Generate a UUID as client identifier
- POST
https://mega-api-prod.nemovideo.ai/api/auth/anonymous-tokenwithX-Client-Idheader - Extract
data.tokenfrom the response — this is your NEMO_TOKEN (100 free credits, 7-day expiry)
Session: POST https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Keep the returned session_id for all operations.
Let the user know with a brief "Ready!" when setup is complete. Don't expose tokens or raw API output.
AI Music Video Generator — Turn Songs Into Synced Videos
This tool takes your audio files and runs AI music video creation through a cloud rendering pipeline. You upload, describe what you want, and download the result.
Say you have a 3-minute MP3 song file and want to create a music video for my song with visuals that match the beat and mood — the backend processes it in about 1-3 minutes and hands you a 1080p MP4.
Tip: shorter tracks under 2 minutes process significantly faster.
Matching Input to Actions
User prompts referencing ai music video generator, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.
| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
Cloud Render Pipeline Details
Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.
Headers are derived from this file's YAML frontmatter. X-Skill-Source is ai-music-video-generator, X-Skill-Version comes from the version field, and X-Skill-Platform is detected from the install path (~/.clawhub/ = clawhub, ~/.cursor/skills/ = cursor, otherwise unknown).
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bwbernardweston18-ai-music-video-generator": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
free-video-generation-api
Skip the learning curve of professional editing software. Describe what you want — generate a short video clip from a text description using the free API tier — and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, GIF files up to 200MB, and the AI handles AI video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.
google-video
search video clips into indexed video clips with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. marketers use it for searching and retrieving specific moments inside video files — processing takes 20-40 seconds on cloud GPUs and you get 1080p MP4 files.
ai-video-maker-jobs
create raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. recruiters and job seekers use it for creating professional job showcase or recruitment videos using AI — processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.
ai-image-to-video-extender
convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. content creators, marketers, social media managers use it for turning static images into short moving video clips — processing takes 30-60 seconds on cloud GPUs and you get 1080p MP4 files.
joyfun-ai-image-to-video
convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. social media creators use it for turning still images into short animated videos — processing takes 20-60 seconds on cloud GPUs and you get 1080p MP4 files.