vap-media
AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.
Why use this skill?
Automate image, video, and music generation with VAP Media for OpenClaw. The skill gives access to Flux, Veo 3.1, and Suno V5; Full Mode removes the trial tier's daily generation limit.
What This Skill Does
VAP Media is an orchestration layer within OpenClaw that provides unified access to generative models through the VAP API. It wraps the underlying API interactions so users can generate visual and audio content on demand: images via Black Forest Labs' Flux.2 Pro, video via Google's Veo 3.1, and music via Suno V5. The skill handles request routing, task polling, and status management automatically. It supports two operational modes: 'Free Mode' for quick, rate-limited prototyping, and 'Full Mode' for unlimited workflows, including editing tasks such as inpainting, upscaling, and background removal.
Installation
To integrate this skill into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/elestirelbilinc-sketch/vap-multimedia-generation
Once installed, set the API key from your paid subscription as an environment variable: export VAP_API_KEY='your_api_key_here'. Without this key, the agent defaults to the trial tier, which is limited to 3 generations per day.
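The key-based fallback described above can be sketched in a few lines of Python. This is only an illustration, not the skill's actual code: the helper name select_mode and the "full"/"free" labels are assumptions, while the VAP_API_KEY variable name and the limit of 3 generations per day come from the text.

```python
import os
from typing import Mapping, Optional

FREE_DAILY_LIMIT = 3  # trial-tier limit stated in the installation notes


def select_mode(env: Optional[Mapping[str, str]] = None) -> str:
    """Return "full" when VAP_API_KEY is set and non-empty, else "free".

    Hypothetical helper for illustration; the skill's real logic may differ.
    """
    env = os.environ if env is None else env
    # Treat a missing or whitespace-only key as "no key".
    key = env.get("VAP_API_KEY", "").strip()
    return "full" if key else "free"
```

Calling select_mode() with no argument reads the real process environment; passing a dict makes the check easy to try without changing your shell.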
Use Cases
- Content Creation: Rapidly generate social media assets, marketing imagery, and stock footage without leaving your chat interface.
- Rapid Prototyping: Visualize UI/UX mockups, storyboard video concepts, or generate mood-setting soundtracks for creative projects.
- Asset Refinement: Utilize built-in editing tools to upscale low-resolution images, remove distracting backgrounds from product photography, or trim existing video clips for precise narrative flow.
Example Prompts
- "Generate a cinematic 16:9 landscape photo of a neon-lit cyberpunk city street in the rain, high quality."
- "Create an 8-second video of a golden retriever running through a meadow, resolution 1080p, and include ambient audio."
- "Compose a lo-fi hip hop track with a melancholic piano melody suitable for a rainy evening."
Tips & Limitations
- Aspect Ratio Detection: The skill infers the aspect ratio from spatial cues in your prompt. For instance, mentioning 'wide landscape' selects the 16:9 aspect ratio automatically, reducing the need for manual parameter tuning.
- Tier Requirements: Note that video and music generation features are reserved for Tier 2+ access. If you are operating under the free trial, attempting to invoke these endpoints will return an error.
- Polling Efficiency: All operations are asynchronous, so make sure your workflow accounts for the polling mechanism. Large video generation tasks may take longer to reach the 'completed' state; always check the status key in the response object before attempting to access the output_url.
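The polling advice above can be sketched as a small Python loop. It is a sketch under stated assumptions, not the VAP API client: fetch_task is a caller-supplied function that returns the latest task response as a dict, the interval and timeout defaults are arbitrary, and the 'failed' status is hypothetical. Only the status key, the 'completed' state, and output_url come from the text.

```python
import time
from typing import Any, Callable, Dict


def wait_for_output(
    fetch_task: Callable[[], Dict[str, Any]],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until the task reports 'completed', then return its output_url.

    fetch_task is supplied by the caller; how it talks to the VAP API is
    out of scope here. The 'failed' status is an assumption.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        task = fetch_task()
        status = task.get("status")
        if status == "completed":
            # Only read output_url once the task is actually done.
            return task["output_url"]
        if status == "failed":
            raise RuntimeError(f"task failed: {task!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError("task did not complete before the timeout")
        sleep(interval_s)
```

Injecting sleep keeps the loop testable without real delays; for long video jobs a larger interval_s reduces the number of status requests.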
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-elestirelbilinc-sketch-vap-multimedia-generation": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags: AI
Flags: external-api
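If you would rather merge the plugin entry into an existing clawhub.json than paste it by hand, a short Python script can do it. This is a minimal sketch: the enable_plugin helper is hypothetical, and it assumes clawhub.json is a plain JSON file with a top-level "plugins" object, as in the snippet above. The plugin identifier is copied from that snippet.

```python
import json
from pathlib import Path
from typing import Any, Dict

PLUGIN_ID = "official-elestirelbilinc-sketch-vap-multimedia-generation"


def enable_plugin(config_path: Path) -> Dict[str, Any]:
    """Add or update the plugin entry in clawhub.json, keeping other keys."""
    config: Dict[str, Any] = {}
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    # Create the "plugins" object if the file does not have one yet.
    plugins = config.setdefault("plugins", {})
    plugins[PLUGIN_ID] = {"enabled": True, "auto_update": True}
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, enable_plugin(Path("clawhub.json")) updates the file in the current directory and leaves entries for other plugins untouched.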