wavespeed-veo-31-fast
Generate and extend videos using Google's Veo 3.1 Fast model via WaveSpeed AI. Supports text-to-video, image-to-video, and video extension. Features up to 4K resolution, audio generation, and chained extensions up to 148 seconds. Use when the user wants to create videos from text or images, or extend existing Veo-generated videos.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/chengzeyi/wavespeed-veo-31-fastWhat This Skill Does
The wavespeed-veo-31-fast skill provides an interface to Google's powerful Veo 3.1 Fast model through the WaveSpeed AI API. It serves as an advanced creative tool for OpenClaw agents to generate high-fidelity, cinematic-quality videos directly from text, static images, or existing video files. Beyond simple generation, it allows for chaining video extensions to create long-form content up to 148 seconds, supports custom resolution scaling up to 4K, and can automatically generate synchronized audio for your visual assets.
Installation
To integrate this skill into your agent, use the OpenClaw CLI tool. Run the following command in your terminal:
clawhub install openclaw/skills/skills/chengzeyi/wavespeed-veo-31-fast
After installation, ensure you have set your API key in your environment variables using export WAVESPEED_API_KEY="your-api-key" to enable authorized communication with the WaveSpeed servers.
Use Cases
This skill is highly versatile for creative professionals and automation tasks. Use it for:
- Marketing & Ad Creation: Generate short, high-impact promotional videos from brand images or text slogans.
- Content Production: Create B-roll footage for YouTube videos or social media storytelling.
- Storyboarding: Rapidly iterate on cinematic sequences by extending initial video clips.
- AI Artistry: Experiment with video-to-video style transfers or character animation workflows.
Example Prompts
- "Create a 1080p, 8-second video of a cyberpunk city street in the rain with neon lights reflecting on the puddles."
- "Take this image [URL] and create a video where the subjects in the photo start walking towards the camera."
- "Extend my previous cat video. Make it look like the cat jumps over a fence and lands in a field of sunflowers."
Tips & Limitations
To maximize the quality of your output, always provide descriptive prompts that specify lighting, camera movement (e.g., 'drone shot', 'dolly zoom'), and stylistic tones. For complex scenes, use the negative_prompt parameter to avoid unwanted artifacts. Be mindful that video extension is limited to 7-second increments and a total project limit of 148 seconds. Always verify your network connectivity, as large video assets require stable outbound requests to the WaveSpeed API.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-chengzeyi-wavespeed-veo-31-fast": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api
Related Skills
wavespeed-watermark-remover
Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preserving texture and background. Supports images and videos up to 10 minutes. Use when the user wants to remove watermarks or text overlays from media.
wavespeed-face-swapper
Swap faces in images and videos using WaveSpeed AI. Supports image face swap and video face swap with multi-face targeting. Produces watermark-free results with automatic lighting and skin tone adaptation. Use when the user wants to replace a face in an image or video with another face.
wavespeed-infinitetalk
Generate talking head videos from a portrait image and audio using WaveSpeed AI's InfiniteTalk model. Produces lip-synced video up to 10 minutes long at 480p or 720p. Supports optional mask images to target specific faces and text prompts for additional guidance. Use when the user wants to animate a face with audio or create talking avatar videos.
wavespeed-minimax-speech-26
Convert text to speech using MiniMax Speech 2.6 Turbo via WaveSpeed AI. Features ultra-human voice cloning, sub-250ms latency, 40+ languages, emotion control, and 200+ voice presets. Use when the user wants to generate speech audio from text.
wavespeed-nano-banana-2
Generate and edit images using Google's Nano Banana 2 model via WaveSpeed AI. Supports text-to-image generation and image editing with natural language prompts. Features native 4K resolution, flexible aspect ratios including ultra-narrow (1:8, 8:1), multilingual text rendering, and camera-style controls. Use when the user wants to create images from text or edit existing images.