Official Verified media Safety 4/5

wavespeed-veo-31-fast

Generate and extend videos using Google's Veo 3.1 Fast model via WaveSpeed AI. Supports text-to-video, image-to-video, and video extension. Features up to 4K resolution, audio generation, and chained extensions up to 148 seconds. Use when the user wants to create videos from text or images, or extend existing Veo-generated videos.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/chengzeyi/wavespeed-veo-31-fast

Download Source Code (.zip)

What This Skill Does

The wavespeed-veo-31-fast skill provides an interface to Google's powerful Veo 3.1 Fast model through the WaveSpeed AI API. It serves as an advanced creative tool for OpenClaw agents to generate high-fidelity, cinematic-quality videos directly from text, static images, or existing video files. Beyond simple generation, it allows for chaining video extensions to create long-form content up to 148 seconds, supports custom resolution scaling up to 4K, and can automatically generate synchronized audio for your visual assets.

Installation

To integrate this skill into your agent, use the OpenClaw CLI tool. Run the following command in your terminal: clawhub install openclaw/skills/skills/chengzeyi/wavespeed-veo-31-fast After installation, ensure you have set your API key in your environment variables using export WAVESPEED_API_KEY="your-api-key" to enable authorized communication with the WaveSpeed servers.

Use Cases

This skill is highly versatile for creative professionals and automation tasks. Use it for:

Marketing & Ad Creation: Generate short, high-impact promotional videos from brand images or text slogans.
Content Production: Create B-roll footage for YouTube videos or social media storytelling.
Storyboarding: Rapidly iterate on cinematic sequences by extending initial video clips.
AI Artistry: Experiment with video-to-video style transfers or character animation workflows.

Example Prompts

"Create a 1080p, 8-second video of a cyberpunk city street in the rain with neon lights reflecting on the puddles."
"Take this image [URL] and create a video where the subjects in the photo start walking towards the camera."
"Extend my previous cat video. Make it look like the cat jumps over a fence and lands in a field of sunflowers."

Tips & Limitations

To maximize the quality of your output, always provide descriptive prompts that specify lighting, camera movement (e.g., 'drone shot', 'dolly zoom'), and stylistic tones. For complex scenes, use the negative_prompt parameter to avoid unwanted artifacts. Be mindful that video extension is limited to 7-second increments and a total project limit of 148 seconds. Always verify your network connectivity, as large video assets require stable outbound requests to the WaveSpeed API.

Read Full Documentation on GitHub

Metadata

Author@chengzeyi

Stars3840

Updated2026-04-06

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-chengzeyi-wavespeed-veo-31-fast": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#video-generation#ai-video#veo#media-production#multimedia

Safety Score: 4/5

Flags: external-api

Related Skills

wavespeed-watermark-remover

Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preserving texture and background. Supports images and videos up to 10 minutes. Use when the user wants to remove watermarks or text overlays from media.

chengzeyi 3840

wavespeed-face-swapper

Swap faces in images and videos using WaveSpeed AI. Supports image face swap and video face swap with multi-face targeting. Produces watermark-free results with automatic lighting and skin tone adaptation. Use when the user wants to replace a face in an image or video with another face.

chengzeyi 3840

wavespeed-infinitetalk

Generate talking head videos from a portrait image and audio using WaveSpeed AI's InfiniteTalk model. Produces lip-synced video up to 10 minutes long at 480p or 720p. Supports optional mask images to target specific faces and text prompts for additional guidance. Use when the user wants to animate a face with audio or create talking avatar videos.

chengzeyi 3840

wavespeed-minimax-speech-26

Convert text to speech using MiniMax Speech 2.6 Turbo via WaveSpeed AI. Features ultra-human voice cloning, sub-250ms latency, 40+ languages, emotion control, and 200+ voice presets. Use when the user wants to generate speech audio from text.

chengzeyi 3840

wavespeed-nano-banana-2

Generate and edit images using Google's Nano Banana 2 model via WaveSpeed AI. Supports text-to-image generation and image editing with natural language prompts. Features native 4K resolution, flexible aspect ratios including ultra-narrow (1:8, 8:1), multilingual text rendering, and camera-style controls. Use when the user wants to create images from text or edit existing images.

chengzeyi 3840