vidu-video
使用 Vidu Q3 Pro 模型生成视频。当用户想要文生视频、生成带音频的视频,或提到 vidu 时使用此 skill。
Why use this skill?
Use the Vidu-video skill to generate high-quality, audio-enabled AI videos from text. Support for multiple resolutions, custom durations, and aspect ratios.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/hexiaochun/vidu-videoWhat This Skill Does
The vidu-video skill provides a direct interface to the state-of-the-art Vidu Q3 Pro text-to-video model. This powerful agent capability allows users to generate high-quality video content from natural language descriptions. The skill supports flexible resolution settings ranging from 360p to 1080p and allows for precise control over video duration (1 to 16 seconds) and aspect ratios. A key feature of the Vidu Q3 model is its native ability to generate synchronized audio for the produced video, making it an all-in-one solution for multimedia creation.
Installation
To integrate this skill into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/hexiaochun/vidu-video
Ensure your API credentials for the underlying Fal AI infrastructure are correctly configured in your OpenClaw environment settings to enable communication with the model.
Use Cases
- Creative Content Production: Quickly draft visual concepts for short-form social media videos, advertisements, or storyboards.
- Multimedia Prototyping: Generate video sequences to test lighting, cinematography styles, or narrative flows before high-budget filming.
- Automated Media Generation: Integrate this skill into workflows that require dynamic visual storytelling based on data-driven inputs or automated narrative triggers.
- Concept Art Animation: Transform static prompts into dynamic video, perfect for conceptualizing scenes in game development or film pre-production.
Example Prompts
- "Vidu, generate a 10-second cinematic video of a futuristic cyberpunk city at night with neon lights reflecting on wet pavement."
- "Create a 5-second video of a cute cat wearing a space helmet floating in zero gravity, resolution 720p, 16:9 ratio."
- "Use the Vidu model to generate a video of a calm forest stream, make sure to include ambient audio, set the duration to 8 seconds."
Tips & Limitations
To achieve the best results, use descriptive, English-language prompts. Incorporate professional terminology like 'cinematic lighting', 'ultra-realistic', or specific camera movements to guide the model's output quality. Note that 720p and 1080p resolutions incur higher costs (62 credits/second) compared to 360p/540p (28 credits/second). While the model is highly versatile, it is bound by a maximum prompt length of 2000 characters and a 16-second duration limit per generation. If a task fails or takes time, utilize the get_task command with your unique task_id to monitor the processing status.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-hexiaochun-vidu-video": {
"enabled": true,
"auto_update": true
}
}
}Tags
Flags: external-api
Related Skills
clawdvine
Short-form video for AI agents. Generate videos using the latest models, pay with USDC via x402.
voice-ai-tts
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
youtube-summarizer
Automatically fetch YouTube video transcripts, generate structured summaries, and send full transcripts to messaging platforms. Detects YouTube URLs and provides metadata, key insights, and downloadable transcripts.
ressemble
Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.
voice-ai-tts
High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.