omnihuman-video
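Generate audio-driven lip-sync videos with OmniHuman v1.5. Use this skill when the user wants the person in an image to speak, be dubbed, or be lip-synced, or when they mention omnihuman.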
Why use this skill?
Generate high-quality, lip-synced talking videos from static portraits using OmniHuman v1.5 with OpenClaw.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/hexiaochun/omnihuman-video
What This Skill Does
The omnihuman-video skill lets OpenClaw call ByteDance's OmniHuman v1.5 model, which turns a static portrait into a lifelike talking video driven by audio. Given an image and an audio file, it animates the face, syncs the lips to the speech, and renders expressions. The skill manages the whole pipeline, from task submission and status tracking to rendering the final video, which makes it suitable for presentation videos, social media content, and personalized avatars.
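The submit-then-track pattern described above can be sketched as a small polling loop. This is only an illustration under stated assumptions: the status strings and the `get_status` callable are hypothetical stand-ins, not the skill's or fal.ai's actual API.

```python
import time
from typing import Callable

# Hypothetical terminal states; the real service may use different names.
DONE_STATES = {"COMPLETED", "FAILED"}


def wait_for_task(
    get_status: Callable[[], str],
    interval_s: float = 2.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status() until it returns a terminal state or the timeout elapses."""
    waited = 0.0
    while True:
        status = get_status()
        if status in DONE_STATES:
            return status
        if waited >= timeout_s:
            raise TimeoutError(f"task still {status!r} after {waited:.0f}s")
        sleep(interval_s)  # injected so callers and tests can avoid real delays
        waited += interval_s
```

Passing the status lookup in as a callable keeps the loop independent of whichever client actually talks to the service.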
Installation
To install this skill, use the following command in your terminal or OpenClaw management console:
clawhub install openclaw/skills/skills/hexiaochun/omnihuman-video
Ensure that you have the necessary API credentials configured in your environment, as this skill interacts with the fal.ai infrastructure for processing.
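A quick pre-flight check can catch a missing credential before a task is submitted. The sketch below assumes the key lives in an environment variable named `FAL_KEY`, the name conventionally read by fal's client libraries; this page does not say which variable the skill expects, so treat the name as an assumption.

```python
import os
from typing import Mapping


def require_credential(name: str = "FAL_KEY", env: Mapping[str, str] = os.environ) -> str:
    """Return the credential stored under `name`, or raise if it is missing or blank.

    The default name "FAL_KEY" is an assumption, not something this skill documents.
    """
    value = env.get(name, "").strip()
    if not value:
        raise RuntimeError(f"{name} is not set; configure it before running the skill")
    return value
```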
Use Cases
- Virtual Presentations: Convert a professional headshot into a video presentation by uploading an audio script.
- Content Creation: Bring characters to life for social media or marketing campaigns without expensive video production equipment.
- Educational Content: Create engaging, consistent tutor avatars for remote learning materials.
- Personalized Messaging: Send personalized birthday or greeting messages where the subject appears to speak the custom audio.
Example Prompts
- "Use this image [link] and this audio file [link] to generate a professional 1080p talking head video for my team presentation."
- "I need a video of the person in this image saying 'Welcome to our platform' using OmniHuman; please use 720p for faster results."
- "Create an AI-driven video using the provided portrait and the TTS audio clip I just generated. Make sure the lips are perfectly synced."
Tips & Limitations
- Image Quality: Always use clear, high-resolution front-facing or semi-profile portraits. Avoid blurry images or pictures where the face is obstructed.
- Audio Clarity: For the best results, use studio-quality voice recordings. Heavy background music or excessive environmental noise may degrade the lip-sync quality.
- Resolution vs. Time: Remember that 1080p is limited to 30 seconds of audio, while 720p supports up to 60 seconds. Choose the resolution based on your video length requirements.
- Performance: The 'turbo_mode' can be toggled for faster generation, but if visual fidelity is your priority, keep it set to false to ensure maximum model quality.
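The resolution and duration limits above can be enforced before a request is sent. The following is a minimal sketch: the 30 s and 60 s limits and the `turbo_mode` flag come from the tips, while the `image_url`, `audio_url`, and `resolution` field names and both helper functions are hypothetical and may not match the real request schema.

```python
# Maximum audio length in seconds per resolution, as stated in the tips above.
MAX_AUDIO_SECONDS = {"1080p": 30, "720p": 60}


def pick_resolution(audio_seconds: float, prefer_quality: bool = True) -> str:
    """Choose a resolution whose audio limit covers the clip."""
    if audio_seconds <= 0:
        raise ValueError("audio length must be positive")
    if prefer_quality and audio_seconds <= MAX_AUDIO_SECONDS["1080p"]:
        return "1080p"
    if audio_seconds <= MAX_AUDIO_SECONDS["720p"]:
        return "720p"
    raise ValueError(f"{audio_seconds}s exceeds the 60s limit; trim or split the audio")


def build_request(image_url: str, audio_url: str, audio_seconds: float,
                  turbo_mode: bool = False) -> dict:
    """Assemble a request payload; all field names except turbo_mode are assumptions."""
    return {
        "image_url": image_url,
        "audio_url": audio_url,
        "resolution": pick_resolution(audio_seconds),
        "turbo_mode": turbo_mode,  # keep False when visual fidelity matters most
    }
```

Validating the clip length locally avoids submitting a job that the service would reject or truncate.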
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-hexiaochun-omnihuman-video": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags
Flags: external-api
Related Skills
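Water Margin Storybook (水浒传故事小人书)
Create Water Margin picture-book stories. Uses the Nano Banana Pro model to generate hand-drawn, cartoon-style Water Margin story infographics. Use this skill when the user wants to create Water Margin story illustrations, picture books, or infographics.
style-extractor
Extract a unified style anchor (STYLE_BASE) from a reference script or reference material to keep the visuals consistent across the whole production. Use it to match a reference style, extract an art style, establish a style baseline, or generate a style asset pack.
Video Link Parser (视频链接解析)
Parse video share links to obtain watermark-free download URLs. Use this skill when the user wants to download a video, parse Douyin, Kuaishou, Xiaohongshu, or Bilibili links, or get a watermark-free video.
vidu-video
Generate videos with the Vidu Q3 Pro model. Use this skill when the user wants text-to-video, wants a video generated with audio, or mentions vidu.
character-creator
A complete workflow for creating AI characters, including a detailed character description, a text-to-image portrait, and multi-angle reference images. Uses the Jimeng 4.5 (即梦4.5) model. Use this skill when the user asks to create a character, generate a character illustration, produce character reference sheets, or needs multi-angle images of a person.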