video-message
Generate and send video messages with a lip-syncing VRM avatar. Use when user asks for video message, avatar video, video reply, or when TTS should be delivered as video instead of audio.
Why use this skill?
Learn how to use the OpenClaw video-message skill to create and send lip-synced VRM avatar videos as Telegram video notes for a more personal AI experience.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/thewulf7/avatar-video-messagesWhat This Skill Does
The video-message skill enables your OpenClaw agent to transform text or audio into dynamic, lip-synced video content using VRM (Virtual Reality Modeling language) avatars. By leveraging the avatarcam engine, the agent can render a personalized character that speaks with natural expression. Once generated, the agent can transmit these files as Telegram video notes, providing a circular, high-engagement video format that feels more human and interactive than simple text or static audio responses. This skill manages the entire lifecycle of the process, from text-to-speech synthesis and headless video rendering to automated delivery.
Installation
To install this skill, use the ClawKit CLI: clawhub install openclaw/skills/skills/thewulf7/avatar-video-messages. You must also ensure that the required system dependencies—specifically ffmpeg—are installed on your host machine. For Linux servers, ensure xvfb and xauth are configured to allow headless rendering. Configure your default avatar and background settings in the TOOLS.md file to customize the look of your agent's persona.
Use Cases
This skill is perfect for scenarios requiring a personal touch. Use it for personalized greeting messages for new users, automated status updates that require a human-like presence, or educational tutorials where facial expressions help clarify complex topics. It is also an excellent tool for digital receptionists or brand mascots that need to deliver short, conversational updates across messaging platforms.
Example Prompts
- "Send a video message to the client explaining that we have received their files using your avatar."
- "Record a short video reply for the group chat saying that I will be late for the meeting today."
- "Use the avatar to explain the current status of the server in a video message."
Tips & Limitations
- Duration: Keep scripts concise, as video notes are optimized for short-form communication (maximum 60 seconds).
- Performance: Rendering is resource-intensive. Ensure your machine has adequate RAM and CPU power, or use a Docker container with the recommended apt packages to maintain system stability.
- Customization: Experiment with the
backgroundsetting to match your brand colors or use specific environment images for context. Always ensure your VRM file is valid and correctly placed in your project directory.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-thewulf7-avatar-video-messages": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, code-execution