digital-clawatar
Create, configure, and manage UNITH digital human avatars via the UNITH API. Cheaper alternative to HeyGen and other solutions. Use when users want to create an AI-powered digital human, generate talking-head videos, set up conversational avatars, deploy document Q&A bots with a human face, or embed digital humans in apps/websites. Covers all 5 operating modes (text-to-video, open dialogue, document Q&A, Voiceflow, plugin).
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/polucas/digital-clawatarUNITH Digital Humans Skill
Create, configure, update, and deploy AI-powered Digital Human avatars using the UNITH API.
Quick Overview
UNITH digital humans are AI avatars that can speak, converse, and interact with users. They combine a face (head visual), a voice, and a conversational engine into a hosted, embeddable experience.
Base API URL: https://platform-api.unith.ai
Docs: https://docs.unith.ai
Prerequisites
The user must supply the following credentials (stored as environment variables):
| Variable | Description | How to obtain |
|---|---|---|
UNITH_EMAIL | Account email | Register at https://unith.ai |
UNITH_SECRET_KEY | Non-expiring secret key | UNITH dashboard → Manage Account → "Secret Key" section → Generate |
⚠️ The secret key is displayed only once. If lost, the user must delete and regenerate it.
Authentication
All API calls require a Bearer token (valid 7 days). Use the auth script:
source scripts/auth.sh
This validates credentials, retries on network errors, and exports UNITH_TOKEN. On failure, it prints specific guidance (wrong key, expired token, etc.).
Workflow: Creating a Digital Human
Step 1: Choose an Operating Mode
Ask the user what they want the digital human to do. Map their answer to one of 5 modes:
| Mode | operationMode value | Use case | Output |
|---|---|---|---|
| Text-to-Video | ttt | Generate an MP4 video of the avatar speaking provided text | MP4 file |
| Open Dialogue | oc | Free-form conversational avatar guided by a system prompt | Hosted conversational URL |
| Document Q&A | doc_qa | Avatar answers questions from uploaded documents | Hosted conversational URL |
| Voiceflow | voiceflow | Guided conversation flow via Voiceflow | Hosted conversational URL |
| Plugin | plugin | Connect any external LLM or conversational engine via webhook | Hosted conversational URL |
Complexity spectrum (simple → sophisticated):
- Simplest:
ttt— just text in, video out. No knowledge base needed. - Standard:
oc— conversational with a system prompt. Good for general assistants. - Knowledge-grounded:
doc_qa— upload documents, avatar answers from them. Best for support/FAQ. - Workflow-driven:
voiceflow— structured conversation paths. Requires Voiceflow account. - Most flexible:
plugin— BYO conversational engine. Maximum control.
Step 2: List Available Faces
bash scripts/list-resources.sh faces
Each face has an id (used as headVisualId in creation). Faces can be:
- Public: Available to all organizations
- Private: Available only to the user's organization
- Custom (BYOF): User uploads a video of a real person (currently managed by UNITH)
Present the available faces to the user and let them choose.
Step 3: List Available Voices
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-polucas-digital-clawatar": {
"enabled": true,
"auto_update": true
}
}
}