UNITH Digital Humans Skill

Create, configure, update, and deploy AI-powered Digital Human avatars using the UNITH API.

Quick Overview

UNITH digital humans are AI avatars that can speak, converse, and interact with users. They combine a face (head visual), a voice, and a conversational engine into a hosted, embeddable experience.

Base API URL: https://platform-api.unith.ai Docs: https://docs.unith.ai

Prerequisites

The user must supply the following credentials (stored as environment variables):

Variable	Description	How to obtain
`UNITH_EMAIL`	Account email	Register at https://unith.ai
`UNITH_SECRET_KEY`	Non-expiring secret key	UNITH dashboard → Manage Account → "Secret Key" section → Generate

⚠️ The secret key is displayed only once. If lost, the user must delete and regenerate it.

Authentication

All API calls require a Bearer token (valid 7 days). Use the auth script:

source scripts/auth.sh

This validates credentials, retries on network errors, and exports UNITH_TOKEN. On failure, it prints specific guidance (wrong key, expired token, etc.).

Workflow: Creating a Digital Human

Step 1: Choose an Operating Mode

Ask the user what they want the digital human to do. Map their answer to one of 5 modes:

Mode	`operationMode` value	Use case	Output
Text-to-Video	`ttt`	Generate an MP4 video of the avatar speaking provided text	MP4 file
Open Dialogue	`oc`	Free-form conversational avatar guided by a system prompt	Hosted conversational URL
Document Q&A	`doc_qa`	Avatar answers questions from uploaded documents	Hosted conversational URL
Voiceflow	`voiceflow`	Guided conversation flow via Voiceflow	Hosted conversational URL
Plugin	`plugin`	Connect any external LLM or conversational engine via webhook	Hosted conversational URL

Complexity spectrum (simple → sophisticated):

Simplest: ttt — just text in, video out. No knowledge base needed.
Standard: oc — conversational with a system prompt. Good for general assistants.
Knowledge-grounded: doc_qa — upload documents, avatar answers from them. Best for support/FAQ.
Workflow-driven: voiceflow — structured conversation paths. Requires Voiceflow account.
Most flexible: plugin — BYO conversational engine. Maximum control.

Step 2: List Available Faces

bash scripts/list-resources.sh faces

Each face has an id (used as headVisualId in creation). Faces can be:

Public: Available to all organizations
Private: Available only to the user's organization
Custom (BYOF): User uploads a video of a real person (currently managed by UNITH)

Present the available faces to the user and let them choose.

digital-clawatar

Install via CLI (Recommended)

UNITH Digital Humans Skill

Quick Overview

Prerequisites

Authentication

Workflow: Creating a Digital Human

Step 1: Choose an Operating Mode

Step 2: List Available Faces

Step 3: List Available Voices

Metadata

Related Skills

clawshell