ressemble
Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/adriano-vr/ressembleWhat This Skill Does
The Resemble skill serves as a high-fidelity bridge between OpenClaw and the Resemble AI ecosystem. It provides a robust interface for both Text-to-Speech (TTS) and Speech-to-Text (STT) tasks. By leveraging Resemble's production-grade APIs, the skill allows your AI agent to produce natural-sounding voiceovers from text inputs using specific voice clones, as well as process incoming audio files to extract accurate transcriptions. This bidirectional capability is essential for building agents that function as automated customer support representatives, voice assistants, or content accessibility tools.
Installation
To integrate this skill into your environment, ensure you have the OpenClaw CLI installed and access to your Resemble AI developer portal.
- Install the package using the terminal:
clawhub install openclaw/skills/skills/adriano-vr/ressemble - Set your environment variable:
export RESEMBLE_API_KEY="your_api_key_here". Ensure this key has appropriate scopes to access both the voice synthesis and transcription endpoints. - Verify the installation by running
claw skills listto ensure the module is active.
Use Cases
- Voiceover Automation: Automatically generate audio files for video tutorials or documentation summaries.
- Accessibility Enhancement: Transcribe voice-based inputs from users, converting audio messages into actionable text for the AI logic.
- Interactive Voice Response: Build dynamic, text-driven voice assistants that respond to user queries with human-like prosody and intonation.
- Content Summarization: Process long-form audio meeting recordings and transform them into readable text minutes.
Example Prompts
- "Resemble, convert the following text 'Welcome to our platform, how can I help you today?' into an audio file using my custom voice profile."
- "Resemble, transcribe the audio file located at /data/inbound_voice_message.wav and save the text output to summary.txt."
- "Create an audio response to the user's latest query using the voice assigned to the support assistant."
Tips & Limitations
When using the TTS feature, ensure your voice_uuid is correctly mapped in your environment configuration to avoid synthesis errors. Note that the STT function utilizes asynchronous polling; for very long audio files, there may be a slight delay as the skill waits for the server-side transcription process to complete. Always monitor your API usage credits on the Resemble dashboard, as high-volume transcription can consume tokens rapidly. Keep audio files under the recommended file size limit for optimal performance.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-adriano-vr-ressemble": {
"enabled": true,
"auto_update": true
}
}
}Tags
Flags: network-access, file-read, file-write, external-api
Related Skills
designer-intelligence-station
Designer intelligence collection tool. Monitors 46 public sources (AI/hardware/mobile/design), dynamic quality-based filtering v2.1.8, generates structured daily/weekly reports. All data stored locally.
narrator-ai-cli
Create AI-narrated film/drama commentary videos via CLI. Two workflow paths (Original & Adapted narration), 100+ movies, 146 BGM tracks, 63 dubbing voices in 11 languages, 90+ narration templates. Use when creating narration videos, film commentary, short drama dubbing, or video production.
ai-workflow-red-team-lite
对 AI 自动化流程做轻量红队演练,聚焦误用路径、边界失败和数据泄露风险。;use for red-team, ai, workflow workflows;do not use for 输出可直接滥用的攻击脚本, 帮助破坏系统.
podcast-agent
Search articles on any topic, generate a two-host dialogue script, and synthesize podcast audio via TTS. Turn long reads into listenable content.
soul-weaver
AI Soul Weaver - 12 Curated Celebrity Templates + Custom Generation. Generate OpenClaw agent configurations through dialog or select from 12 world-famous tech & business leaders.