clonev
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).
Why use this skill?
Clone any voice and generate speech in 14+ languages with CloneV. A simple, local, and powerful tool for personalized audio generation and voice synthesis.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/instant-picture/clonevWhat This Skill Does
The CloneV skill is a powerful, straightforward tool designed to perform voice cloning and text-to-speech synthesis using the Coqui XTTS v2 engine. Unlike complex configurations, CloneV simplifies the process into a single, automated script. By providing a short WAV file (ideally 6-30 seconds long), users can capture the tonal characteristics, cadence, and unique sound of any speaker. Once the voice profile is analyzed, the skill generates natural-sounding speech in over 14 supported languages, including English, Czech, German, French, Spanish, and more. This makes it an ideal solution for creating personalized voice messages, generating synthetic narration for creative projects, or simply testing the capabilities of neural voice synthesis without managing complex server dependencies.
Installation
To install this skill, use the ClawHub command-line interface. Run the following command in your terminal:
clawhub install openclaw/skills/skills/instant-picture/clonev
Ensure that you have sufficient disk space for temporary audio file generation, as the system will output the resulting audio as an OGG file.
Use Cases
- Personalized Messaging: Create voice messages that sound like you for automated notifications, even when you aren't available to speak.
- Content Creation: Generate voiceovers for videos, presentations, or digital characters using a specific, consistent voice.
- Language Learning & Practice: Listen to text spoken in a cloned voice across multiple languages, making it a great tool for accessibility or language experimentation.
- Accessibility: Provide a voice for users who cannot speak, allowing them to communicate naturally using a synthesized version of their own captured audio.
Example Prompts
- "Clone the voice from my sample at /home/user/voice.wav and say 'Your meeting is starting in five minutes' in English."
- "Use the sample recording I sent earlier to generate a Czech greeting that says 'Dobrý den, jak se máte?' and save the output."
- "Create a voice message using my uploaded voice sample that says 'I am currently away from my desk' and send it as a telegram audio message."
Tips & Limitations
- Voice Quality: Always use a clean, high-quality WAV file. Background noise in your sample will be carried over into the generated output, potentially degrading quality.
- Length Requirements: Keep your samples between 6 and 30 seconds. Samples that are too short may lack the necessary data to accurately capture voice patterns, while overly long samples may process slower.
- Do Not Use External APIs: The skill is optimized to run locally via
scripts/clonev.sh. Do not attempt to reconfigure the internal Docker containers, as they are managed by the script wrapper. - Language Support: While 14+ languages are supported, clarity may vary based on the linguistic structure of the language and the complexity of the source audio sample.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-instant-picture-clonev": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, code-execution
Related Skills
ddg
Use ddgr (DuckDuckGo from the terminal) to perform privacy-focused web searches from the command line. Use when the user wants to (1) Search the web from the terminal, (2) Perform DuckDuckGo searches without opening a browser, (3) Get quick search results in text format, (4) Search privately without tracking, (5) Use DuckDuckGo bangs (!) from terminal.
qmd
Local search/indexing CLI (BM25 + vectors + rerank) with MCP mode.