Official Verified ai models Safety 4/5

alicloud-ai-audio-cosyvoice-voice-clone

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/cinience/alicloud-ai-audio-cosyvoice-voice-clone

Download Source Code (.zip)

What This Skill Does

The alicloud-ai-audio-cosyvoice-voice-clone skill provides an interface to the Alibaba Cloud Model Studio CosyVoice customization service. It enables users to perform voice enrollment, allowing the creation of high-fidelity cloned voices from provided reference audio samples. Once a reference audio sample is submitted, the service processes the voice signature and returns a unique voice_id. This voice_id is essential for subsequent text-to-speech (TTS) tasks within the OpenClaw ecosystem, ensuring that generated speech consistently maintains the cloned voice profile. The skill supports advanced models like cosyvoice-v3.5-plus and cosyvoice-v3.5-flash, which provide state-of-the-art voice synthesis capabilities.

Installation

To install this skill, use the OpenClaw command-line interface: clawhub install openclaw/skills/skills/cinience/alicloud-ai-audio-cosyvoice-voice-clone

Ensure you have configured your authentication credentials by setting the DASHSCOPE_API_KEY environment variable or by adding your credentials to the ~/.alibabacloud/credentials file. This is a strict prerequisite for successful API communication with the Alibaba Cloud backend.

Use Cases

This skill is designed for scenarios requiring personalized audio experiences, such as:

Creating custom AI avatars that speak with a specific, recognizable human voice.
Generating consistent brand voices for corporate videos, product demonstrations, or interactive customer service agents.
Developing localized audio content where a specific speaker's tone and prosody must be preserved across different languages.
Prototyping voice-interactive applications by cloning specific personas for character testing.

Example Prompts

"Clone the voice from this sample audio at https://example.com/speaker1.wav for my project, use the cosyvoice-v3.5-plus model, and assign it the prefix 'myBrandVoice'."
"I need a new voice clone for Chinese language content. Please use the sample at https://example.com/audio.wav with the cosyvoice-v3.5-flash model and set the language hint to 'zh'."
"Enroll a new custom voice with the prefix 'agent01' using the audio file located at https://example.com/reference.mp3 for the latest cosyvoice-v3.5-plus model."

Tips & Limitations

Regional Requirements: Be aware that high-end models like cosyvoice-v3.5-plus are currently limited to the China mainland deployment. Ensure your regional settings match the requirements of the model you select.
Voice Consistency: Always use the same target_model during the enrollment phase and the subsequent TTS synthesis phase; mismatching these will result in operational failures.
Quota Management: Each enrollment consumes credits. Avoid unnecessary API calls by reusing existing voice_id tokens once a voice has been successfully cloned.
Audio Quality: Ensure the reference audio provided is high quality, clear, and without significant background noise to maximize the fidelity of the final cloned voice.

Read Full Documentation on GitHub

Metadata

Author@cinience

Stars3562

Updated2026-03-29

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cinience-alicloud-ai-audio-cosyvoice-voice-clone": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#audio#voice-cloning#tts#alicloud#cosyvoice

Safety Score: 4/5

Flags: network-access, external-api

Related Skills

volcengine-compute-ecs

Manage Volcengine ECS instances and related resources. Use when users need instance inventory, lifecycle operations, troubleshooting, or automation templates for ECS.

cinience 3562

alicloud-ai-search-opensearch

Use OpenSearch vector search edition via the Python SDK (ha3engine) to push documents and run HA/SQL searches. Ideal for RAG and vector retrieval pipelines in Claude Code/Codex.

cinience 3562

alicloud-storage-oss-ossutil

Alibaba Cloud OSS CLI (ossutil 2.0) skill. Install, configure, and operate OSS from the command line based on the official ossutil overview.

cinience 3562

alicloud-platform-openapi-product-api-discovery

Discover and reconcile Alibaba Cloud product catalogs from Ticket System, Support & Service, and BSS OpenAPI; fetch OpenAPI product/version/API metadata; and summarize API coverage to plan new skills. Use when you need a complete product list, product-to-API mapping, or coverage/gap reports for skill generation.

cinience 3562

alicloud-ai-image-qwen-image

Generate images with Model Studio DashScope SDK using Qwen Image generation models (qwen-image, qwen-image-plus, qwen-image-max and snapshots). Use when implementing or documenting image.generate requests/responses, mapping prompt/negative_prompt/size/seed/reference_image, or integrating image generation into the video-agent pipeline.

cinience 3562