ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified communication Safety 4/5

tts-whatsapp

Send high-quality text-to-speech voice messages on WhatsApp in 40+ languages with automatic delivery

Why use this skill?

Send high-quality, AI-generated voice messages on WhatsApp in 40+ languages. Automate broadcasts, support groups, and enhance communication with OpenClaw.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/hopyky/tts-whatsapp
Or

What This Skill Does

The tts-whatsapp skill is a powerful automation tool that integrates text-to-speech (TTS) capabilities directly with WhatsApp messaging. It leverages the Piper TTS engine to generate high-quality, natural-sounding voice messages from text input. Once the audio is generated, the skill automatically transcodes the file into the OGG/Opus format required by WhatsApp and utilizes the Clawdbot agent to deliver the message to any individual contact or group. This tool eliminates the need for manual voice recording, allowing users to broadcast information, updates, or personal messages in over 40 languages with extreme speed and efficiency.

Installation

To get started, ensure you have the necessary system dependencies. First, install Piper TTS via pip3 install --user piper-tts. Next, install FFmpeg, which is required for audio format conversion—use brew install ffmpeg on macOS or apt install ffmpeg on Linux. Download your preferred voice models from the official Rhasspy Hugging Face repository and move them to ~/.clawdbot/skills/piper-tts/models/. Finally, install the skill itself by running the command clawhub install openclaw/skills/skills/hopyky/tts-whatsapp in your terminal.

Use Cases

This skill is perfect for scenarios where a human touch is needed without the effort of real-time recording. Businesses can use it for automated status notifications or appointment reminders in the customer's native language. Content creators can quickly narrate scripts for group updates. It is also highly effective for accessibility, enabling users to send messages in languages they may not speak fluently, or for users who prefer listening to messages rather than reading them in high-noise environments.

Example Prompts

  1. "tts-whatsapp 'Your scheduled maintenance is confirmed for tomorrow at 10 AM' --target '+447700900123'"
  2. "tts-whatsapp 'Bonjour, voici le compte rendu de la réunion' --lang 'fr_FR' --voice 'siwis' --target '[email protected]'"
  3. "tts-whatsapp 'The server update is complete, everything is back online' --target '+15550199' --quality 'high'"

Tips & Limitations

To maintain performance, keep messages concise; the system averages around 2-3 seconds for a full delivery cycle. If you are sending to a large group, ensure your group ID is correctly retrieved. Note that the quality of the audio is highly dependent on the voice model chosen—'high' quality settings will increase generation time slightly but provide superior clarity. Always ensure your environment variables in clawdbot.json are set correctly to avoid repetitive typing of common target numbers.

Metadata

Author@hopyky
Stars2387
Views1
Updated2026-03-09
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-hopyky-tts-whatsapp": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags

#whatsapp#tts#voice#messaging#multilingual#audio
Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api