ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified communication Safety 4/5

imessage-voice-reply

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not file attachments. Use when receiving a voice message in iMessage and wanting to reply with voice, enabling voice-to-voice iMessage conversations, or sending audio responses. Zero cost — all TTS runs locally. Requires BlueBubbles channel configured in OpenClaw.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/bolander72/imessage-voice-reply
Or

What This Skill Does

The imessage-voice-reply skill allows your OpenClaw agent to generate and send native, high-fidelity voice messages within iMessage. Unlike standard file attachments, these messages render exactly like those recorded in the native Messages.app, complete with the interactive waveform playback. By leveraging local Kokoro-ONNX models for text-to-speech (TTS) and macOS native 'afconvert' tools, the skill provides high-quality, low-latency audio synthesis without external cloud dependencies or API costs. This ensures your agent can participate in voice-to-voice conversations naturally, providing a human-like touch to automated responses.

Installation

Installation is streamlined through the OpenClaw ecosystem. First, ensure you have the BlueBubbles channel configured in your agent settings. Run the setup command: bash ${baseDir}/scripts/setup.sh. This script handles the installation of critical dependencies including kokoro-onnx, soundfile, and numpy. It also manages the automatic download of the necessary voice models (approx. 136MB) to your local cache. Once the setup completes, the tool is ready to be invoked by your agent's orchestration layer.

Use Cases

This skill is ideal for scenarios requiring a more personal or accessible interaction. Use it for voice-for-voice reply cycles when the recipient has initiated a voice note, or when providing audio-first responses in a conversational flow. It is particularly effective for users who prefer listening to messages rather than reading, or when tone and inflection are necessary to convey information clearly. Always consider including a brief text summary alongside the voice message for maximum accessibility and readability.

Example Prompts

  1. "The user just sent me a voice note asking for a recap of today's meeting; generate a polite voice reply acknowledging their message and provide the summary."
  2. "I need to respond to this iMessage voice thread. Please generate an audio response using the af_heart voice, confirming that I'll be home in ten minutes."
  3. "Send a friendly voice acknowledgement to this contact. Make sure to use the native iMessage bubble format so they can listen to it immediately."

Tips & Limitations

To ensure your messages render as native bubbles, you must strictly follow the output format requirements: the filename must be set to 'Audio Message.caf', the content type must be 'audio/x-caf', and the 'asVoice' parameter in the BlueBubbles tool must be set to true. While the skill supports multiple languages and voices like 'af_heart' or 'ef_dora', note that some languages have limited voice options. Keep your response texts concise to ensure optimal synthesis performance and to prevent long audio files which may feel cumbersome in a chat interface. The current implementation relies on local processing, meaning performance depends on your machine's CPU capabilities.

Metadata

Stars4190
Views1
Updated2026-04-18
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-bolander72-imessage-voice-reply": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#imessage#tts#voice-messages#automation#kokoro
Safety Score: 4/5

Flags: file-write, file-read, code-execution