ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

audio-reply

Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken response. Also responds to "speak", "say it", "voice reply".

Why use this skill?

Enable text-to-speech for OpenClaw with the audio-reply skill. Listen to web articles, conversational agent responses, and enjoy hands-free interaction.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/matrixy/audio-reply-skill
Or

What This Skill Does

The audio-reply skill enables your OpenClaw agent to speak directly to you. Powered by the high-performance MLX Audio TTS (chatterbox-turbo), this skill transforms text-based inputs and web content into natural-sounding speech. Whether you want to listen to a news article while multitasking or prefer a conversational audio response to your queries, this skill brings an auditory dimension to your agent interaction. It includes built-in safety guardrails to ensure that external content is fetched and processed securely without risking your local system integrity.

Installation

To add this capability to your agent, run the following command in your terminal: clawhub install openclaw/skills/skills/matrixy/audio-reply-skill This installs the necessary dependencies and the speech engine model, ensuring the skill is ready for immediate execution.

Use Cases

  1. Hands-Free Reading: Use the 'read it to me' command to listen to long-form articles, documentation, or blog posts without needing to look at your screen. This is ideal for commutes, chores, or visual breaks.
  2. Voice-First Interaction: Use 'talk to me' for quick updates or information retrieval where listening is more convenient than reading.
  3. Accessibility: Enhance your workflow by having complex technical responses read aloud, aiding in comprehension for auditory learners.

Example Prompts

  1. "read it to me https://example.com/latest-tech-trends"
  2. "talk to me about the current status of my project queue"
  3. "speak, please explain the concept of neural networks like I am five"

Tips & Limitations

  • Safety First: The skill enforces strict URL validation. It will block attempts to reach private networks, internal IPs (like 127.0.0.1), or links containing sensitive credentials. If a site is behind a login, copy the text and ask the agent to 'speak' the pasted excerpt instead.
  • Content Length: For best quality, keep conversational responses under 200 words. When using 'read it to me', the agent will summarize content over 500 words to ensure the audio remains punchy and relevant.
  • Cleanup: The skill automatically cleans up temporary files after playback, ensuring your machine does not become cluttered with audio fragments. Avoid using this skill to process sensitive financial or personal records, as the intermediate text/audio might persist in local logs.

Metadata

Author@matrixy
Stars1401
Views0
Updated2026-02-24
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-matrixy-audio-reply-skill": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#tts#audio#voice#accessibility#media
Safety Score: 4/5

Flags: network-access, file-write