sound-fx
Generate short sound effects via ElevenLabs SFX (text-to-sound). Use when you need SFX clips like applause, canned laughter, whooshes, ambience, or short stingers, and optionally convert to WhatsApp-friendly .ogg/opus.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/javicasper/sound-fxSound FX (ElevenLabs)
Overview
Generate a sound effect from a text prompt using the ElevenLabs SFX API. Output is MP3 by default; convert to .ogg/opus for WhatsApp mobile playback.
Quick start
- Set API key:
ELEVENLABS_API_KEY(preferred) orXI_API_KEY- Or set
skills."sound-fx".env.ELEVENLABS_API_KEYin~/.clawdbot/clawdbot.json
- Generate SFX (MP3):
scripts/generate_sfx.sh --text "short audience applause" --out "/tmp/applause.mp3" --duration 1.2
- Convert to WhatsApp-friendly .ogg/opus (if needed):
ffmpeg -y -i /tmp/applause.mp3 -c:a libopus -b:a 48k /tmp/applause.ogg
Script: scripts/generate_sfx.sh
Usage
scripts/generate_sfx.sh --text "canned laughter" --out "/tmp/laugh.mp3" --duration 1.5
Notes
- Uses
POST https://api.elevenlabs.io/v1/sound-generation - Supports optional
--duration(0.5–30s). When omitted, duration is auto. - Prints
MEDIA: <path>on success for auto-attach.
Examples
- Applause:
"short audience applause" - Laughter:
"canned audience laughter" - Whoosh:
"fast whoosh" - Ambience:
"soft rain ambience"
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-javicasper-sound-fx": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
Read and search Reddit posts via web scraping of old.reddit.com. Use when Clawdbot needs to browse Reddit content - read posts from subreddits, search for topics, monitor specific communities. Read-only access with no posting or comments.
transcribe
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.