Discord Voice Memo Upgrade
Skill by koto9x
Why use this skill?
Resolve voice memo TTS failures in Clawdbot with this essential patch. Fix block streaming issues and enable seamless audio-to-speech auto-replies.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/koto9x/discord-voice-memo-upgradeWhat This Skill Does
The Discord Voice Memo Upgrade by koto9x is a specialized core patch designed to rectify a critical bug within the Clawdbot framework. In standard deployments, the internal block-streaming mechanism sometimes interrupts the data pipeline, preventing voice memo audio payloads from reaching the Text-To-Speech (TTS) synthesis engine. This patch injects necessary logic to ensure that audio metadata is correctly parsed and passed forward, allowing your bot to interpret incoming voice memos as triggerable events for TTS auto-replies. By stabilizing the delivery of the final payload, the skill ensures seamless integration between audio input and automated speech response workflows.
Installation
Installation requires manual modification of the Clawdbot core files. It is strictly recommended to perform a full backup of the installation directory before proceeding. Users must navigate to the node_modules/clawdbot/dist/ path and replace the existing dispatch-from-config.js and tts.js files with the provided patched versions. After replacing these files, a full process restart using the clawdbot restart command is necessary to apply the changes. Please note that this is a surgical patch to the core engine, not a standard library extension, and may be overwritten if you run npm update without caution.
Use Cases
This skill is essential for power users and bot developers who rely on a hands-free interactive experience via Discord. Use this patch if your bot is currently ignoring voice messages while successfully responding to standard text inputs. It is particularly valuable for accessibility-focused implementations or automated assistants where the bot is expected to provide voice-based feedback to users who prefer leaving voice memos over typing long-form text. It resolves the specific error where the TTS pipeline fails to initiate due to stream truncation.
Example Prompts
- "(User sends a 15-second voice note describing a task) Bot: [Processes audio, converts to text, and replies with TTS-generated confirmation]"
- "(User sends a voice memo: 'What is the schedule for today?') Bot: [Analyzes audio, fetches calendar, and responds with spoken schedule]"
- "(User sends a voice note: 'Turn off the lights') Bot: [Detects command in voice memo and triggers smart home API via TTS confirmation]"
Tips & Limitations
Before deploying this in a production environment, you should clean up the debug logging statements included in the patch to maintain console clarity. Keep in mind that disabling block streaming to facilitate TTS may have minor impacts on the throughput of other real-time media processing tasks. Always monitor your bot's memory usage post-patch, as the change in payload handling might slightly increase the duration that audio buffers are held in memory before synthesis begins. If the official Clawdbot repository releases an update, verify if the patch has been merged upstream before attempting to overwrite your modified files with an official release.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-koto9x-discord-voice-memo-upgrade": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, code-execution