discord-voice
Real-time voice conversations in Discord voice channels with Claude AI
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/basillytton/basil-voiceDiscord Voice Plugin for Clawdbot
Real-time voice conversations in Discord voice channels. Join a voice channel, speak, and have your words transcribed, processed by Claude, and spoken back.
Features
- Join/Leave Voice Channels: Via slash commands, CLI, or agent tool
- Voice Activity Detection (VAD): Automatically detects when users are speaking
- Speech-to-Text: Whisper API (OpenAI), Deepgram, or Local Whisper (Offline)
- Streaming STT: Real-time transcription with Deepgram WebSocket (~1s latency reduction)
- Agent Integration: Transcribed speech is routed through the Clawdbot agent
- Text-to-Speech: OpenAI TTS, ElevenLabs, or Kokoro (Local/Offline)
- Audio Playback: Responses are spoken back in the voice channel
- Barge-in Support: Stops speaking immediately when user starts talking
- Auto-reconnect: Automatic heartbeat monitoring and reconnection on disconnect
Requirements
- Discord bot with voice permissions (Connect, Speak, Use Voice Activity)
- API keys for STT and TTS providers
- System dependencies for voice:
ffmpeg(audio processing)- Native build tools for
@discordjs/opusandsodium-native
Installation
1. Install System Dependencies
# Ubuntu/Debian
sudo apt-get install ffmpeg build-essential python3
# Fedora/RHEL
sudo dnf install ffmpeg gcc-c++ make python3
# macOS
brew install ffmpeg
2. Install via ClawdHub
clawdhub install discord-voice
Or manually:
cd ~/.clawdbot/extensions
git clone <repository-url> discord-voice
cd discord-voice
npm install
3. Configure in clawdbot.json
{
plugins: {
entries: {
"discord-voice": {
enabled: true,
config: {
sttProvider: "local-whisper",
ttsProvider: "openai",
ttsVoice: "nova",
vadSensitivity: "medium",
allowedUsers: [], // Empty = allow all users
silenceThresholdMs: 1500,
maxRecordingMs: 30000,
openai: {
apiKey: "sk-...", // Or use SKILLBOSS_API_KEY env var
},
},
},
},
},
}
4. Discord Bot Setup
Ensure your Discord bot has these permissions:
- Connect - Join voice channels
- Speak - Play audio
- Use Voice Activity - Detect when users speak
Add these to your bot's OAuth2 URL or configure in Discord Developer Portal.
Configuration
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-basillytton-basil-voice": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
ai-ppt-generator
Generate PPT with SkillBoss API Hub. Smart template selection based on content.
canvas-design
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.
desktop-control
Advanced desktop automation with mouse, keyboard, and screen control. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.
Nextjs
Build Next.js 15 apps with App Router, server components, caching, auth, and production patterns.
gemini
LLM one-shot Q&A, summaries, and generation via SkillBoss API Hub.