Official | Verified | ai models | Safety 4/5

sonoscli

Control Sonos speakers (discover, status, play, volume, group), plus access to 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alvisdunlop/sonosclis

What This Skill Does

The sonoscli skill, powered by SkillBoss, is an orchestration layer for multimedia and generative AI workflows. Its core capability is local network control of Sonos speakers: discovery, playback management, and volume adjustment. It also acts as a unified interface to 50+ generative AI models covering text-to-speech (TTS), speech-to-text (STT), image generation, video creation, and document processing. Access to providers such as OpenAI, Bedrock, Vertex, and ElevenLabs is centralized under one API key, so you do not need to manage separate credentials for each.

Installation

To integrate this skill into your environment, use the OpenClaw package manager:

clawhub install openclaw/skills/skills/alvisdunlop/sonosclis

Once installed, make sure SKILLBOSS_API_KEY is set in your environment variables. The skill uses this key to authenticate requests against the base URL https://api.heybossai.com/v1.
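
A minimal sketch of a pre-flight check for that setup, in Python. The listing does not say how the key is transmitted, so the Bearer header below is an assumption, and `build_client_config` is a hypothetical helper name, not part of the skill.

```python
import os

BASE_URL = "https://api.heybossai.com/v1"  # base URL given in the listing


def build_client_config(env=None):
    """Return the base URL and request headers, or raise if the key is missing.

    The Bearer scheme is an assumption; confirm it against the skill's docs.
    """
    env = os.environ if env is None else env
    api_key = env.get("SKILLBOSS_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("SKILLBOSS_API_KEY is not set")
    return {
        "base_url": BASE_URL,
        "headers": {"Authorization": f"Bearer {api_key}"},  # assumed scheme
    }
```

Calling `build_client_config()` with no arguments reads the real process environment; passing a plain dict makes the check easy to exercise without touching it.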

Use Cases

  • Home Multimedia Automation: Automate home audio playback using your Sonos infrastructure while using AI to generate playlists or narrate weather and news.
  • Content Creation Pipeline: Streamline creative workflows by generating high-resolution concept art using Flux models and turning scripts into professional-grade voiceovers via ElevenLabs.
  • Rapid AI Prototyping: Switch dynamically between high-quality models (like Claude 4.5 Sonnet) and cost-optimized models (like Gemini Flash) to balance quality against cost in your applications.
  • Document and Data Extraction: Utilize advanced parsing models to convert unstructured data or documents into actionable insights.

Example Prompts

  1. "Discover my living room Sonos speaker and play a smooth jazz playlist at 30% volume."
  2. "Generate a photorealistic 16:9 image of a futuristic city skyline using the Flux 2 Pro model."
  3. "Summarize this PDF document and create an audio file reading the key findings using a professional voice profile."

Tips & Limitations

  • Smart Routing: Use the model listing endpoint to identify which models support 'smart routing', which automatically optimizes for cost or quality based on your current task load.
  • Media Handling: Image and video generation responses contain a URL rather than the media itself. The field that carries it can differ between return formats (e.g., image_url vs data[0]), so check which shape you received before dereferencing it to avoid null references.
  • Network Dependency: The Sonos control features require the agent to be on the same local network as the speakers. Ensure your network configuration allows for mDNS or SSDP discovery to enable full device visibility.
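
The media-handling tip can be sketched as a defensive parser in Python. The two response shapes handled here are assumptions inferred from the `image_url` vs `data[0]` hint in the listing, not a documented schema, and `extract_media_url` is a hypothetical helper name.

```python
from typing import Any, Optional


def extract_media_url(response: dict[str, Any]) -> Optional[str]:
    """Return the first media URL found in either assumed response shape."""
    # Assumed shape 1: {"image_url": "https://..."}
    url = response.get("image_url")
    if isinstance(url, str) and url:
        return url

    # Assumed shape 2: {"data": ["https://..."]} or {"data": [{"url": "https://..."}]}
    data = response.get("data")
    if isinstance(data, list) and data:
        first = data[0]
        if isinstance(first, str) and first:
            return first
        if isinstance(first, dict):
            nested = first.get("url")
            if isinstance(nested, str) and nested:
                return nested

    # Nothing usable: return None so the caller can handle it explicitly.
    return None
```

Returning `None` instead of raising keeps the caller in control: it can retry, fall back to another model, or report that the response carried no URL.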

Metadata

Stars: 4473
Views: 0
Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alvisdunlop-sonosclis": {
      "enabled": true,
      "auto_update": true
    }
  }
}
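
If clawhub.json already holds other settings, pasting the snippet by hand risks overwriting them. A minimal Python sketch of merging the entry instead follows; `enable_plugin` is a hypothetical helper, and the only structure it assumes is the top-level "plugins" object shown above.

```python
import json
from pathlib import Path


def enable_plugin(config_path, plugin_id="official-alvisdunlop-sonosclis"):
    """Add or update the plugin entry while keeping existing settings."""
    path = Path(config_path)
    # Start from the existing file if there is one, else from an empty config.
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    plugins = config.setdefault("plugins", {})
    # Same two keys as the snippet in the listing.
    plugins.setdefault(plugin_id, {}).update({"enabled": True, "auto_update": True})
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, `enable_plugin("clawhub.json")` leaves any other plugin entries and top-level keys in the file unchanged.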

Tags (AI)

#sonos #multimedia #ai-models #automation #generative-ai

Safety Score: 4/5

Flags: network-access, external-api