openai-tts
Text-to-speech via OpenAI Audio Speech API.
Why use this skill?
Convert text to high-quality human speech with the OpenAI TTS skill for OpenClaw. Supports multiple voices, file formats, and adjustable speeds.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/pors/openai-ttsWhat This Skill Does
The OpenAI TTS skill provides a robust interface for the OpenClaw AI agent to convert written text into high-quality, human-sounding speech using OpenAI's powerful audio generation engine. It leverages the /v1/audio/speech API to process natural language input and deliver synthesized audio in various formats, including mp3, opus, aac, flac, wav, and pcm. The skill is highly configurable, allowing users to switch between fast inference models like 'tts-1' and high-fidelity 'tts-1-hd' models. With support for multiple voice profiles—ranging from the neutral 'alloy' to the authoritative 'onyx'—this skill transforms static text logs or AI responses into audible, engaging content suitable for diverse applications.
Installation
To install this skill, run the following command in your terminal within the OpenClaw environment: clawhub install openclaw/skills/skills/pors/openai-tts. After installation, ensure your API credentials are configured properly. You can either export the OPENAI_API_KEY environment variable in your shell session or define it permanently within your ~/.clawdbot/clawdbot.json configuration file under the skills entry for openai-tts. Once the key is set, the skill is ready for immediate invocation via the speak.sh script located in your base directory.
Use Cases
This skill is ideal for accessibility, content creation, and real-time feedback loops. Developers can use it to add audio responses to terminal-based CLI tools, enabling hands-free operation. Content creators can quickly generate voice-overs for short videos or prototypes by piping text through the script. It is also perfect for notification systems where hearing a spoken alert provides better context than a simple beep. Educational applications may utilize the varying voice styles to simulate character dialogue or narrate long-form text summaries effectively.
Example Prompts
- "Speak the following text aloud using the nova voice: 'Your system backup has completed successfully.'"
- "Generate an mp3 file named 'morning_briefing.mp3' containing the text of my latest daily summary with the speed set to 1.1."
- "Narrate this weather report using the British-accented fable voice and save it as an opus file."
Tips & Limitations
When using the OpenAI TTS skill, always be mindful of the cost implications. The 'tts-1-hd' model provides superior quality but costs twice as much as the 'tts-1' model, which is optimized for speed and cost-efficiency. Experiment with the --speed flag, which supports values between 0.25 and 4.0, to find the perfect cadence for your audio files. Note that while the tool is versatile, it requires active network access to reach OpenAI's servers; ensure your firewall or proxy allows these requests. Always store your sensitive API keys securely rather than hardcoding them into scripts to maintain system safety.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-pors-openai-tts": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, external-api
Related Skills
research
Deep research via Gemini CLI — runs in background sub-agent so you don't burn your Claude tokens.
yc-cold-outreach
Expert in Y Combinator (YC) cold email outreach techniques based on Startup School principles. Use to draft, critique, or iterate on cold emails to potential customers, partners, or investors. Based on Aaron Epstein's methodology for high-conversion outreach.
clawdbot-release-check
Check for new clawdbot releases and notify once per new version.
b2b-first-ten
Expert in getting the first 10 B2B customers, based on Lenny Rachitsky's "First 1000 Users" research. Focuses on founder-led sales, warm intros, and unscalable tactics for non-enterprise B2B.
crucial-conversations-coach
Friendly executive life coach for crucial conversations based on the "Tools for Talking When Stakes Are High" methodology. Mimics the "Crucial Conversations Coach" GPT. Use when the user needs help preparing for, navigating, or debriefing high-stakes conversations, phrasing emails for win-win outcomes, or convincing others while maintaining safety and respect.