audiopod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API_KEY env var or pass api_key directly.
Why use this skill?
Use the AudioPod AI skill to generate custom music, rap, and instrumentals, perform audio stem separation, transcribe speech, and clean up noisy audio files.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/rakesh1002/audiopodWhat This Skill Does
The AudioPod AI skill for OpenClaw provides a comprehensive interface to the AudioPod AI API, a powerful suite of cloud-based audio processing tools. This skill enables users to leverage advanced artificial intelligence for creative and analytical audio tasks. Its core capabilities span across generative music creation, where users can synthesize professional-quality songs, rap, instrumentals, and loops from simple text descriptions. Additionally, it offers utility-driven audio engineering features, including stem separation for isolating vocals or instruments, high-fidelity noise reduction, precise speech-to-text transcription, and media extraction from external URLs or YouTube videos. Whether you are a content creator looking to generate royalty-free background music, a podcaster needing to transcribe interviews, or an audio engineer looking for quick AI-driven stem separation, this skill acts as a bridge between your natural language commands and high-performance audio models.
Installation
To add this capability to your agent, run the following command in your terminal:
clawhub install openclaw/skills/skills/rakesh1002/audiopod
Once installed, you must provide your API credentials to begin using the service. Sign up at https://audiopod.ai/auth/signup to receive your API key, then ensure that the AUDIOPOD_API_KEY environment variable is set in your system environment. Alternatively, you can configure the client directly within your workflow using the provided SDK patterns.
Use Cases
This skill is designed for high-impact audio workflows:
- Creative Media Production: Instantly generate custom soundtracks, background scores, or jingles for videos and social media content.
- Content Repurposing: Transcribe long-form audio or video files into readable text for blog posts or show notes.
- Post-Production Cleanup: Clean up noisy field recordings or podcast audio using AI-powered denoising.
- Audio Editing: Isolate stems from mixed tracks for remixes or re-sampling.
- Accessibility: Generate high-quality synthetic speech from text documents.
Example Prompts
- "Generate a 60-second upbeat pop song with female vocals about a sunny day at the beach."
- "Extract the audio from this YouTube link and transcribe the spoken content into a text document."
- "Take this raw audio file and separate the vocals from the background instrumental tracks."
Tips & Limitations
When using AudioPod, remember that generation quality is highly dependent on the specificity of your prompt. For music generation, include stylistic keywords like BPM, genre, mood, and instrumentation. Be aware that most tasks are processed asynchronously for longer audio clips; the SDK provides a wait_for_completion method that is essential for ensuring your agent handles longer jobs without timing out. Always ensure your account wallet is funded, as usage is based on a pay-as-you-go model. Keep in mind that heavy processing of long-duration audio files may incur higher costs and longer wait times.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-rakesh1002-audiopod": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, external-api
Related Skills
Mailmolt
Skill by rakesh1002
unsearch
Search the web, scrape content, and conduct deep research using the UnSearch API. Use when the user needs real-time web search results, content extraction from URLs, fact verification, or multi-source research for AI agents, RAG pipelines, or LLM applications.
go2gg
Use Go2.gg API for URL shortening, link analytics, QR code generation, webhooks, and link-in-bio pages. Use when the user needs to create short links, track clicks, generate QR codes, set up link-in-bio pages, or manage branded URLs. Free tier includes short links, QR codes, and analytics. Requires GO2GG_API_KEY env var. QR code generation is free without auth.