listenhub
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video", "read this aloud", "generate an image", or share knowledge in audio/visual form. Supports: topic descriptions, YouTube links, article URLs, plain text, and image prompts.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/0xfango/listenhub-2What This Skill Does
ListenHub is a transformative AI agent skill designed to bridge the gap between static information and dynamic, multi-modal content. It acts as an abstraction layer that converts raw input—ranging from YouTube URLs and long-form articles to simple text notes—into professional-grade audio and visual formats. By automating the production process, ListenHub eliminates the need for complex software or manual editing, turning your raw ideas into high-quality output automatically.
The skill operates via a robust shell-script interface, ensuring that all interactions with the proprietary Marswave API are handled securely and consistently. Whether you are aiming to create a two-person conversational podcast to explore deep topics, an explainer video with AI-generated visuals for a product launch, or a pure text-to-speech audio file, ListenHub handles the technical heavy lifting.
Installation
To install the ListenHub skill, ensure your OpenClaw environment is properly configured. Run the following command in your terminal:
clawhub install openclaw/skills/skills/0xfango/listenhub-2
Once installed, the skill will register its scripts in your designated local directory. You can verify the installation by listing the contents of your **/skills/listenhub/scripts/ folder to ensure the necessary shell scripts are available for execution.
Use Cases
ListenHub excels in various professional and creative workflows:
- Education & Learning: Convert complex documentation or long-form research papers into podcasts to listen to while commuting.
- Marketing & Growth: Rapidly generate explainer videos for products by feeding the agent your landing page URL.
- Content Creation: Repurpose blog posts into engaging audio narrations or illustrative AI visuals.
- Accessibility: Transform text-heavy articles into fluid speech for easier consumption.
Example Prompts
- "Make a podcast episode about the impact of quantum computing based on this article: [URL]"
- "Create an explainer video for my new task management tool using this feature list: [Text]"
- "Generate an image visualizing a futuristic city at sunset for my slide deck presentation"
Tips & Limitations
- Use Scripts, Not APIs: Always interact with the skill via the provided shell scripts in the
/scripts/directory. Direct API calls are not supported and will fail. - Discover Voices: Use
get-speakers.shto explore available voice options before generating long-form audio to ensure the tone matches your content. - Be Concise: While the skill handles large inputs, breaking complex topics into modular prompts often yields more coherent results.
- Internal Constraints: Remember that speaker IDs and API endpoints are managed internally; never hardcode these values in your workflows.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-0xfango-listenhub-2": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api, code-execution
Related Skills
explainer
Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video", "explain this as a video", "tutorial video", "introduce X (video)", "解释一下XX(视频形式)".
listenhub
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video", "read this aloud", "generate an image", or share knowledge in audio/visual form. Supports: topic descriptions, YouTube links, article URLs, plain text, and image prompts.
image-gen
Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate image", "配图", "create picture", "draw", "visualize", "generate an image".
content-parser
Extract and parse content from URLs. Triggers on: user provides a URL to extract content from, another skill needs to parse source material, "parse this URL", "extract content", "解析链接", "提取内容".
asr
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".