Official Verified media Safety 4/5

listenhub

Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video", "read this aloud", "generate an image", or share knowledge in audio/visual form. Supports: topic descriptions, YouTube links, article URLs, plain text, and image prompts.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/0xfango/listenhub-2

Download Source Code (.zip)

What This Skill Does

ListenHub is a transformative AI agent skill designed to bridge the gap between static information and dynamic, multi-modal content. It acts as an abstraction layer that converts raw input—ranging from YouTube URLs and long-form articles to simple text notes—into professional-grade audio and visual formats. By automating the production process, ListenHub eliminates the need for complex software or manual editing, turning your raw ideas into high-quality output automatically.

The skill operates via a robust shell-script interface, ensuring that all interactions with the proprietary Marswave API are handled securely and consistently. Whether you are aiming to create a two-person conversational podcast to explore deep topics, an explainer video with AI-generated visuals for a product launch, or a pure text-to-speech audio file, ListenHub handles the technical heavy lifting.

Installation

To install the ListenHub skill, ensure your OpenClaw environment is properly configured. Run the following command in your terminal:

clawhub install openclaw/skills/skills/0xfango/listenhub-2

Once installed, the skill will register its scripts in your designated local directory. You can verify the installation by listing the contents of your **/skills/listenhub/scripts/ folder to ensure the necessary shell scripts are available for execution.

Use Cases

ListenHub excels in various professional and creative workflows:

Education & Learning: Convert complex documentation or long-form research papers into podcasts to listen to while commuting.
Marketing & Growth: Rapidly generate explainer videos for products by feeding the agent your landing page URL.
Content Creation: Repurpose blog posts into engaging audio narrations or illustrative AI visuals.
Accessibility: Transform text-heavy articles into fluid speech for easier consumption.

Example Prompts

"Make a podcast episode about the impact of quantum computing based on this article: [URL]"
"Create an explainer video for my new task management tool using this feature list: [Text]"
"Generate an image visualizing a futuristic city at sunset for my slide deck presentation"

Tips & Limitations

Use Scripts, Not APIs: Always interact with the skill via the provided shell scripts in the /scripts/ directory. Direct API calls are not supported and will fail.
Discover Voices: Use get-speakers.sh to explore available voice options before generating long-form audio to ensure the tone matches your content.
Be Concise: While the skill handles large inputs, breaking complex topics into modular prompts often yields more coherent results.
Internal Constraints: Remember that speaker IDs and API endpoints are managed internally; never hardcode these values in your workflows.

Read Full Documentation on GitHub

Metadata

Author@0xfango

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-0xfango-listenhub-2": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#podcast#tts#media-generation#audio#ai-video

Safety Score: 4/5

Flags: external-api, code-execution

Related Skills

explainer

Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video", "explain this as a video", "tutorial video", "introduce X (video)", "解释一下XX（视频形式）".

0xfango 4473

listenhub

0xfango 4473

image-gen

Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate image", "配图", "create picture", "draw", "visualize", "generate an image".

0xfango 4473

content-parser

Extract and parse content from URLs. Triggers on: user provides a URL to extract content from, another skill needs to parse source material, "parse this URL", "extract content", "解析链接", "提取内容".

0xfango 4473

asr

Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".

0xfango 4473