What This Skill Does

The SiliconFlow Media skill is a multi-modal AI interface for running generative media workflows directly within OpenClaw. It acts as a single bridge to the SiliconFlow API, with tools for image generation (FLUX and Qwen models), video synthesis (the Wan-AI suite), text-to-speech (TTS) conversion, and automatic speech recognition (ASR). Because operations draw on pre-allocated vouchers, users can integrate AI-generated media into their automation pipelines without manual payment handling.

Installation

To add this skill to your OpenClaw environment, run the following command in your terminal:

    clawhub install openclaw/skills/skills/axdlee/siliconflow-media

Set SILICONFLOW_API_KEY in your environment variables before running any script so that your requests can be authenticated.
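A pre-flight check can catch a missing key before any script makes a request. The following is a minimal sketch, not part of the skill itself: the helper name require_api_key is hypothetical, and only the variable name SILICONFLOW_API_KEY comes from the text above.

```python
import os

ENV_VAR = "SILICONFLOW_API_KEY"  # variable name taken from the installation notes


def require_api_key(env=None):
    """Return the API key, or raise a clear error if it is missing or blank.

    `env` defaults to os.environ; passing a dict makes the check easy to test.
    This helper is hypothetical and is not shipped with the skill.
    """
    env = os.environ if env is None else env
    key = env.get(ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(f"{ENV_VAR} is not set; export it before running any script")
    return key


if __name__ == "__main__":
    try:
        require_api_key()
        print(f"{ENV_VAR} is set")
    except RuntimeError as err:
        print(err)
```

Calling the helper at the top of a wrapper script surfaces the configuration problem right away, instead of as an authentication error from the API.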

Use Cases

This skill is suited to content creators and developers who want programmatic media production. You can use it to generate custom assets for marketing, automate video narrations using varied voice synthesis models, transcribe meeting recordings or audio clips through speech-to-text models, or perform batch image generation tasks. It is particularly useful in multi-step automation sequences, for example converting text inputs into audio files and then merging them into generated video clips.

Example Prompts

"Generate a high-quality image of a futuristic city using the FLUX model and save it as city_concept.png."
"Convert this text file into an MP3 audio clip using the Fish Speech model."
"Transcribe the audio file recording.mp3 using the SenseVoice model and save the output."

Tips & Limitations

Performance: Image generation is fast (5-10 seconds), but video generation is resource-intensive and can take up to 5 minutes per request.
File Handling: Every script prints a 'MEDIA:' log line, which the OpenClaw agent uses to attach the resulting file to your chat interface.
Cost: All operations are charged against your existing voucher balance (currently 3000+), so no per-request payment step is needed.
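If you call the scripts from your own automation rather than through the agent, the 'MEDIA:' log line can be picked out of the script output in a similar way. The sketch below assumes each such line has the form "MEDIA: <path>"; the exact format is not documented here, and the function names are hypothetical. The default timeout of 300 seconds mirrors the 5-minute upper bound quoted for video generation, so you may want some headroom.

```python
import subprocess

MEDIA_PREFIX = "MEDIA:"  # prefix named in the tips; the rest of the line format is assumed


def extract_media_paths(output):
    """Collect the paths that follow a 'MEDIA:' prefix in script output."""
    paths = []
    for line in output.splitlines():
        line = line.strip()
        if line.startswith(MEDIA_PREFIX):
            path = line[len(MEDIA_PREFIX):].strip()
            if path:  # skip a bare 'MEDIA:' line with no path
                paths.append(path)
    return paths


def run_and_collect(cmd, timeout_s=300):
    """Run a script and return its MEDIA paths.

    Raises subprocess.TimeoutExpired if the script runs longer than timeout_s.
    """
    result = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
    return extract_media_paths(result.stdout)
```

For example, extract_media_paths("done\nMEDIA: city_concept.png") returns ["city_concept.png"].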
