openclaw-media-gen
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/aisadocs/openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-imageWhat This Skill Does
The openclaw-media-gen skill serves as a unified interface for high-fidelity generative AI media creation. By leveraging a single AIsa API key, users can access Gemini 3 Pro for advanced image synthesis and Qwen Wan 2.6 (Tongyi Wanxiang) for professional-grade video generation. This tool bridges the gap between complex model-specific APIs and developer-friendly workflows, providing asynchronous video task management and high-quality image generation in one package.
Installation
To install this skill, use the ClawHub CLI within your project environment:
clawhub install openclaw/skills/skills/aisadocs/openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-image
After installation, export your credentials to your shell environment variables:
export AISA_API_KEY="your-key-here"
Use Cases
- Marketing Assets: Rapidly generate high-quality images and short promotional video clips for social media campaigns.
- Concept Prototyping: Create cinematic visual references for character design or storyboarding with precise camera control prompts.
- Production Workflows: Integrate cost-effective video generation into automated pipelines, benefiting from contract pricing that is significantly lower than official Bailian or standalone provider rates.
- Educational Content: Generate illustrative visual aids for presentations or documentation that require specific, generated imagery.
Example Prompts
- "Generate an image of a futuristic mechanical hummingbird resting on a neon-lit cybernetic flower in a high-tech laboratory setting."
- "Create a 5-second video using the reference image I provided: add a slow-motion rain effect and have the camera perform a gentle pan to the left, cinematic style."
- "Generate an image of a serene mountain landscape at sunset, 8k resolution, photorealistic style, with vibrant orange and purple sky tones."
Tips & Limitations
- Async Handling: Video generation via Wan 2.6 is an asynchronous task. Ensure your application logic includes a polling mechanism to check the task status via the API after the initial request.
- Cost Efficiency: This skill is currently one of the most cost-effective methods for high-resolution 1080P video generation compared to OpenRouter or direct official API channels.
- API Constraints: Always verify the
inline_datafield in your Gemini image responses, as the binary data is encoded in base64 format and will require decoding before saving as a file on your local machine. Monitor your API credit usage regularly as video generation tasks consume balance based on seconds of footage generated.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-aisadocs-openclaw-aisa-llm-image-video-qwen-wan26-gemini-3-pro-image": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api, file-write, network-access
Related Skills
aisa-tavily
AI-optimized web search via AIsa's Tavily API proxy. Returns concise, relevant results for AI agents through AIsa's unified API gateway.
openclaw-search
Intelligent search for agents. Multi-source retrieval with confidence scoring - web, academic, and Tavily in one unified API.
Twitter Command Center (Search + Post)
Searches and reads X (Twitter): profiles, timelines, mentions, followers, tweet search, trends, lists, communities, and Spaces. Publishes posts after the user completes OAuth in the browser. Use when the user asks about Twitter/X data, social listening, or posting without sharing account passwords.
openclaw-media-gen
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
stock-rumors
Scan M&A, insider, analyst, social, and regulatory rumor signals through AISA. Use when: the user asks about early market signals, rumors, insider activity, analyst changes, or takeover chatter.