openclaw-media-gen
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bowen-dotcom/aisa-media-gen-skill
What This Skill Does
The openclaw-media-gen skill gives the OpenClaw agent access to generative image and video models through the AIsa API. It combines two workflows behind one interface: image generation with Gemini 3 Pro Image and video generation with Qwen Wan 2.6. Users describe the visual content they want, and the agent handles the API calls, model configuration, and task orchestration.
Installation
To add this skill to your OpenClaw environment, run the following command in your terminal:
clawhub install openclaw/skills/skills/bowen-dotcom/aisa-media-gen-skill
Then export your AIsa API key as an environment variable:
export AISA_API_KEY="your-key-here"
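Before starting a generation task, it can help to confirm that the key is actually visible to the process. The following is a minimal sketch, not part of the skill itself: the helper name get_aisa_api_key is hypothetical, and only the AISA_API_KEY variable name and the "your-key-here" placeholder come from the instructions above.

```python
import os
from typing import Mapping, Optional


def get_aisa_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the AIsa API key from the environment, or raise if it is missing.

    Hypothetical helper for illustration; the skill may read the key differently.
    """
    env = os.environ if env is None else env
    key = env.get("AISA_API_KEY", "").strip()
    # Treat the documentation placeholder as "not configured".
    if not key or key == "your-key-here":
        raise RuntimeError(
            'AISA_API_KEY is not set; run: export AISA_API_KEY="<your key>"'
        )
    return key
```

Calling get_aisa_api_key() with no arguments reads from the current process environment; passing a dictionary makes the check easy to exercise in tests.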
Use Cases
- Creative Asset Generation: Quickly generate concept art, textures, or marketing visuals from text descriptions.
- Cinematic Storyboarding: Utilize the video synthesis features to create short, motion-controlled clips from static reference images for film or video production prototyping.
- Content Automation: Integrate image and video creation into broader automated pipelines for social media, newsletters, or presentation materials.
- Rapid Prototyping: Iterate on visual ideas quickly without setting up a separate GPU environment.
Example Prompts
- "Generate a high-resolution, cinematic photograph of a futuristic cyberpunk city street during a heavy rainstorm with vibrant neon signage reflections."
- "Create a 5-second video from this image: [URL] with a slow camera zoom-in effect and a shallow depth of field to emphasize the subject."
- "Visualize a cozy, rustic cabin in the woods during autumn, ultra-detailed style with warm sunlight filtering through the trees."
Tips & Limitations
- Image Quality: For best results with Gemini 3 Pro Image, write descriptive prompts that specify style, lighting, and composition.
- Video Handling: The Wan 2.6 video generator operates asynchronously. After initiating a video task, always save your Task ID and use the status polling command to retrieve the output once processing is complete.
- Resource Management: Large video generation tasks consume significant API quota; monitor your usage via the AIsa dashboard.
- Network Dependency: This skill relies entirely on external connectivity to the AIsa gateway. Ensure your firewall allows outbound requests to api.aisa.one.
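The asynchronous pattern described under Video Handling (start a task, keep the Task ID, poll until done) can be sketched as a generic polling loop. This is an illustration under stated assumptions, not the skill's actual command or the AIsa API: fetch_status is a hypothetical callable you would supply, and the "state" field with the values "succeeded" and "failed" is invented for the example.

```python
import time
from typing import Any, Callable, Dict


def wait_for_task(
    task_id: str,
    fetch_status: Callable[[str], Dict[str, Any]],
    interval: float = 5.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll fetch_status(task_id) until the task finishes or the timeout expires.

    fetch_status is a placeholder for whatever status call the skill exposes.
    The "state" key and its values are assumptions made for this sketch.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status(task_id)
        state = status.get("state")
        if state == "succeeded":
            return status
        if state == "failed":
            raise RuntimeError(f"task {task_id} failed: {status}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} not finished after {timeout}s")
        # Wait before the next poll to avoid spending quota on status checks.
        sleep(interval)
```

Passing the sleep function as a parameter keeps the loop testable without real delays; in normal use the default time.sleep applies.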
Metadata
Paste this into your clawhub.json to enable this plugin:
{
  "plugins": {
    "official-bowen-dotcom-aisa-media-gen-skill": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags(AI)
Flags: external-api, file-write
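If clawhub.json already lists other plugins, pasting the fragment by hand risks overwriting them. The sketch below merges the plugin entry into an existing configuration instead. It assumes only the structure shown in the fragment above (a top-level "plugins" object keyed by plugin ID); the function names are hypothetical and any other keys in the file are left untouched.

```python
import json
from pathlib import Path
from typing import Any, Dict

PLUGIN_ID = "official-bowen-dotcom-aisa-media-gen-skill"


def enable_plugin(config: Dict[str, Any], plugin_id: str = PLUGIN_ID) -> Dict[str, Any]:
    """Return a copy of config with the plugin enabled, keeping other entries."""
    merged = dict(config)
    plugins = dict(merged.get("plugins", {}))
    entry = dict(plugins.get(plugin_id, {}))
    entry.update({"enabled": True, "auto_update": True})
    plugins[plugin_id] = entry
    merged["plugins"] = plugins
    return merged


def update_config_file(path: str) -> None:
    """Read the JSON file at path (if it exists), enable the plugin, write it back."""
    p = Path(path)
    config = json.loads(p.read_text(encoding="utf-8")) if p.exists() else {}
    p.write_text(json.dumps(enable_plugin(config), indent=2) + "\n", encoding="utf-8")
```

For example, update_config_file("clawhub.json") would update the file in the current directory; the actual location of clawhub.json depends on your setup.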
Related Skills
Twitter Command Center (Search + Post)
Searches and reads X (Twitter): profiles, timelines, mentions, followers, tweet search, trends, lists, communities, and Spaces. Publishes posts after the user completes OAuth in the browser. Use when the user asks about Twitter/X data, social listening, or posting without sharing account passwords.
perplexity-search
Perplexity Sonar search and answer generation through AIsa. Use when the task is specifically to call Perplexity Sonar, Sonar Pro, Sonar Reasoning Pro, or Sonar Deep Research for citation-backed web answers, analytical reasoning, or long-form research reports.
openclaw-youtube
YouTube SERP Scout for agents. Search top-ranking videos, channels, and trends for content research and competitor tracking.
openclaw-search
Intelligent search for agents. Multi-source retrieval across web, scholar, Tavily, and Perplexity Sonar models.
MarketPulse
Query real-time and historical financial data across equities and crypto—prices, market moves, metrics, and trends for analysis, alerts, and reporting.