ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

openclaw-media-gen

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/aisadocs/openclaw-aisa-llm-image-video
Or

What This Skill Does

The OpenClaw Media Gen skill acts as a unified gateway to advanced multimodal AI generation. By leveraging the AIsa API ecosystem, it allows you to generate high-fidelity images using Google's Gemini 3 Pro model and cinematic videos using Alibaba's state-of-the-art Qwen Wan 2.6 (Tongyi Wanxiang) model. This skill simplifies the workflow by utilizing a single API key to bridge the gap between static image generation and dynamic video synthesis, enabling users to transform descriptive text into visual media effortlessly.

Installation

To integrate this skill into your OpenClaw environment, execute the following command in your terminal:

clawhub install openclaw/skills/skills/aisadocs/openclaw-aisa-llm-image-video

Ensure your AISA_API_KEY is configured as an environment variable before running any generation tasks to ensure the agent has proper authorization to access the AIsa endpoints.

Use Cases

This skill is designed for creators, developers, and researchers who need rapid visual prototyping. Use cases include:

  • Marketing & Content Creation: Generate consistent social media assets and short-form video clips from simple text prompts.
  • Concept Art: Quickly iterate on character designs or environmental textures using the Gemini image model.
  • Cinematic Storyboarding: Turn static concepts into short video sequences using the Wan 2.6 video model, allowing for deeper exploration of movement and composition.
  • Automated Workflow Integration: Embed media generation directly into your CLI-based development or automation pipelines.

Example Prompts

  1. "Generate a hyper-realistic photograph of a futuristic coffee shop in a neon-lit Tokyo street, cinematic lighting, 8k resolution."
  2. "Create a 5-second video from this image URL [https://url.com/image.jpg]: camera movement, slow zoom in on the subject, dramatic atmospheric fog, movie grade."
  3. "Draft a visual concept for a steampunk-style flying machine, then synthesize a short 5-second video showcasing it soaring through clouds."

Tips & Limitations

  • Asynchronous Processing: Video generation is an asynchronous task. Always save your task_id returned from the initial call and use the video-status command to check progress before expecting a result.
  • Image-to-Video Requirements: The video model performs best with high-quality source images. Ensure the provided img_url is publicly accessible and clear.
  • Cost Management: Be mindful of your API quota. Both Gemini 3 Pro and Qwen Wan 2.6 are premium models; frequent generation will consume your AIsa credits rapidly.
  • Error Handling: Always ensure network connectivity to the AIsa API endpoints (https://api.aisa.one) to avoid task initiation failures.

Metadata

Author@aisadocs
Stars4473
Views2
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-aisadocs-openclaw-aisa-llm-image-video": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#ai-video#image-generation#multimodal#gemini#qwen
Safety Score: 4/5

Flags: external-api, file-write