Official Verified media Safety 4/5

openclaw-media-gen

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/0xjordansg-yolo/openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana

What This Skill Does

The openclaw-media-gen skill provides a single command-line interface to the AIsa API for both image and video generation. It uses Gemini 3 Pro Image (gemini-3-pro-image-preview) for text-to-image synthesis and Qwen Wan 2.6 (wan2.6-t2v) for text-to-video generation. The tool handles authentication, request structure, and asynchronous task management for video rendering, so you do not have to call the API directly.

Installation

To integrate this skill into your OpenClaw environment, first make sure the OpenClaw CLI tool is installed. Then run the following command in your terminal:

clawhub install openclaw/skills/skills/0xjordansg-yolo/openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana

Once the skill is installed, set the AISA_API_KEY environment variable to your API key: export AISA_API_KEY="your-key". The skill uses this key for both the image generation endpoints and the asynchronous video synthesis services.
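A script that calls the API would typically read this variable and fail early when it is missing. The following is a minimal sketch of that check, not the skill's actual code; the function name auth_headers is hypothetical, and the Bearer authorization scheme is an assumption that should be verified against the AIsa documentation.

```python
import os


def auth_headers(env=None):
    """Build request headers from AISA_API_KEY, failing early if it is unset."""
    env = os.environ if env is None else env
    key = env.get("AISA_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            'AISA_API_KEY is not set; run: export AISA_API_KEY="your-key"'
        )
    # Assumption: the API accepts the key as a Bearer token.
    return {"Authorization": f"Bearer {key}"}
```

Passing a dictionary instead of relying on os.environ makes the check easy to exercise without touching the real environment.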

Use Cases

This skill suits content creators, developers, and automation engineers who want to automate visual asset production. You can use it to build rapid prototyping pipelines for social media content, create assets for digital storytelling, or integrate generative media into larger autonomous workflows. Because video generation relies on asynchronous polling, the skill also fits batch-processing workflows in which multiple videos are queued for generation.
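The batch pattern described above can be sketched as a loop that polls every queued task until each one finishes or a deadline passes. This is an illustration under assumptions: get_status stands in for whatever call returns a task's state (for example, a wrapper around the video-status command), and the status strings SUCCEEDED, FAILED, and TIMEOUT are hypothetical placeholders rather than documented AIsa values.

```python
import time

# Assumption: these are the terminal states; the real API may use other names.
TERMINAL = {"SUCCEEDED", "FAILED"}


def poll_until_done(task_ids, get_status, interval_s=5.0, timeout_s=600.0,
                    sleep=time.sleep):
    """Poll each pending task until it is terminal or the timeout expires."""
    pending = set(task_ids)
    results = {}
    deadline = time.monotonic() + timeout_s
    while pending and time.monotonic() < deadline:
        for task_id in sorted(pending):
            status = get_status(task_id)
            if status in TERMINAL:
                results[task_id] = status
        # Drop finished tasks so they are not polled again.
        pending.difference_update(results)
        if pending:
            sleep(interval_s)
    # Anything still pending when the deadline passes is reported as a timeout.
    for task_id in pending:
        results[task_id] = "TIMEOUT"
    return results
```

Polling all tasks in one loop keeps a single wait interval for the whole batch instead of blocking on each video in turn.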

Example Prompts

"Generate a high-quality image of a cyberpunk city street at night, with neon reflections on wet pavement, cinematic lighting, 8k resolution."
"Create a 5-second video from this image: https://example.com/character.jpg, using the prompt 'the character turns their head slowly and smiles, soft studio lighting'."
"Check the status of my recent video generation request using the task ID: 12345-abc-67890."

Tips & Limitations

Video Latency: Wan 2.6 generation is asynchronous. Use the video-status command to check the progress of your task rather than expecting an immediate response.
Image Source: When generating video, the quality of your input img_url significantly influences the final result. High-resolution images with a clear subject perform best.
API Limits: Ensure your AIsa account has sufficient credits, as video generation via Wan 2.6 is compute-intensive and consumes more resources than static image generation.
Error Handling: The script automates base64 decoding for images; ensure your local environment has write permissions to the directory specified in the --out parameter.
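The base64 step mentioned under Error Handling can be sketched as follows. This is an illustration only, not the skill's own script: the function name save_base64_image is hypothetical, and it assumes the image arrives as a plain base64 string. A PermissionError from the write surfaces directly when the target directory is not writable.

```python
import base64
import binascii
from pathlib import Path


def save_base64_image(b64_data, out_path):
    """Decode a base64 image payload and write it to out_path."""
    try:
        raw = base64.b64decode(b64_data, validate=True)
    except (binascii.Error, ValueError) as exc:
        raise ValueError("payload is not valid base64 image data") from exc
    path = Path(out_path)
    # Create the output directory if needed; raises PermissionError
    # when the location is not writable.
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(raw)
    return path
```

Decoding before touching the filesystem means a malformed payload never leaves an empty or partial file behind.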

Metadata

Author: @0xjordansg-yolo

Stars: 4473

Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-0xjordansg-yolo-openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana": {
      "enabled": true,
      "auto_update": true
    }
  }
}
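If clawhub.json already lists other plugins, pasting the snippet over the file would drop them. The sketch below shows one way to merge the entry instead; it assumes the file is plain JSON with a top-level "plugins" object, as in the snippet above. The helper name enable_plugin and the entry "some-other-plugin" are hypothetical.

```python
import copy
import json

PLUGIN_ID = (
    "official-0xjordansg-yolo-openclaw-aisa-image-video-models-"
    "wan2-6-gemini-3-pro-image-nano-banana"
)


def enable_plugin(config, plugin_id=PLUGIN_ID):
    """Return a copy of config with the plugin enabled, keeping other entries."""
    merged = copy.deepcopy(config)
    plugins = merged.setdefault("plugins", {})
    plugins[plugin_id] = {"enabled": True, "auto_update": True}
    return merged


if __name__ == "__main__":
    # Hypothetical existing configuration with one unrelated plugin.
    existing = {"plugins": {"some-other-plugin": {"enabled": True}}}
    print(json.dumps(enable_plugin(existing), indent=2))
```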

Tags(AI)

#generative-ai #video-gen #image-gen #automation #api-integration

Safety Score: 4/5

Flags: external-api, file-write

Related Skills

Twitter Command Center (Search + Post)

Searches and reads X (Twitter): profiles, timelines, mentions, followers, tweet search, trends, lists, communities, and Spaces. Publishes posts after the user completes OAuth in the browser. Use when the user asks about Twitter/X data, social listening, or posting without sharing account passwords.

0xjordansg-yolo 4473

youtube-search

YouTube Search API via AIsa unified endpoint. Search YouTube videos, channels, and playlists with a single AIsa API key — no Google API key or OAuth required. Use this skill when users want to search YouTube content. For other AIsa capabilities (LLM, financial data, Twitter, web search), see the aisa-core skill.

0xjordansg-yolo 4473

llm-router

Unified LLM Gateway - One API for 70+ AI models. Route to GPT, Claude, Gemini, Qwen, Deepseek, Grok and more with a single API key.

0xjordansg-yolo 4473

aisa-tavily

AI-optimized web search via AIsa's Tavily API proxy. Returns concise, relevant results for AI agents through AIsa's unified API gateway.

0xjordansg-yolo 4473

openclaw-search

Intelligent search for agents. Multi-source retrieval with confidence scoring - web, academic, and Tavily in one unified API.

0xjordansg-yolo 4473