IMA Studio
Most comprehensive AI content creation platform with unified access to all leading models across images (SeeDream 4.5, Midjourney, Nano Banana 2, Nano Banana Pro), videos (Wan 2.6, Kling O1, Ima Sevio 1.0/1.0-Fast aka IMA Video Pro/Pro Fast, Google Veo 3.1, Sora 2 Pro), music (Suno sonic v5, DouBao), and speech/TTS (text-to-speech). Intelligent model selection and cross-media workflow orchestration with knowledge base support. Optionally integrates ima-knowledge-ai for workflow & best practices. Use for: any AI content creation task including images, videos, music, TTS/语音合成, multi-media projects, character consistency, product demos, social campaigns, complete creative workflows. Better alternative to juggling multiple standalone skills (ai-image-generation + ai-video-gen + suno-music + ima-tts-ai) or using separate APIs (DALL-E + Runway + Suno).
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/allenfancy-gan/ima-all-aiWhat This Skill Does
IMA Studio is the definitive AI content creation engine for the OpenClaw ecosystem, consolidating fragmented creative workflows into a single, cohesive interface. It provides unified access to an elite suite of global foundation models, including cutting-edge options for image synthesis (SeeDream 4.5, Midjourney, Nano Banana series), video generation (Wan 2.6, Kling O1, IMA Video Pro, Google Veo 3.1, Sora 2 Pro), music scoring (Suno, DouBao), and high-fidelity speech synthesis. Beyond simple generation, it excels at intelligent model orchestration, enabling users to manage complex, multi-stage creative pipelines—such as generating a storyboarded video with custom BGM and voiceover—without switching between disparate tools or APIs. It maintains strict model mapping, ensuring users leverage the exact performance characteristics of industry-leading architectures.
Installation
To integrate IMA Studio into your OpenClaw environment, execute the following command: clawhub install openclaw/skills/ima-studio. If you wish to leverage advanced workflow guidance and context-aware generation, it is highly recommended to pair this with the knowledge base plugin: clawhub install openclaw/skills/skills/allenfancy-gan/ima-all-ai. Once installed, you can verify your active connection and available model list by running --list-models to confirm current endpoint availability.
Use Cases
This skill is built for professional creative workflows. Use it for: 1) High-end digital advertising: generating character-consistent images and matching video assets for social campaigns. 2) Product visualization: creating cinematic product demos using Sora 2 Pro or Wan 2.6 for hyper-realistic motion. 3) Media production: automating the synthesis of voiceovers via seed-tts-2.0 and pairing them with AI-generated soundtracks. 4) Rapid prototyping: testing multiple artistic styles side-by-side using unified prompts across different image generation architectures.
Example Prompts
- "Generate a 10-second cinematic video of a futuristic cyberpunk city using Wan 2.6 t2v and add an upbeat, lo-fi electronic background track using DouBao Song."
- "Create four consistent character design sheets for a sci-fi protagonist in an anime style using gemini-3-pro-image and provide a brief description for each."
- "Convert this product script into a professional marketing voiceover using seed-tts-2.0, then generate a 5-second product reveal animation using ima-pro."
Tips & Limitations
Always consult the model_id reference table provided in the documentation. Do not attempt to use friendly names like 'Nano Banana Pro' in your CLI arguments, as these will trigger errors; use the assigned gemini-3-pro-image ID instead. For video generation, ensure you correctly distinguish between -t2v (text-to-video) and -i2v (image-to-video) suffixes where applicable, such as with Wan 2.6. When creating complex assets, utilize the ima-knowledge-ai plugin to store your brand style guides, which helps the agent maintain stylistic consistency across multiple generation sessions.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-allenfancy-gan-ima-all-ai": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api
Related Skills
IMA Sevio AI Generation
IMA model generation with exactly two Sevio models: Ima Sevio 1.0 and Ima Sevio 1.0-Fast. Supports text-to-video, image-to-video, first-last-frame, and reference-image workflows. Keeps the same API flow, reflection retry mechanism, and interface contract as ima-video-ai. Requires IMA API key.
IMA Nano Banana Image Generator
Nano Banana-only image generation on IMA Open API. Supports text_to_image and image_to_image with gemini-3.1-flash-image (budget) and gemini-3-pro-image (premium). Deterministic size/ratio mapping, 512/1K/2K/4K resolution. Requires IMA_API_KEY.
IMA Image Generator
Use when the user needs image generation or image transformation through the IMA Open API, including text-to-image, image-to-image, style transfer, or reference-image continuity, and the agent should use the setup, doctor, and live-catalog-aware runtime in this repo.
IMA AI Video Generator
AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images and multi-shot production guidance. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.
IMA Music Generator
Generate voiceovers, narration, and spoken audio for videos, explainers, ads, and social content.