ai-image-generation
Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image
Why use this skill?
Generate stunning AI images using FLUX, Gemini, Grok & more. Supports text-to-image, editing, upscaling & 50+ models via inference.sh CLI.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/okaris/ai-image-generationWhat This Skill Does
The AI Image Generation skill leverages the inference.sh CLI to provide access to a vast array of over 50 AI models for generating and editing images. This powerful tool enables users to create stunning visuals through text-to-image, image-to-image transformations, inpainting, and advanced editing features. It supports cutting-edge models like FLUX, Gemini, Grok, Seedream, and Reve, offering capabilities such as LoRA integration, image upscaling, and precise text rendering. Whether you're an artist, marketer, or designer, this skill is your gateway to creating high-quality AI art, product mockups, concept art, social media graphics, and compelling illustrations with unparalleled flexibility and quality.
Installation
To begin using the AI Image Generation skill, you first need to install the inference.sh CLI. Follow these steps:
-
Install the CLI: Open your terminal and run the following command:
curl -fsSL https://cli.inference.sh | sh && infsh loginThis script automatically detects your operating system and architecture, downloads the appropriate binary, and verifies its integrity. No elevated permissions or background processes are required.
-
Login: After installation, run
infsh loginto authenticate your CLI.
Once the CLI is set up, you can immediately start generating images by specifying the desired model and input parameters.
Use Cases
This skill is incredibly versatile and can be applied to a wide range of creative and professional needs:
- AI Art Generation: Create unique and artistic images from textual descriptions.
- Product Mockups: Visualize product designs in various settings and styles.
- Concept Art: Develop visual concepts for games, films, or other projects.
- Social Media Graphics: Design eye-catching visuals for social media campaigns.
- Marketing Visuals: Generate compelling imagery for advertisements and promotional materials.
- Illustrations: Create custom illustrations for websites, articles, or books.
- Image Editing & Enhancement: Utilize features like inpainting for seamless object removal or addition, and upscaling for professional-grade image resolution.
- Text Rendering: Generate images with specific text incorporated, thanks to models like Reve and Seedream 3.0.
Example Prompts
Here are some examples of how you can use the AI Image Generation skill:
generate image with falai/flux-dev-lora: a hyperrealistic portrait of a robot reading a book in a cozy librarycreate image with xai/grok-imagine-image: a futuristic cityscape with flying cars, aspect ratio 16:9ai art with bytedance/seedream-4-5: a majestic dragon soaring over a snow-capped mountain range, 4k quality
Tips & Limitations
- Explore Models: With over 50 models available, experiment with different ones to find the best results for your specific needs. Check the model descriptions in the
inference.shdocumentation for guidance. - Prompt Engineering: The quality of your generated image heavily depends on the prompt. Be descriptive and specific. Consider including details about style, lighting, composition, and subject matter.
- Aspect Ratios: Some models support specific aspect ratios. Check the model documentation or examples for how to specify this (e.g.,
"aspect_ratio": "16:9"). - Parameters: Different models accept various parameters beyond just the prompt. Refer to the
inference.shCLI documentation for detailed parameter options for each app. - Image Input: For tasks like image-to-image or inpainting, you'll need to provide a
image_url. Ensure the URL is publicly accessible. - Rate Limits & Costs: Be aware of potential rate limits or costs associated with using certain models or the
inference.shservice, as outlined on their platform. - Iterative Refinement: Don't expect perfection on the first try. Generate multiple images and refine your prompts based on the results.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-okaris-ai-image-generation": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api, code-execution
Related Skills
content-repurposing
Content atomization — turn one piece of content into many formats. Covers blog-to-thread, blog-to-carousel, podcast-to-blog, video-to-quotes, and more. Use for: content marketing, social media, multi-platform distribution, content strategy. Triggers: content repurposing, repurpose content, content atomization, content recycling, one to many content, multi platform content, cross post, adapt content, reformat content, blog to thread, blog to video, podcast to blog, content multiplication
product-changelog
Product changelog and release notes that users actually read. Covers categorization, user-facing language, visuals, and distribution. Use for: release notes, changelogs, product updates, feature announcements, versioning. Triggers: changelog, release notes, product update, version notes, what's new, feature announcement, product changelog, update log, release announcement, version release, product release, ship notes
logo-design-guide
Logo design principles and AI image generation best practices for creating logos. Covers logo types, prompting techniques, scalability rules, and iteration workflows. Use for: brand identity, startup logos, app icons, favicons, logo concepts. Triggers: logo design, create logo, brand logo, logo generation, ai logo, logo maker, icon design, brand mark, logo concept, startup logo, app icon logo
product-photography
AI product photography with studio lighting, lifestyle shots, and packshot conventions. Covers angles, backgrounds, shadow types, hero shots, and e-commerce image requirements. Use for: product photos, e-commerce images, Amazon listings, packshots, lifestyle photography. Triggers: product photography, product photo, packshot, e-commerce photography, product shot, product image, studio photography, lifestyle product, amazon product photo, product listing image, hero shot, product mockup, commercial photography
newsletter-curation
Newsletter curation with content sourcing, editorial structure, and subscriber growth strategies. Covers issue formatting, link roundups, commentary style, and sending cadence. Use for: email newsletters, link roundups, weekly digests, curated content, creator newsletters. Triggers: newsletter, email newsletter, newsletter curation, weekly digest, link roundup, curated newsletter, newsletter writing, newsletter format, subscriber growth, newsletter strategy, content curation, newsletter template