image2prompt
Analyze images and generate detailed prompts for image generation. Supports portrait, landscape, product, animal, illustration categories with structured or natural output.
Why use this skill?
Convert any image into high-quality, detailed AI generation prompts. Automatically detect portraits, landscapes, and products to reproduce stunning visual results.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/zhang-shubo/image2promptWhat This Skill Does
The image2prompt skill for OpenClaw is a powerful visual analysis engine designed to bridge the gap between existing imagery and generative AI. By leveraging advanced computer vision, the tool deconstructs uploaded images into granular, descriptive prompts. It automatically detects the category of the image—whether it is a studio portrait, a sprawling landscape, a commercial product shot, an animal photo, or a technical illustration—and tailors the output accordingly. It goes beyond simple captioning by providing structured metadata, technical camera settings, and stylistic nuance required to reproduce high-quality AI generations that mirror the source material's artistic and technical intent.
Installation
To integrate this skill into your local OpenClaw environment, execute the following command in your terminal: clawhub install openclaw/skills/skills/zhang-shubo/image2prompt Ensure that you have the latest version of OpenClaw installed to maintain compatibility with the image processing backend.
Use Cases
This skill is ideal for content creators looking to reverse-engineer aesthetic styles, graphic designers needing to recreate specific product staging, and AI artists aiming for consistency across their image library. It is especially useful for photographers who want to document their unique lighting setups or for marketing teams who need to generate variations of brand assets without recreating the entire shoot from scratch.
Example Prompts
- "Analyze this image and write a detailed, flowing prompt description. I want a 800-word paragraph focused on the lighting, texture of the fabric, and the specific camera lens effect used in this portrait."
- "Analyze this image with dimension extraction. Tag phrases for: backgrounds, objects, lighting, and composition, then output as a structured JSON object."
- "Analyze this product image and output a prompt suitable for Midjourney that captures the studio lighting, bokeh background, and the commercial aesthetic of the bottle."
Tips & Limitations
The image2prompt skill performs best with high-resolution images. While it is highly accurate at capturing composition and style, specific proprietary branding or text inside images may require additional manual refinement. For the best results in reproduction, combine the 'dimension extraction' feature with your generation tool's specific parameter requirements. Note that this skill reads the files locally or via provided paths; ensure permissions are set correctly to allow the agent read access to your image directory.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-zhang-shubo-image2prompt": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read