image-model-evaluation
评估图像生成模型的效果。对指定模型进行全面的文生图、图生图测试,包括不同参数、不同提示词、人物一致性等测试项,生成详细的HTML测试报告。当用户想要测试、评估、对比图像模型效果时使用此 skill。
Why use this skill?
Use the OpenClaw image-model-evaluation skill to comprehensively test and benchmark image generation models. Get detailed HTML reports on T2I, I2I, and character consistency.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/hexiaochun/image-model-evaluationWhat This Skill Does
The image-model-evaluation skill is a professional-grade testing suite designed to assess the performance, consistency, and reliability of image generation models. It provides a structured framework to evaluate how well a model handles text-to-image (T2I) and image-to-image (I2I) tasks. By conducting systematic tests—ranging from stylistic variations and aspect ratios to complex, multi-variable character consistency—the skill generates comprehensive, easy-to-read HTML reports. It serves as an essential tool for developers and content creators who need empirical data to choose the right AI model for their projects, ensuring that the chosen model delivers expected output quality and stable character preservation under diverse conditions.
Installation
To integrate this skill into your OpenClaw agent, use the following installation command in your terminal:
clawhub install openclaw/skills/skills/hexiaochun/image-model-evaluation
Use Cases
- Model Selection: Compare different models like Stable Diffusion, Flux, or custom enterprise models to decide which fits your production pipeline.
- Consistency Audit: Verify that character features (facial structure, hair, body proportions) are preserved when generating images across different poses, lighting, and environments.
- Quality Assurance: Automated regression testing to ensure that model updates or fine-tuning haven't negatively impacted specific stylistic or structural capabilities.
- API Benchmarking: Analyze generation speed and success rates to estimate production costs and latency.
Example Prompts
- "帮我测试一下 jimeng-4.5 模型的效果,我想了解它的文生图表现。"
- "执行一次完整的评估,看看 Stable Diffusion XL 在人物一致性测试中的表现如何。"
- "对 mi-journey-v6 进行快速测试,检查它在写实和动漫风格转换上的准确度。"
Tips & Limitations
- Concurrency: To maintain stability, the skill caps parallel API requests at 4. Please wait for the process to complete to avoid hitting rate limits.
- Timeouts: Individual tasks have a maximum timeout of 120 seconds. If your model response is slow, consider adjusting your network settings or checking model availability.
- Cost Awareness: Always review the cost estimate provided by the agent before confirming the test execution. Full testing involves 31 distinct scenarios and may incur significant API usage costs.
- Error Recovery: If a specific test fails due to network spikes, the tool allows for selective retries rather than restarting the entire batch.
- Data Privacy: Ensure that any input images used for testing do not contain sensitive personal or proprietary information, as these are processed through the model evaluation pipeline.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-hexiaochun-image-model-evaluation": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api
Related Skills
水浒传故事小人书
水浒传故事小人书创建。使用 Nano Banana Pro 模型生成手绘卡通风格的水浒传故事信息图。当用户想要创建水浒传故事插画、小人书、信息图时使用此 skill。
style-extractor
从参考剧本或参考素材中提取统一风格锚点(STYLE_BASE),确保全剧视觉一致性。当需要匹配参考风格、提取画风、建立风格基准、生成风格资产包时使用。
视频链接解析
解析视频分享链接,获取无水印视频下载地址。当用户想要下载视频、解析抖音/快手/小红书/B站链接、获取无水印视频时使用此 skill。
vidu-video
使用 Vidu Q3 Pro 模型生成视频。当用户想要文生视频、生成带音频的视频,或提到 vidu 时使用此 skill。
character-creator
创建AI角色的完整流程,包括生成详细角色描述、文生图肖像和多角度参考图。使用即梦4.5模型。当用户要求创建角色、生成人物立绘、制作角色参考图、或需要多角度人物图时使用此技能。