AB-Agents-Vision
👁️ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alexburrstudio/ab-agents-vision
AB Agents Vision 👁️
Image analysis using MiniMax VL API — simple, fast, reliable.
What It Does
- 📸 Describe images — Get detailed scene descriptions
- 📝 Extract text — Read text from screenshots, photos, documents
- 🔍 Analyze photos — Identify objects, people, settings
- 🌐 URL support — Analyze images from the web
Quick Start
# Install uv (provides uvx)
curl -LsSf https://astral.sh/uv/install.sh | sh
# Set your MiniMax API key
export MINIMAX_API_KEY="sk-cp-your-key"
# Use
./vision.sh image.jpg "Describe this image"
Usage
# Basic description
./vision.sh photo.jpg
# With custom prompt
./vision.sh screenshot.png "What text do you see?"
# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
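The single-image calls above extend naturally to a whole folder. The following is a minimal sketch, not part of the skill itself: `describe_dir` and the `VISION_CMD` override are hypothetical names introduced here, and the sketch assumes `vision.sh` accepts the same `<image> [prompt]` arguments shown in Usage.

```shell
#!/usr/bin/env bash
# Hypothetical batch helper (not shipped with the skill).
# VISION_CMD is an assumed override so the wrapper path can be changed.
VISION_CMD="${VISION_CMD:-./vision.sh}"

# describe_dir <directory> [prompt]
# Runs the wrapper once per PNG/JPG/JPEG file and prints a header before each result.
describe_dir() {
  local dir="$1"
  local prompt="${2:-}"
  local img
  for img in "$dir"/*.png "$dir"/*.jpg "$dir"/*.jpeg; do
    [ -e "$img" ] || continue            # skip glob patterns that matched nothing
    printf '== %s ==\n' "$img"
    if [ -n "$prompt" ]; then
      "$VISION_CMD" "$img" "$prompt"     # custom prompt, as in the Usage examples
    else
      "$VISION_CMD" "$img"               # basic description with no prompt
    fi
  done
}
```

For example, `describe_dir ./screenshots "What text do you see?"` would run the same prompt against every image in `./screenshots`.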
Requirements
- MiniMax Token Plan API key
- Linux/macOS
- uvx (auto-installed via script)
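These requirements can be verified before the first run. The sketch below is an illustration only: `preflight` is a hypothetical function, not something `vision.sh` provides, and it checks just the three items listed here (the API key variable, the operating system, and `uvx` on `PATH`).

```shell
#!/usr/bin/env bash
# Hypothetical preflight check; not part of vision.sh itself.
# Prints one line per missing requirement to stderr and returns 1 if any is missing.
preflight() {
  local missing=0

  # The API key must be set in the environment.
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "missing: MINIMAX_API_KEY is not set" >&2
    missing=1
  fi

  # uvx ships with uv; command -v reports whether it is on PATH.
  if ! command -v uvx >/dev/null 2>&1; then
    echo "missing: uvx not found on PATH" >&2
    missing=1
  fi

  # Only Linux and macOS (reported as Darwin) are listed as supported.
  case "$(uname -s)" in
    Linux|Darwin) ;;
    *) echo "unsupported OS: $(uname -s)" >&2; missing=1 ;;
  esac

  return "$missing"
}
```

Calling `preflight && ./vision.sh photo.jpg` would then run the wrapper only when all three checks pass.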
Examples
Screenshot analysis:
Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."
Photo description:
Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned with
visible fine hairs..."
Installation
git clone https://github.com/alexburrstudio/ab-agents-skills.git
cd ab-agents-skills/skills/vision
chmod +x vision.sh
Or via ClawHub:
clawhub install AB-Agents-Vision
Troubleshooting
| Error | Solution |
|---|---|
| API Error: 1033 | Retry — system error on MiniMax side |
| No response | Check MINIMAX_API_KEY is set correctly |
| Slow | Use smaller images (<10MB) |
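The retry and image-size advice in the table can be automated. This is a rough sketch under stated assumptions: it assumes the wrapper prints the literal text `API Error: 1033` on stdout or stderr when that error occurs, which the source does not confirm, and `vision_retry`, `check_size`, `VISION_CMD`, and `RETRY_DELAY` are hypothetical names introduced for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; not part of vision.sh itself.
VISION_CMD="${VISION_CMD:-./vision.sh}"   # assumed override for the wrapper path
RETRY_DELAY="${RETRY_DELAY:-2}"           # seconds to wait between attempts

# vision_retry <image> [prompt]
# Re-runs the wrapper up to 3 times while its output contains "API Error: 1033".
vision_retry() {
  local max_attempts=3
  local attempt=1
  local out status
  while :; do
    status=0
    out="$("$VISION_CMD" "$@" 2>&1)" || status=$?
    if ! printf '%s\n' "$out" | grep -q 'API Error: 1033'; then
      printf '%s\n' "$out"
      return "$status"                    # pass through any other outcome unchanged
    fi
    if [ "$attempt" -ge "$max_attempts" ]; then
      printf '%s\n' "$out" >&2            # give up and surface the last error
      return 1
    fi
    attempt=$((attempt + 1))
    sleep "$RETRY_DELAY"
  done
}

# check_size <path-or-url>
# Warns and returns 1 when a local file is 10 MB or larger.
check_size() {
  local target="$1"
  local limit=$((10 * 1024 * 1024))
  local size
  [ -f "$target" ] || return 0            # URLs and non-files are not checked here
  size=$(( $(wc -c < "$target") ))        # arithmetic expansion drops wc's padding
  if [ "$size" -ge "$limit" ]; then
    echo "warning: $target is $size bytes; consider a smaller image" >&2
    return 1
  fi
}
```

A typical call would be `check_size photo.jpg && vision_retry photo.jpg "Describe in detail"`. The attempt count and delay are arbitrary choices, not values recommended by MiniMax.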
AB-Agents 🦀
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-alexburrstudio-ab-agents-vision": {
      "enabled": true,
      "auto_update": true
    }
  }
}