AB-Agents-Vision

👁️ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alexburrstudio/ab-agents-vision

AB Agents Vision 👁️

Image analysis using MiniMax VL API — simple, fast, reliable.

What It Does

📸 Describe images — Get detailed scene descriptions
📝 Extract text — Read text from screenshots, photos, documents
🔍 Analyze photos — Identify objects, people, settings
🌐 URL support — Analyze images from the web

Quick Start

# Install uv (provides uvx)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Set your MiniMax API key
export MINIMAX_API_KEY="sk-cp-your-key"

# Use
./vision.sh image.jpg "Describe this image"

Usage

# Basic description
./vision.sh photo.jpg

# With custom prompt
./vision.sh screenshot.png "What text do you see?"

# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
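
The same call shape can be looped over a folder of images. The helper below is a minimal sketch, not part of the skill: `vision_batch` and the `VISION_CMD` override are hypothetical names, and it assumes only the `./vision.sh <image> [prompt]` form shown above.

```shell
# Hypothetical batch helper built on the documented CLI shape:
#   ./vision.sh <image-or-url> [prompt]
# VISION_CMD is an assumed override for pointing the loop at another path.
vision_batch() {
  local dir="$1"
  local prompt="${2:-Describe this image}"
  local cmd="${VISION_CMD:-./vision.sh}"
  local img

  for img in "$dir"/*.jpg "$dir"/*.png; do
    # Skip glob patterns that matched no file.
    [ -e "$img" ] || continue
    echo "== $img"
    "$cmd" "$img" "$prompt"
  done
}
```

For example, `vision_batch ./screenshots "What text do you see?"` would print a `== <path>` header followed by the wrapper's output for each `.jpg` and `.png` file in `./screenshots`.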

Requirements

MiniMax Token Plan API key
Linux/macOS
uvx (auto-installed via script)
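
These requirements can be checked before the first call. The function below is a rough sketch under stated assumptions: `vision_preflight` is a hypothetical name, not something shipped with the skill, and it only checks that `MINIMAX_API_KEY` is non-empty and that `uvx` is on the `PATH`.

```shell
# Hypothetical preflight helper; not part of vision.sh itself.
# Prints one line per problem found and returns non-zero if any check fails.
vision_preflight() {
  local problems=0

  # The API key must be exported before vision.sh is called.
  if [ -z "${MINIMAX_API_KEY:-}" ]; then
    echo "MINIMAX_API_KEY is not set"
    problems=$((problems + 1))
  fi

  # uvx is installed alongside uv by the Quick Start curl command.
  if ! command -v uvx >/dev/null 2>&1; then
    echo "uvx not found on PATH"
    problems=$((problems + 1))
  fi

  [ "$problems" -eq 0 ]
}
```

A call such as `vision_preflight && ./vision.sh photo.jpg` would then run the wrapper only when both checks pass. Whether the key itself is valid can only be confirmed by an actual API call.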

Examples

Screenshot analysis:

Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."

Photo description:

Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned with
visible fine hairs..."

Installation

git clone https://github.com/alexburrstudio/ab-agents-skills.git
cd ab-agents-skills/skills/vision
chmod +x vision.sh

Or via ClawHub:

clawhub install AB-Agents-Vision

Troubleshooting

Error	Solution
API Error: 1033	Retry; this is a system error on the MiniMax side
No response	Check that MINIMAX_API_KEY is set correctly
Slow	Use smaller images (<10 MB)
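
The retry advice for error 1033 can be automated with a small wrapper. This is a sketch only: `vision_retry`, `VISION_RETRIES`, and `VISION_RETRY_DELAY` are hypothetical names, and it assumes the failure shows up as the text "API Error: 1033" in the command's output, since the exact output format and exit codes of vision.sh are not documented here.

```shell
# Hypothetical retry wrapper; assumes the error appears as the literal text
# "API Error: 1033" somewhere in the wrapped command's output.
vision_retry() {
  local max_attempts="${VISION_RETRIES:-3}"
  local delay="${VISION_RETRY_DELAY:-2}"
  local attempt=1
  local out

  while :; do
    # Capture stdout and stderr; ignore the exit code and inspect the text.
    out="$("$@" 2>&1)" || true
    case "$out" in
      *"API Error: 1033"*)
        if [ "$attempt" -ge "$max_attempts" ]; then
          printf '%s\n' "$out"
          return 1
        fi
        attempt=$((attempt + 1))
        sleep "$delay"
        ;;
      *)
        printf '%s\n' "$out"
        return 0
        ;;
    esac
  done
}
```

Usage would look like `vision_retry ./vision.sh photo.jpg "Describe this image"`. Only the 1033 case is retried; any other output is printed once and returned as-is.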

AB-Agents 🦀

Metadata

Author: @alexburrstudio

Stars: 4473

Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alexburrstudio-ab-agents-vision": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Related Skills

xiaohongshu-browser

Browse Xiaohongshu (小红书) and take screenshots of posts. Supports keyword search, post modal screenshots, and returns post links. Requires prior manual login.

besty0121 4473

competitor-message-differ

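Compares how competitors differ in information architecture and narrative, and identifies what is worth borrowing and what to avoid imitating. Use for competitor, messaging, and positioning workflows; do not use for copying competitor copy or fabricating competitor data.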

52yuanchangxing 4473

skill-gap-finder

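Scans an existing skill bundle to find duplicates, gaps, conflicting responsibilities, and the openings most worth filling. Use for skills, bundle, and analysis workflows; do not use for inventing directory contents or directly deleting skills.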

52yuanchangxing 4473

survey-response-coder

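Codes open-ended survey responses into themes, sentiments, and tags, and generates a reviewable codebook. Use for survey, coding, and qualitative workflows; do not use for treating a small sample as a population-level conclusion or exposing respondents' private information.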

52yuanchangxing 4473

proposal-comparator

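Compares multiple proposal documents and outputs their differences, hidden costs, risks, and a recommendation. Use for proposal, comparison, and decision workflows; do not use for ignoring user-provided constraints or fabricating data to support a conclusion.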

52yuanchangxing 4473
