AB-Agents-Vision-MiniMax
ðïļ Image analysis via MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Requires MiniMax Token Plan API key (free tier available).
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alexburrstudio/ab-agents-vision-minimaxAB Agents Vision (MiniMax) ðïļ
Image analysis via MiniMax VL API â simple, fast, reliable.
â ïļ Requires MiniMax Token Plan API key â get free key
What It Does
- ðļ Describe images â Get detailed scene descriptions
- ð Extract text â Read from screenshots, photos, documents
- ð Analyze photos â Identify objects, people, settings
- ð URL support â Analyze images from the web
Requirements
- MiniMax Token Plan API key â Subscribe free
- Linux/macOS
uvx(auto-installed)
Quick Start
# 1. Install uvx
curl -LsSf https://astral.sh/uv/install.sh | sh
# 2. Get free MiniMax API key
# https://platform.minimax.io â Subscribe â Token Plan (free tier)
# 3. Use
export MINIMAX_API_KEY="sk-cp-your-key"
./vision.sh image.jpg "Describe this image"
Usage
# Basic description
./vision.sh photo.jpg
# With custom prompt
./vision.sh screenshot.png "What text do you see?"
# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
Examples
Screenshot analysis:
Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."
Photo description:
Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned..."
Installation
git clone https://github.com/alexburrstudio/ab-agents-vision.git
cd ab-agents-vision/skills/vision
chmod +x vision.sh
Or via ClaWHub:
clawhub install AB-Agents-Vision-MiniMax
Troubleshooting
| Error | Solution |
|---|---|
| API Error: 1033 | Retry â MiniMax system error |
| No response | Check MINIMAX_API_KEY is set correctly |
| Slow | Use smaller images (<10MB) |
AB-Agents ðĶ
Related Skills
ð AB Agents Meter Reader â Read meter readings from photos (uses this skill for vision)
AB-Agents ðĶ
Metadata
Not sure this is the right skill?
Describe what you want to build â we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-alexburrstudio-ab-agents-vision-minimax": {
"enabled": true,
"auto_update": true
}
}
}Tags
Related Skills
xiaohongshu-browser
Browse Xiaohongshu (å°įšĒäđĶ) and take screenshots of posts. Supports keyword search, post modal screenshots, and returns post links. Requires prior manual login.
AB-Agents-Memory
ð§ Long-term memory system for OpenClaw agents. Manages entities, context, and knowledge base with Obsidian integration. By AB-Agents (Alex Burr).
AB-Agents-Meter-Reader
ð Read meter readings from photos. Electricity (day/night tariffs) and water meters. Saves history and generates messages for landlord.
AB-Agents-Vision
ðïļ Image analysis using MiniMax VL API. Describe images, extract text from screenshots, analyze photos. Works with local files and URLs. Simple shell wrapper.
DocPilot
æšč―ææĄĢåĪįäļåŪķïžæŊæææĄĢč§ĢæãäŋĄæŊæ―åãææĄĢåįąŧ