model-council
Multi-model consensus system — send a query to 3+ different LLMs via OpenRouter simultaneously, then a judge model evaluates all responses and produces a winner, reasoning, and synthesized best answer. Like having a board of AI advisors. Use for important decisions, code review, research verification.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/aiwithabidi/model-council-proModel Council 🏛️
Get consensus from multiple AI models on any question.
Send your query to 3+ different LLMs simultaneously via OpenRouter. A judge model evaluates all responses and produces a winner, reasoning, and synthesized best answer.
When to Use
- Important decisions — Don't trust one model's opinion
- Code review — Get multiple perspectives on architecture choices
- Research verification — Cross-check facts across models
- Creative work — Compare writing styles and pick the best
- Debugging — When one model is stuck, others might see the issue
How It Works
Your Question
├──→ Claude Sonnet 4 ──→ Response A
├──→ GPT-4o ──→ Response B
└──→ Gemini 2.0 Flash ──→ Response C
│
Judge (Opus) evaluates all
│
├── Winner + Reasoning
├── Synthesized Best Answer
└── Cost Breakdown
Quick Start
# Basic usage
python3 {baseDir}/scripts/model_council.py "What's the best database for a real-time analytics dashboard?"
# Custom models
python3 {baseDir}/scripts/model_council.py --models "anthropic/claude-sonnet-4,openai/gpt-4o,google/gemini-2.5-pro" "Your question"
# Custom judge
python3 {baseDir}/scripts/model_council.py --judge "openai/gpt-4o" "Your question"
# JSON output
python3 {baseDir}/scripts/model_council.py --json "Your question"
# Set max tokens per response
python3 {baseDir}/scripts/model_council.py --max-tokens 2000 "Your question"
Configuration
| Flag | Default | Description |
|---|---|---|
--models | claude-sonnet-4, gpt-4o, gemini-2.0-flash | Comma-separated model list |
--judge | anthropic/claude-opus-4-6 | Judge model |
--max-tokens | 1024 | Max tokens per council member |
--json | false | Output as JSON |
--timeout | 60 | Timeout per model (seconds) |
Environment
Requires OPENROUTER_API_KEY environment variable.
Output Example
═══ MODEL COUNCIL RESULTS ═══
Question: What's the best way to handle auth in a microservices architecture?
── Council Member Responses ──
🤖 anthropic/claude-sonnet-4 ($0.0043)
Use a centralized auth service with JWT tokens...
🤖 openai/gpt-4o ($0.0038)
Implement OAuth 2.0 with an API gateway...
🤖 google/gemini-2.0-flash-001 ($0.0012)
Consider using service mesh with mTLS...
── Judge Verdict (anthropic/claude-opus-4-6, $0.0125) ──
🏆 Winner: anthropic/claude-sonnet-4
Reasoning: Most comprehensive and practical approach...
📝 Synthesized Answer:
The best approach combines elements from all three...
💰 Total Cost: $0.0218
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-aiwithabidi-model-council-pro": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
freshsales
Freshsales CRM integration — manage contacts, leads, deals, accounts, tasks, and sales sequences via the Freshsales API. Track deal pipelines, automate lead assignments, log activities, and generate sales reports. Built for AI agents — Python stdlib only, no dependencies. Use for sales CRM, contact management, deal tracking, pipeline reporting, and sales automation.
gemini-video-analyzer
Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe speech, identify objects and actions. Use when: (1) User sends a video file and wants it analyzed, (2) Video summarization or description needed, (3) Extracting text, UI elements, or information from screen recordings, (4) Answering questions about video content, (5) Comparing multiple videos, (6) Analyzing tutorials, demos, or walkthroughs.
agent-memory
Full AI agent memory stack — Mem0 unified memory engine with vector search (Qdrant) and knowledge graph (Neo4j), plus SQLite for structured data. Complete setup script and tools. Give your OpenClaw agent a real brain with semantic recall, entity relationships, and structured storage.
neon
Neon serverless Postgres — manage projects, branches, databases, roles, endpoints, and compute via the Neon API. Create database branches for development, manage connection endpoints, scale compute, and monitor usage. Built for AI agents — Python stdlib only, zero dependencies. Use for serverless Postgres, database branching, database management, development workflows, and cloud database automation.
onepassword
1Password Connect — vaults, items, secrets management for server-side applications.