Official Verified

Model Failover Doctor

Skill by halfmoon82

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/halfmoon82/model-failover-doctor

Download Source Code (.zip)

model-failover-doctor

诊断和修复 OpenClaw "All models failed" 错误的专用工具。

触发条件

遇到以下任何一种情况时，调用此工具：

日志或用户报告 All models failed (N)，且 N 个 provider 的错误信息中 model ID 全部相同
- 例：kimi-coding/k2p5: No available channel for model openai/gpt-5.3-codex ← provider 和 model 对不上
agent 重启后第一条消息必然失败，但后续消息正常（冷启动 session 无 fallbackChain）
pools.json 或 session_model_state.json 手动编辑后 agent 开始报 503 model_not_found

诊断命令

# 仅诊断，不修改任何文件
python3 ~/.openclaw/workspace/skills/model-failover-doctor/model_failover_doctor.py

# 诊断 + 自动修复 + 重启 gateway
python3 ~/.openclaw/workspace/skills/model-failover-doctor/model_failover_doctor.py --fix --restart

# 预览将要修改的内容（不实际写入）
python3 ~/.openclaw/workspace/skills/model-failover-doctor/model_failover_doctor.py --dry-run

根因速查表

症状	代码	严重	自动修复
所有 fallback 的 model ID 相同（provider 已切换但 model 没变）	MI-1	🔴	✅
同一死亡模型被不同 session/子代理反复踩坑	MI-2	🟡	❌ 需手动
pools.json 中引用了不存在的 provider	P-1	🔴	✅
session 无 fallbackChain，runtime fallback 永远无法推进	S-1	🔴	✅
session fallbackChain 含无效 provider 前缀	S-2	🔴	✅

根因 MI-1 详解（最常见）

问题：message-injector 的 before_agent_start 无条件返回：

return { modelOverride, providerOverride, ... }

后果：Gateway 尝试每个 fallback 时都携带相同的 modelOverride，导致 kimi-coding、zai、minimax 等收到了错误的 model ID。

修复：包装在 lockModel 条件中，正常路由只依赖 sessions.patch：

return { ...(lockModel ? { modelOverride, providerOverride } : {}), ... }

备份说明

所有自动修复操作会在 ~/.openclaw/workspace/.lib/.mfd_backups/ 创建时间戳备份，可随时手动恢复。

Read Full Documentation on GitHub

Metadata

Author@halfmoon82

Stars2387

Updated2026-03-09

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-halfmoon82-model-failover-doctor": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

Skill Trigger V2

Skill by halfmoon82

halfmoon82 2387

Complex Task Methodology

Skill by halfmoon82

halfmoon82 2387

semantic-router

让 AI 代理根据对话内容自动选择最合适的模型。四层识别（系统过滤→关键词→指示词→语义相似度），四池架构（高速/智能/人文/代理），五分支路由，全自动 Fallback 回路。支持 trigger_groups_all 非连续词组命中。

halfmoon82 2387

subagent-isolation-guard

固化子代理物理隔离与语义路由旁路。防止跨代理上下文污染及由于语义路由导致的子代理切模/重置问题。

halfmoon82 2387

skill-safe-install

L0 级技能安全安装流程。触发“安装技能/安全安装/审查权限”时，强制执行 Step0-5（查重→检索→审查→沙箱→正式安装→白名单）。

halfmoon82 2387