provider-probe
Probe and verify whether an OpenAI-compatible baseURL is a real single-model endpoint or a multi-model aggregation pool. Use when auditing model providers, checking /models output, comparing completions vs responses support, validating claimed model IDs like gpt-5.4, or producing a provider trust/stability report for OpenClaw routing decisions.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/andyrenxu7255/provider-probeProvider Probe
Use this skill to investigate model providers behind OpenAI-compatible base URLs.
When to use
Trigger this skill when the user asks to:
- verify whether a provider's claimed model is real
- inspect a baseURL for hidden/mixed model pools
- compare multiple providers for the same claimed model
- determine whether a provider is better suited as primary or fallback
- create a trust/stability report for model routing
Core method
Always use a layered evidence approach:
- Read provider config or ask for baseURL + apiKey + claimed model id.
- Call
/modelsand inspect whether the returned pool contains mixed vendors or suspicious aliases. - Check metadata like
owned_by, model naming conventions, and whether one baseURL exposes many unrelated model families. - Probe both
/responsesand/chat/completionswith minimal prompts. - Run short capability tests and repeated stability tests.
- Summarize with a confidence rating rather than absolute certainty.
Confidence labels
- High confidence real / most likely genuine: stable, coherent endpoint behavior, believable output structure, low ambiguity.
- Medium confidence / likely routed or wrapped: works, but signs suggest aggregation, aliasing, or proxy adaptation.
- Low confidence / unusable now: 404, repeated timeout, incompatible shape, or too little evidence.
Output contract
Always report:
- 当前做到哪了 / what was tested
- 当前阻塞点 / what remains uncertain
- 下一步动作 / recommended next step
For final results, include:
- Config facts
/modelsfindings- Endpoint compatibility findings
- Repeated stability findings
- Capability/format findings
- Final trust judgment
- Recommendation: primary / fallback / avoid
Tooling
Prefer the bundled script for deterministic testing:
scripts/provider_probe.py
Usage:
python3 scripts/provider_probe.py --config /root/.openclaw/openclaw.json --providers ypemc omgteam vpsai --model gpt-5.4
Or probe a custom URL directly:
python3 scripts/provider_probe.py --base-url https://example.com/v1 --api-key sk-xxx --model gpt-5.4
Interpretation heuristics
Treat a provider as a likely aggregation pool when several of these appear together:
/modelsreturns many unrelated model familiesowned_byvalues are mixed or inconsistent- the claimed model id looks like a routing alias rather than a canonical model id
/responsesand/chat/completionscompatibility is uneven or surprising- behavior is stable enough to work but not coherent enough to look like a single official upstream
Files
- Reference checklist:
references/provider-probe-checklist.md - Probe script:
scripts/provider_probe.py
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-andyrenxu7255-provider-probe": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
meddic-b2b-sales-review
Structured B2B sales review, deal review, pipeline review, and sales coaching using MEDDIC as the core framework. Helps sales leaders review opportunities, projects, forecasts, losses, and rep 1:1s while guiding next-best actions, resource decisions, and forecast calibration. Use when you need MEDDIC-based review for complex B2B sales management.
llm-provider-forensics
Forensically verify what model family or routing layer may actually sit behind a claimed LLM endpoint or model ID. Use when an agent must investigate whether a provider is genuine, proxied, aliased, aggregated, wrapped, or currently unusable across OpenAI-compatible protocol layers, GPT/OpenAI, Anthropic/Claude, Google Gemini, GLM/Zhipu, Qwen/Tongyi, Kimi/Moonshot, MiniMax, DeepSeek, and mixed compatibility gateways. Supports deeper family-fingerprint analysis, long-context tests, structured-output stress, refusal and variance profiling, streaming/error clues, repeated stability checks, and cross-provider comparison reports.