Official Verified

provider-probe

Probe and verify whether an OpenAI-compatible baseURL is a real single-model endpoint or a multi-model aggregation pool. Use when auditing model providers, checking /models output, comparing completions vs responses support, validating claimed model IDs like gpt-5.4, or producing a provider trust/stability report for OpenClaw routing decisions.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/andyrenxu7255/provider-probe

Download Source Code (.zip)

Provider Probe

Use this skill to investigate model providers behind OpenAI-compatible base URLs.

When to use

Trigger this skill when the user asks to:

verify whether a provider's claimed model is real
inspect a baseURL for hidden/mixed model pools
compare multiple providers for the same claimed model
determine whether a provider is better suited as primary or fallback
create a trust/stability report for model routing

Core method

Always use a layered evidence approach:

Read provider config or ask for baseURL + apiKey + claimed model id.
Call /models and inspect whether the returned pool contains mixed vendors or suspicious aliases.
Check metadata like owned_by, model naming conventions, and whether one baseURL exposes many unrelated model families.
Probe both /responses and /chat/completions with minimal prompts.
Run short capability tests and repeated stability tests.
Summarize with a confidence rating rather than absolute certainty.

Confidence labels

High confidence real / most likely genuine: stable, coherent endpoint behavior, believable output structure, low ambiguity.
Medium confidence / likely routed or wrapped: works, but signs suggest aggregation, aliasing, or proxy adaptation.
Low confidence / unusable now: 404, repeated timeout, incompatible shape, or too little evidence.

Output contract

Always report:

当前做到哪了 / what was tested
当前阻塞点 / what remains uncertain
下一步动作 / recommended next step

For final results, include:

Config facts
/models findings
Endpoint compatibility findings
Repeated stability findings
Capability/format findings
Final trust judgment
Recommendation: primary / fallback / avoid

Tooling

Prefer the bundled script for deterministic testing:

scripts/provider_probe.py

Usage:

python3 scripts/provider_probe.py --config /root/.openclaw/openclaw.json --providers ypemc omgteam vpsai --model gpt-5.4

Or probe a custom URL directly:

python3 scripts/provider_probe.py --base-url https://example.com/v1 --api-key sk-xxx --model gpt-5.4

Interpretation heuristics

Treat a provider as a likely aggregation pool when several of these appear together:

/models returns many unrelated model families
owned_by values are mixed or inconsistent
the claimed model id looks like a routing alias rather than a canonical model id
/responses and /chat/completions compatibility is uneven or surprising
behavior is stable enough to work but not coherent enough to look like a single official upstream

Files

Reference checklist: references/provider-probe-checklist.md
Probe script: scripts/provider_probe.py

Read Full Documentation on GitHub

Metadata

Author@andyrenxu7255

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-andyrenxu7255-provider-probe": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

meddic-b2b-sales-review

Structured B2B sales review, deal review, pipeline review, and sales coaching using MEDDIC as the core framework. Helps sales leaders review opportunities, projects, forecasts, losses, and rep 1:1s while guiding next-best actions, resource decisions, and forecast calibration. Use when you need MEDDIC-based review for complex B2B sales management.

andyrenxu7255 4473

llm-provider-forensics

Forensically verify what model family or routing layer may actually sit behind a claimed LLM endpoint or model ID. Use when an agent must investigate whether a provider is genuine, proxied, aliased, aggregated, wrapped, or currently unusable across OpenAI-compatible protocol layers, GPT/OpenAI, Anthropic/Claude, Google Gemini, GLM/Zhipu, Qwen/Tongyi, Kimi/Moonshot, MiniMax, DeepSeek, and mixed compatibility gateways. Supports deeper family-fingerprint analysis, long-context tests, structured-output stress, refusal and variance profiling, streaming/error clues, repeated stability checks, and cross-provider comparison reports.

andyrenxu7255 4473