llm-provider-forensics
Forensically verify what model family or routing layer may actually sit behind a claimed LLM endpoint or model ID. Use when an agent must investigate whether a provider is genuine, proxied, aliased, aggregated, wrapped, or currently unusable across OpenAI-compatible protocol layers, GPT/OpenAI, Anthropic/Claude, Google Gemini, GLM/Zhipu, Qwen/Tongyi, Kimi/Moonshot, MiniMax, DeepSeek, and mixed compatibility gateways. Supports deeper family-fingerprint analysis, long-context tests, structured-output stress, refusal and variance profiling, streaming/error clues, repeated stability checks, and cross-provider comparison reports.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/andyrenxu7255/llm-provider-forensicsLLM Provider Forensics
Agent-facing forensic skill for identifying what an LLM endpoint most likely is.
Trigger conditions
Use this skill when asked to:
- verify whether a claimed model is genuine
- identify which family an endpoint most resembles
- distinguish focused route vs wrapped route vs aggregation pool
- compare multiple providers claiming to expose the same model
- evaluate primary/fallback/avoid decisions
- deeply audit suspicious gateways for GPT / Claude / Gemini / GLM / Qwen / Kimi / MiniMax / DeepSeek behavior
Core rule
Do not output false certainty. Produce a confidence-based operational judgment.
Coverage
Families:
- OpenAI-compatible protocol layer
- GPT / OpenAI-style
- Claude / Anthropic-style
- Gemini / Google-style
- GLM / Zhipu-style
- Qwen / Tongyi-style
- Kimi / Moonshot-style
- MiniMax-style
- DeepSeek-style
- mixed aggregation pool / compatibility gateway
Dimensions:
- catalog topology
- protocol compatibility
- response schema shape
- repeated stability
- strict formatting control
- family fingerprinting
- long-context retention
- structured-output stress
- refusal/safety style
- randomness / variance profile
- streaming / error fingerprints
- cross-protocol consistency
Current implementation note:
openai-compatiblenow means protocol layer only, not GPT-family proof.- The deepest automatic suite is strongest for OpenAI-compatible / mixed gateway providers.
- Anthropic-native and Gemini-native routes currently have solid protocol/family checks, plus native deep tests, but protocol success alone must not be read as family proof.
- Treat all family conclusions as confidence-based and inspect references before overclaiming.
Investigation workflow
- Identify likely protocol family or families.
- Probe catalog/list endpoints when available.
- Probe minimal inference endpoints for each plausible protocol family.
- Separate protocol-layer conclusion from suspected model family conclusion.
- Run repeated stability tests on the best working route.
- Run strict formatting tests.
- Run deeper advanced dimensions when the user prioritizes realism over speed.
- Inspect family fingerprint evidence and produce a confidence-based judgment.
References to load as needed
- Main checklist:
references/forensics-checklist.md - Advanced dimensions:
references/advanced-dimensions.md - Error/stream/variance:
references/error-stream-variance.md - Protocol specifics:
references/protocol-openai.md,references/protocol-anthropic.md,references/protocol-gemini.md,references/protocol-glm.md - Family fingerprints:
references/fingerprint-*.md - Native deep tests:
references/deep-claude.md,references/deep-gemini.md
Final labels
high-confidence-focused-or-genuine-routemedium-confidence-likely-routed-or-wrappedhigh-confidence-multi-model-aggregation-poollow-confidence-or-unusable
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-andyrenxu7255-llm-provider-forensics": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
meddic-b2b-sales-review
Structured B2B sales review, deal review, pipeline review, and sales coaching using MEDDIC as the core framework. Helps sales leaders review opportunities, projects, forecasts, losses, and rep 1:1s while guiding next-best actions, resource decisions, and forecast calibration. Use when you need MEDDIC-based review for complex B2B sales management.
provider-probe
Probe and verify whether an OpenAI-compatible baseURL is a real single-model endpoint or a multi-model aggregation pool. Use when auditing model providers, checking /models output, comparing completions vs responses support, validating claimed model IDs like gpt-5.4, or producing a provider trust/stability report for OpenClaw routing decisions.