Official Verified

prompt-ab-lab

Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/52yuanchangxing/prompt-ab-lab

Download Source Code (.zip)

Prompt A/B Lab

Purpose

Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.

Trigger phrases

比较两个提示词
prompt ab test
提示词实验
哪个 prompt 更好
建一个评测表

Ask for these inputs

prompt A and B
task
evaluation criteria
test set
weights if any

Workflow

Define what success looks like before comparing prompts.
Generate an evaluation rubric and structured test table.
Log outputs per test case and compute weighted scores.
Summarize tradeoffs instead of declaring a winner too early.
Recommend the next experiment iteration.

Output contract

experiment plan
scored comparison table
rubric
next-iteration suggestions

Files in this skill

Script: {baseDir}/scripts/prompt_experiment_logger.py
Resource: {baseDir}/resources/eval_rubric.md

Operating rules

Be concrete and action-oriented.
Prefer preview / draft / simulation mode before destructive changes.
If information is missing, ask only for the minimum needed to proceed.
Never fabricate metrics, legal certainty, receipts, credentials, or evidence.
Keep assumptions explicit.

Suggested prompts

比较两个提示词
prompt ab test
提示词实验

Use of script and resources

Use the bundled script when it helps the user produce a structured file, manifest, CSV, or first-pass draft. Use the resource file as the default schema, checklist, or preset when the user does not provide one.

Boundaries

This skill supports planning, structuring, and first-pass artifacts.
It should not claim that files were modified, messages were sent, or legal/financial decisions were finalized unless the user actually performed those actions.

Compatibility notes

Directory-based AgentSkills/OpenClaw skill.
Runtime dependency declared through metadata.openclaw.requires.
Helper script is local and auditable: scripts/prompt_experiment_logger.py.
Bundled resource is local and referenced by the instructions: resources/eval_rubric.md.

Read Full Documentation on GitHub

Metadata

Author@52yuanchangxing

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-52yuanchangxing-prompt-ab-lab": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

portfolio-case-study-forge

Turn rough project notes into polished portfolio case studies with metrics, visuals checklist, and interviewer talking points.

52yuanchangxing 4473

evidence-gap-mapper

在报告、方案或演示稿中定位结论先行但证据不足的位置，并给出补证优先级。；use for evidence, gap-analysis, research workflows；do not use for 伪造数据支撑结论, 忽略高风险假设.

52yuanchangxing 4473

policy-to-checklist

把征稿启事、通知、比赛规则、制度文件、招标要求等转成可执行检查清单与时间线。

52yuanchangxing 4473

incident-postmortem-assistant

将事故线索整理成复盘草案，区分根因、诱因、放大器、影响与修复动作。；use for incident, postmortem, sre workflows；do not use for 归责个人, 篡改时间线.

52yuanchangxing 4473

ecommerce-customer-service-pro

行业可选的智能电商客服技能。用于售前咨询、售中跟进、催付催单、发货物流、售后处理、退款退换、投诉安抚、差评挽回、FAQ整理、达人与机构商务沟通等场景；先识别行业与场景，再输出全面、合规、可直接发送的话术与处理建议。

52yuanchangxing 4473