prompt-ab-lab
Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/52yuanchangxing/prompt-ab-labPrompt A/B Lab
Purpose
Design, log, compare, and score prompt experiments so users can systematically improve outputs instead of guessing.
Trigger phrases
- 比较两个提示词
- prompt ab test
- 提示词实验
- 哪个 prompt 更好
- 建一个评测表
Ask for these inputs
- prompt A and B
- task
- evaluation criteria
- test set
- weights if any
Workflow
- Define what success looks like before comparing prompts.
- Generate an evaluation rubric and structured test table.
- Log outputs per test case and compute weighted scores.
- Summarize tradeoffs instead of declaring a winner too early.
- Recommend the next experiment iteration.
Output contract
- experiment plan
- scored comparison table
- rubric
- next-iteration suggestions
Files in this skill
- Script:
{baseDir}/scripts/prompt_experiment_logger.py - Resource:
{baseDir}/resources/eval_rubric.md
Operating rules
- Be concrete and action-oriented.
- Prefer preview / draft / simulation mode before destructive changes.
- If information is missing, ask only for the minimum needed to proceed.
- Never fabricate metrics, legal certainty, receipts, credentials, or evidence.
- Keep assumptions explicit.
Suggested prompts
- 比较两个提示词
- prompt ab test
- 提示词实验
Use of script and resources
Use the bundled script when it helps the user produce a structured file, manifest, CSV, or first-pass draft. Use the resource file as the default schema, checklist, or preset when the user does not provide one.
Boundaries
- This skill supports planning, structuring, and first-pass artifacts.
- It should not claim that files were modified, messages were sent, or legal/financial decisions were finalized unless the user actually performed those actions.
Compatibility notes
- Directory-based AgentSkills/OpenClaw skill.
- Runtime dependency declared through
metadata.openclaw.requires. - Helper script is local and auditable:
scripts/prompt_experiment_logger.py. - Bundled resource is local and referenced by the instructions:
resources/eval_rubric.md.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-52yuanchangxing-prompt-ab-lab": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
portfolio-case-study-forge
Turn rough project notes into polished portfolio case studies with metrics, visuals checklist, and interviewer talking points.
evidence-gap-mapper
在报告、方案或演示稿中定位结论先行但证据不足的位置,并给出补证优先级。;use for evidence, gap-analysis, research workflows;do not use for 伪造数据支撑结论, 忽略高风险假设.
policy-to-checklist
把征稿启事、通知、比赛规则、制度文件、招标要求等转成可执行检查清单与时间线。
incident-postmortem-assistant
将事故线索整理成复盘草案,区分根因、诱因、放大器、影响与修复动作。;use for incident, postmortem, sre workflows;do not use for 归责个人, 篡改时间线.
ecommerce-customer-service-pro
行业可选的智能电商客服技能。用于售前咨询、售中跟进、催付催单、发货物流、售后处理、退款退换、投诉安抚、差评挽回、FAQ整理、达人与机构商务沟通等场景;先识别行业与场景,再输出全面、合规、可直接发送的话术与处理建议。