Official Verified data analysis Safety 5/5

experiment-designer

Use when planning product experiments, writing testable hypotheses, estimating sample size, prioritizing tests, or interpreting A/B outcomes with practical statistical rigor.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alirezarezvani/experiment-designer

What This Skill Does

The experiment-designer skill is a framework for bringing statistical rigor to the product development lifecycle. It acts as an analytical partner that helps you back product changes with testable hypotheses rather than intuition. By standardizing experiment design, it assists in defining metrics, calculating required sample sizes, prioritizing backlog items with the ICE framework, and interpreting statistical outcomes with a focus on business significance rather than p-values alone. In short, it bridges the gap between raw data analysis and actionable product strategy.

Installation

To integrate this skill into your OpenClaw environment, run the following command in your terminal:

clawhub install openclaw/skills/skills/alirezarezvani/experiment-designer

Make sure Python 3 is installed, since the included sample size calculator script requires it to run its power analyses.

Use Cases

Hypothesis Validation: Rapidly draft if/then/because statements to align teams on the causal mechanism behind proposed features.
Risk Mitigation: Use the skill to estimate the required sample size before launch, preventing the common mistake of running underpowered tests that produce inconclusive results.
Backlog Prioritization: Utilize the ICE (Impact, Confidence, Ease) scoring system to make objective decisions about which experiments to run first in your roadmap.
Result Analysis: Interpret complex A/B testing output to identify novelty effects, segment heterogeneity, and ensure that declared winners are statistically robust enough to scale.
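
As a rough illustration of the Backlog Prioritization use case, the sketch below ranks a backlog by ICE score. It assumes the common convention of rating Impact, Confidence, and Ease from 1 to 10 and multiplying them; the skill itself may score differently, and the Experiment class, the prioritize function, and the backlog entries are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Experiment:
    """A hypothetical backlog item rated on the three ICE dimensions (1-10)."""
    name: str
    impact: int      # expected effect on the target metric
    confidence: int  # how sure the team is that the hypothesis holds
    ease: int        # how cheap the test is to build and run

    @property
    def ice_score(self) -> int:
        # Assumption: multiplicative ICE. Some teams average the three instead.
        return self.impact * self.confidence * self.ease


def prioritize(experiments):
    """Return experiments sorted from highest to lowest ICE score."""
    return sorted(experiments, key=lambda e: e.ice_score, reverse=True)


# Made-up example backlog, for illustration only.
backlog = [
    Experiment("checkout button color", impact=4, confidence=6, ease=9),
    Experiment("one-page checkout", impact=8, confidence=5, ease=3),
    Experiment("free-shipping banner", impact=6, confidence=7, ease=8),
]

for exp in prioritize(backlog):
    print(f"{exp.ice_score:4d}  {exp.name}")
```

Multiplying penalizes any idea that is weak on a single dimension, whereas averaging lets a high Impact rating offset a low Ease rating. Either is defensible as long as the team applies one rule consistently.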

Example Prompts

"I'm planning to test a new checkout button color. Help me write a robust hypothesis using the If/Then/Because format, and then determine the sample size needed for a 2% absolute uplift given a 15% baseline conversion rate."
"We have three experiments in our queue. Use the ICE framework to help me prioritize them based on the potential conversion impact, our confidence in each hypothesis, and the engineering effort for each."
"Our latest A/B test shows a p-value of 0.04 with a 95% confidence interval that barely excludes zero. How should I interpret these results, and should we roll out the feature?"
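
To give a sense of the numbers behind the first prompt, here is a minimal sketch of a sample size estimate for a two-sided test of two proportions, using the standard normal-approximation formula. This is not the calculator script bundled with the skill. The function name sample_size_per_arm and the defaults (alpha of 0.05, power of 0.80) are assumptions chosen for illustration.

```python
import math
from statistics import NormalDist


def sample_size_per_arm(baseline, absolute_uplift, alpha=0.05, power=0.80):
    """Approximate users needed per arm to detect an absolute uplift.

    Uses the normal approximation for a two-sided, two-proportion z-test
    with equal allocation. Hypothetical helper, not the skill's own script.
    """
    p1 = baseline
    p2 = baseline + absolute_uplift
    if not (0 < p1 < 1 and 0 < p2 < 1) or p1 == p2:
        raise ValueError("rates must lie in (0, 1) and differ from each other")

    p_bar = (p1 + p2) / 2
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value, two-sided
    z_beta = NormalDist().inv_cdf(power)           # quantile for desired power

    numerator = (
        z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    ) ** 2
    return math.ceil(numerator / (p2 - p1) ** 2)


# The scenario from the first prompt: 15% baseline, 2-point absolute uplift.
n = sample_size_per_arm(baseline=0.15, absolute_uplift=0.02)
print(f"Users needed per arm: {n}")
```

Under these assumptions the estimate comes out on the order of 5,300 users per arm. Halving the detectable uplift roughly quadruples the requirement, which is why the minimum detectable effect should be agreed before the test starts.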

Tips & Limitations

Fixed Horizons: Always define your test duration or sample size before the test begins. Avoid 'peeking' at results, as this inflates false positive rates (Type I errors).
Guardrails: Always track secondary metrics. A win on primary conversion should not ship if it comes with a significant spike in customer support tickets or a drop in site performance.
Business vs. Stat Significance: A statistically significant result isn't always useful. If the value of the measured effect is smaller than the cost of maintaining the feature, prioritize other work.
Limitations: This skill is an analytical aid. It assumes your instrumentation and data collection are reliable. Always perform a sanity check for sample ratio mismatch before interpreting test outcomes.
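
The sample ratio mismatch check mentioned under Limitations can be sketched as a chi-square goodness-of-fit test on the observed arm sizes. The version below is an illustration using only the standard library and covers the two-arm case. The function name srm_p_value, the example counts, and the 0.001 alert threshold are assumptions, not values taken from the skill.

```python
import math


def srm_p_value(control_users, treatment_users, expected_control_share=0.5):
    """P-value of a chi-square test (1 degree of freedom) for sample ratio mismatch.

    Compares observed arm sizes against the planned split. Hypothetical helper.
    """
    total = control_users + treatment_users
    expected_control = total * expected_control_share
    expected_treatment = total - expected_control

    chi2 = (
        (control_users - expected_control) ** 2 / expected_control
        + (treatment_users - expected_treatment) ** 2 / expected_treatment
    )
    # For 1 degree of freedom, the chi-square survival function
    # reduces to erfc(sqrt(chi2 / 2)).
    return math.erfc(math.sqrt(chi2 / 2))


# Made-up counts for a test planned as a 50/50 split.
p = srm_p_value(control_users=5000, treatment_users=5400)

# Assumption: a strict threshold such as 0.001 is often used for SRM alerts.
if p < 0.001:
    print(f"Possible sample ratio mismatch (p = {p:.2g}); investigate before analysis.")
else:
    print(f"No evidence of sample ratio mismatch (p = {p:.2g}).")
```

A very small p-value here points to a problem with assignment or logging, so the experiment's metric results should not be trusted until the cause is found.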

Metadata

Author: @alirezarezvani

Stars: 4473

Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alirezarezvani-experiment-designer": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags (AI)

#experimentation #ab-testing #data-science #product-strategy #statistics

Safety Score: 5/5

Flags: code-execution

Related Skills

intl-expansion

International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.

alirezarezvani 4473

marketing-strategy-pmm

Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.

alirezarezvani 4473

paid-ads

When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.

alirezarezvani 4473

qms-audit-expert

ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.

alirezarezvani 4473

code-reviewer

Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.

alirezarezvani 4473