ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified data analysis Safety 4/5

senior-data-scientist

World-class senior data scientist skill specialising in statistical modeling, experiment design, causal inference, and predictive analytics. Covers A/B testing (sample sizing, two-proportion z-tests, Bonferroni correction), difference-in-differences, feature engineering pipelines (Scikit-learn, XGBoost), cross-validated model evaluation (AUC-ROC, AUC-PR, SHAP), and MLflow experiment tracking — using Python (NumPy, Pandas, Scikit-learn), R, and SQL. Use when designing or analysing controlled experiments, building and evaluating classification or regression models, performing causal analysis on observational data, engineering features for structured tabular datasets, or translating statistical findings into data-driven business decisions.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alirezarezvani/senior-data-scientist
Or

What This Skill Does

The Senior Data Scientist skill is a comprehensive toolkit designed for production-level statistical modeling, causal inference, and machine learning pipelines. It empowers OpenClaw to act as an expert collaborator capable of designing rigorous A/B tests, conducting feature engineering, and evaluating model performance using industry-standard metrics. Whether you are dealing with structured tabular data or complex observational studies, this skill bridges the gap between raw data and actionable business intelligence. It provides pre-validated methodologies for sample size calculation, difference-in-differences analysis, and cross-validated predictive analytics using a robust stack of Python libraries like Scikit-learn, NumPy, and Pandas.

Installation

To integrate this skill into your environment, run the following command in your terminal: clawhub install openclaw/skills/skills/alirezarezvani/senior-data-scientist Ensure you have the necessary environment dependencies installed to support advanced scientific Python packages.

Use Cases

  • Experimentation: Design and analyze A/B tests with strict statistical controls, including MDE (Minimum Detectable Effect) calculation, power analysis, and Bonferroni corrections for multiple testing.
  • Machine Learning Pipelines: Build automated feature engineering workflows using ColumnTransformers and Pipelines to ensure consistency between training and inference environments.
  • Causal Analysis: Perform observational studies and difference-in-differences analysis to uncover insights when controlled experiments are not feasible.
  • Model Evaluation: Conduct thorough performance audits using AUC-ROC, AUC-PR, and interpretability tools like SHAP to ensure model fairness and reliability.
  • Business Intelligence: Translate complex statistical outputs—such as confidence intervals, lift metrics, and feature importance—into clear, data-driven recommendations for stakeholders.

Example Prompts

  1. "I am planning a website conversion experiment with a 10% baseline. Calculate the required sample size per variant for a 5% relative MDE at 80% power."
  2. "Review my feature engineering pipeline for this churn prediction model. I need to impute missing values, scale numeric columns, and one-hot encode categorical features."
  3. "The A/B test results are in. Please perform a two-proportion z-test on these conversion numbers and provide the lift, confidence interval, and interpretation."

Tips & Limitations

  • Data Integrity: Always ensure your data is clean before feeding it into the pipeline; garbage in, garbage out applies heavily here.
  • Sample Ratio Mismatch (SRM): Always check for SRM before analyzing experiments to ensure data collection logs are accurate.
  • Business Logic: While the models provide statistical significance, always layer in domain expertise to validate if the findings make sense for your specific business cycle.
  • Multi-tasking: Remember to manually apply the Bonferroni correction if your objective is to monitor more than one primary outcome metric to avoid the p-hacking fallacy.

Metadata

Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alirezarezvani-senior-data-scientist": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#data-science#machine-learning#statistics#experimentation#python
Safety Score: 4/5

Flags: code-execution

Related Skills

intl-expansion

International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.

alirezarezvani 4473

marketing-strategy-pmm

Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.

alirezarezvani 4473

paid-ads

When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.

alirezarezvani 4473

qms-audit-expert

ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.

alirezarezvani 4473

code-reviewer

Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.

alirezarezvani 4473