einstein-research-backtest
Expert guidance for systematic backtesting of trading strategies. Use when developing, testing, stress-testing, or validating quantitative trading strategies. Covers "beating ideas to death" methodology, parameter robustness testing, slippage modeling, bias prevention, and interpreting backtest results. Applicable when user asks about backtesting, strategy validation, robustness testing, avoiding overfitting, or systematic trading development.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/clawdiri-ai/einstein-research-backtest-dvSystematic Backtesting Methodology
This skill provides expert guidance for the rigorous, systematic backtesting of quantitative trading strategies. It ensures that strategies are robust, statistically sound, and free from common biases before any consideration of live deployment. This is the methodology guide; for the programmatic backtesting engine, see the einstein-research-backtest-engine skill.
Core Principle: "Beat the Idea to Death"
A single backtest with good results is meaningless. The goal is not to find one set of parameters that worked in the past, but to prove that a strategy has a persistent edge across a wide range of market conditions and parameter variations.
When to Use This Skill
- User asks how to backtest a trading idea.
- User presents a backtest result and asks for interpretation or next steps.
- User wants to know if their strategy is robust or overfit.
- User is developing a systematic or quantitative trading strategy.
- Triggers: "backtest", "strategy validation", "robustness testing", "overfitting", "systematic trading".
The 7 Stages of Systematic Backtesting
Stage 1: Hypothesis Definition
- Action: Clearly define the strategy's logic, the underlying inefficiency it exploits, and the expected behavior.
- Example: "Hypothesis: Stocks that gap down on high volume but close in the upper 50% of their daily range tend to mean-revert over the next 1-3 days."
- Output: A clear, one-sentence hypothesis.
Stage 2: Initial Backtest
- Action: Run a single backtest using a baseline set of parameters on in-sample data.
- Goal: Sanity check. Does the idea show any promise at all?
- Tool:
einstein-research-backtest-engine - Output: Initial performance metrics (Sharpe, Max Drawdown, CAGR).
Stage 3: Parameter Robustness Testing
- Action: Vary the strategy's key parameters across a logical range.
- Example: For a moving average crossover, test 20/50, 25/60, 15/45, etc.
- Goal: Check for a "plateau" of profitability. A good strategy works across a range of parameters, not just one magic number. A single peak is a major red flag for overfitting.
- Output: A heatmap or table showing performance across parameter variations.
Stage 4: Out-of-Sample (OOS) Testing
- Action: Test the best parameter plateau from Stage 3 on a separate, unseen dataset (e.g., a different time period).
- Goal: Verify that the strategy's edge is not specific to the in-sample data.
- Rule: If performance degrades significantly (>30%) on OOS data, the strategy is likely overfit. Go back to Stage 1.
- Output: Comparison of In-Sample vs. Out-of-Sample performance metrics.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-clawdiri-ai-einstein-research-backtest-dv": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
Follow-Through Day (FTD) Detector
Detects Follow-Through Day (FTD) signals for market bottom confirmation using William O'Neil's methodology. Dual-index tracking (S&P 500 + NASDAQ) with state machine for rally attempt, FTD qualification, and post-FTD health monitoring. Use when user asks about market bottom signals, follow-through days, rally attempts, re-entry timing after corrections, or whether it's safe to increase equity exposure. Complementary to market-top-detector (defensive) - this skill is offensive (bottom confirmation).
Gumroad Page Generator
Generates Gumroad product page content from a simple spec.
Analytics Tracking
Track events and user behavior using Segment.
AutoSignals - Autonomous Trading Signal Optimization
Monitors and controls the AutoSignals autonomous research loop.
first-principles
Deep first-principles analysis of any topic, decision, strategy, or assumption. Strips inherited thinking, identifies what is provably true, and rebuilds from ground truth. Use when user asks for first principles analysis, wants to challenge assumptions, says "analyze this from scratch", "break this down", "what's really true here", or triggers with /firstprinciples. Also useful for strategic decisions, investment theses, product strategy, career moves, or any situation where conventional wisdom may be wrong.