ml-evolution-agent
Auto-evolving ML competition agent. Learns from each experiment, accumulates HCC multi-layer memory, and continuously improves LB scores. Inspired by MLE-Bench #1 ML-Master methodology.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/guohongbin-git/ml-evolution-agentML Evolution Agent 🤖
Auto-evolving ML competition agent that learns from every experiment.
What This Skill Does
- Auto-evolves ML models for Kaggle-style competitions
- HCC Multi-layer Memory - Episodic, Pattern, Knowledge, Strategic layers
- Continuous improvement - Each phase learns from previous failures/successes
- Resource-aware - Respects system limits (time, memory, API quotas)
When to Use
- User mentions Kaggle competition
- Tabular data classification/regression tasks
- Need to beat a target LB score
- User wants automated ML experimentation
Quick Start
# Initialize
from ml_evolution import MLEvolutionAgent
agent = MLEvolutionAgent(
competition="playground-series-s6e2",
target_lb=0.95400,
data_dir="./data"
)
# Run evolution
agent.evolve(max_phases=10)
HCC Memory Architecture
Layer 1: Episodic Memory
├── Experiment logs (phase, CV, LB, features, params)
├── Success/failure records
└── Resource usage tracking
Layer 2: Pattern Memory
├── What works (success patterns)
├── What fails (failure patterns)
└── When to use each approach
Layer 3: Knowledge Memory
├── Feature engineering techniques
├── Model configurations
├── Hyperparameter knowledge
└── Domain-specific features
Layer 4: Strategic Memory
├── Auto-evolution rules
├── Resource management rules
├── Exploration-exploitation balance
└── Competition-specific strategies
Proven Techniques (from real competitions)
Feature Engineering
| Technique | Effect | Best For |
|---|---|---|
| Target Statistics | +0.00018 LB | All tabular data |
| Frequency Encoding | +0.00005 LB | High-cardinality features |
| Smooth Target Encoding | +0.00003 LB | Prevent overfitting |
| Medical Indicators | +0.00006 CV | Health data |
Model Configurations
| Model | Best Params | Weight |
|---|---|---|
| CatBoost | iter=1000-1200, lr=0.04-0.05, depth=6-7 | 50% |
| XGBoost | n_est=1000-1200, lr=0.04, max_depth=6 | 25-30% |
| LightGBM | n_est=1000-1200, lr=0.04, leaves=40 | 20-25% |
Resource Limits
- Features: < 60 (avoids timeout)
- Iterations: < 1200 (avoids SIGKILL)
- Training time: < 20 min (system limit)
- Submissions: 10/day (Kaggle quota)
Evolution Rules
# Auto-evolution decision tree
if phase_improved:
keep_features()
try_similar_approach()
elif phase_degraded > 0.0001:
rollback()
try_new_direction()
else:
fine_tune_params()
# Overfitting detection
if cv_lb_gap > 0.002:
increase_regularization()
reduce_features()
simplify_model()
Files Structure
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-guohongbin-git-ml-evolution-agent": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
sspai-hot-cn
少数派热门文章监控 | SSPAI Hot Articles Monitor. 获取少数派热门数码评测、应用推荐、效率工具 | Get SSPAI trending digital reviews, app recommendations, productivity tools. 触发词:少数派、sspai、数码评测、效率工具.
binance-pro-cn
币安专业版 | Binance Pro. 完整币安集成 | Complete Binance integration. 现货/合约交易、杠杆、质押 | Spot/futures trading, leverage, staking. 触发词:币安、Binance、交易、trading.
v2ex-hot-cn
V2EX 热门话题监控 | V2EX Hot Topics Monitor. 获取 V2EX 热门帖子、技术讨论、数码生活 | Get V2EX trending posts, tech discussions, digital life. 触发词:V2EX、v2、程序员社区.
xueqiu-hot-cn
雪球热门讨论监控 | Xueqiu Hot Discussions Monitor. 获取雪球热门股票讨论、投资观点、大V动态 | Get Xueqiu trending stock discussions, investment insights, top posts. 触发词:雪球、股票、投资、xueqiu.
tianyancha-cn
企业信息查询 - 天眼查/企查查/爱企查数据查询(Bloomberg 终端中国版)