self-improvement
Generic agent self-improvement skill built on OpenClaw-RL research (arxiv.org/abs/2603.10165). Captures evaluative signals (+1/-1) and directive hints from any user feedback, generates improvement suggestions.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/adelpro/agent-self-improvementSelf-Improvement Skill
Enables any agent to learn and improve from user feedback using PRM-style evaluation.
What It Does
- Capture Feedback: Store user responses to agent outputs
- Evaluate: Extract score (+1 positive, -1 correction)
- Analyze: Find patterns in directive hints
- Suggest: Generate actionable improvements
Concept (OpenClaw-RL Inspired)
From OpenClaw-RL paper:
"Next-state signals encode both evaluative and directive information about the preceding action."
Two signal types:
- Evaluative: Did the output work? (binary score)
- Directive: How should it be different? (hints)
Usage
Capture Feedback
SKILL:self-improvement --feedback "Great!" --job daily-report
SKILL:self-improvement --feedback "Add more stats" --job daily-report
Get Stats
SKILL:self-improvement --stats daily-report
Generate Improvements
SKILL:self-improvement --improve daily-report
Arguments
| Arg | Description | Example |
|---|---|---|
--job | Task/job name | daily-digest, weekly-recap |
--feedback | User response | "Thanks!", "Add more X" |
--score | Manual score override | 1, 0, -1 |
--stats | Show feedback stats | daily-digest |
--improve | Generate improvements | daily-digest |
--weekly | Weekly summary | (flag) |
Storage
memory/learning/agent-feedback.json:
{
"jobs": {
"daily-digest": {
"evaluations": [
{ "date": "2026-03-14", "score": 1, "hint": null },
{ "date": "2026-03-13", "score": -1, "hint": "add weekly star comparison" }
],
"improvements": [
{ "date": "2026-03-14", "suggestion": "Add weekly star delta", "implemented": false }
]
}
}
}
Cron Integration
Daily: Capture Feedback (9:30 AM)
{
"id": "feedback-capture",
"schedule": "30 9 * * *",
"message": "SKILL:self-improvement --job daily-digest"
}
Daily: Generate Improvements (10 AM)
{
"id": "improvement-suggestions",
"schedule": "0 10 * * *",
"message": "SKILL:self-improvement --improve daily-digest → Telegram"
}
Weekly: Summary (Saturday 9 AM)
{
"id": "weekly-summary",
"schedule": "0 9 * * 6",
"message": "SKILL:self-improvement --improve all --weekly"
}
Example Workflow
- User receives daily digest
- User responds: "Good! But can you show star trends?"
- Feedback captured: score=1, hint="show star trends"
- Next day, improvements generated:
- "Add star trend comparison (last 7 days)"
- "User满意度: 75%"
- Agent auto-updates prompt
Improvement Suggestion Format
📈 Improvement Suggestions - {job}
Stats: 8 evaluations, avg score: 0.75
Top Hints:
1. "add weekly star changes" (2x)
2. "use table format" (1x)
Suggested Actions:
• Add 7-day star delta to GitHub section
• Use table-image-generator for stats
Status: 1 improvement pending
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-adelpro-agent-self-improvement": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
openclaw-backup-automation
Automated backup for OpenClaw instances. Backs up agents, skills, cron jobs, and memory. Supports local tar archives. Credentials, periodic scheduling, and git sync are OPT-IN only.
private-web-search-searchxng
Self-hosted private web search using SearXNG. Use when privacy is important, external APIs are blocked, you need search without tracking, or want to avoid paid search APIs.
openclaw-continuous-learning
Instinct-based learning system for OpenClaw. Analyzes sessions, detects patterns, creates atomic learnings with confidence scoring, and suggests optimizations for self-evolution. Works alongside agent-self-improvement for complete learning: internal session analysis + external user feedback. Use when: you want your AI agent to learn from its own behavior, improve over time, discover optimization opportunities, or build a self-improving automation system. Don't use when: static agent behavior is preferred.
council-of-wisdom
A multi-agent deliberation hub with 3 core agents and extensible extended agents. Can call user workspace skills when needed.
openclaw-daily-tips
Daily AI agent optimization tips, tricks and self-improvement strategies. Learn cost-saving, speed, memory and automation best practices from the OpenClaw community. Use when: you want daily tips to optimize your AI agent, reduce costs, improve performance, or learn automation workflows. Don't use when: you need immediate config changes - use openclaw-agent-optimize for deep audits.