increment-quality-judge-v2
AI-powered quality assessment using LLM-as-Judge pattern with BMAD risk scoring and formal gate decisions. Use for evaluating increment specs, assessing task completeness, or making quality gate decisions (PASS/CONCERNS/FAIL). Chain-of-thought reasoning ensures transparent evaluation.
Why use this skill?
Optimize your workflow with Increment Quality Judge v2. Automated LLM-as-Judge assessment, risk scoring, and quality gates for your specs and task planning.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/anton-abyzov/sw-increment-quality-judge-v2What This Skill Does
The increment-quality-judge-v2 is a sophisticated, AI-driven assessment tool designed to enforce rigorous quality standards in your development workflow. It leverages the 'LLM-as-Judge' pattern, a gold-standard approach in machine learning evaluation that mimics expert peer review through chain-of-thought processing. By systematically breaking down your specifications, task plans, and implementation artifacts, this skill identifies structural, logical, and technical risks using a BMAD (Basis, Magnitude, Assessment, Direction) risk-scoring framework. It evaluates your work against seven critical quality dimensions, providing a granular verdict that culminates in a formal quality gate decision (PASS, CONCERNS, or FAIL). Unlike manual review, which is prone to human fatigue and subjective bias, this skill delivers consistent, scalable, and evidence-based critiques in near real-time, ensuring that only high-quality increments proceed through your pipeline.
Installation
To integrate this skill into your local environment, ensure you have the OpenClaw SDK installed. Execute the following command in your terminal:
clawhub install openclaw/skills/skills/anton-abyzov/sw-increment-quality-judge-v2
Once installed, the skill becomes available via your CLI and internal slash command interfaces. No further agent configuration is required.
Use Cases
- Spec Validation: Ensure new increment specifications are exhaustive, unambiguous, and technically feasible before coding begins.
- Task Completeness: Audit task lists to ensure all requirements are mapped to specific, actionable, and testable units of work.
- Quality Gate Enforcement: Automate the assessment of pre-commit or pre-merge artifacts to prevent technical debt from entering the codebase.
- Technical Due Diligence: Use the risk-scoring mechanism to evaluate the complexity and potential failure modes of an proposed architectural change.
Example Prompts
- "Run a quality check on the current spec for increment 0042 to ensure it meets our documentation standards."
- "/sw:qa 0089 --pre - Perform a deep dive evaluation on the task list and provide a risk report."
- "Evaluate the proposed implementation plan for the new auth module; look specifically for edge-case coverage and security gaps."
Tips & Limitations
- Context is Key: Always provide the full scope of your increment; the skill's accuracy improves with the richness of your input metadata.
- Not an Agent: Do not attempt to spawn this as an autonomous agent. It is a utility skill. Using it via the provided CLI commands ensures the highest performance and stability.
- Iterative Refinement: When the skill returns a CONCERNS verdict, treat the actionable recommendations as a checklist. You can re-run the assessment after implementing changes to verify the fix.
- Human-in-the-loop: While highly accurate, the judge acts as an assistant. High-criticality decisions should still be reviewed by senior personnel where organizational policy dictates.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-anton-abyzov-sw-increment-quality-judge-v2": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read
Related Skills
network-engineer
Cloud network architect for VPC design, service mesh, zero-trust networking, load balancers, and CDN optimization. Use for network troubleshooting or connectivity issues.
jira-multi-project-mapper
Expert in mapping SpecWeave specs to multiple JIRA projects with intelligent project detection and cross-project coordination. Use when syncing to multiple JIRA projects (project-per-team, component-based), or managing bidirectional sync across team boundaries.
helm-chart-scaffolding
Design, organize, and manage Helm charts for templating and packaging Kubernetes applications with reusable configurations. Use when creating Helm charts, packaging Kubernetes applications, or implementing templated deployments.
performance-optimization
React Native performance with Hermes V1, FlashList, expo-image v2, concurrent rendering. Use for slow app, memory leaks, or FPS issues.
release-strategy-advisor
Release strategy advisor - detects brownfield patterns (tags, CI/CD, changelogs), recommends versioning strategy based on architecture. Creates release-strategy.md.