Official Verified developer tools Safety 5/5

increment-quality-judge-v2

AI-powered quality assessment using LLM-as-Judge pattern with BMAD risk scoring and formal gate decisions. Use for evaluating increment specs, assessing task completeness, or making quality gate decisions (PASS/CONCERNS/FAIL). Chain-of-thought reasoning ensures transparent evaluation.

Why use this skill?

Optimize your workflow with Increment Quality Judge v2. Automated LLM-as-Judge assessment, risk scoring, and quality gates for your specs and task planning.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/anton-abyzov/sw-increment-quality-judge-v2

Download Source Code (.zip)

What This Skill Does

The increment-quality-judge-v2 is a sophisticated, AI-driven assessment tool designed to enforce rigorous quality standards in your development workflow. It leverages the 'LLM-as-Judge' pattern, a gold-standard approach in machine learning evaluation that mimics expert peer review through chain-of-thought processing. By systematically breaking down your specifications, task plans, and implementation artifacts, this skill identifies structural, logical, and technical risks using a BMAD (Basis, Magnitude, Assessment, Direction) risk-scoring framework. It evaluates your work against seven critical quality dimensions, providing a granular verdict that culminates in a formal quality gate decision (PASS, CONCERNS, or FAIL). Unlike manual review, which is prone to human fatigue and subjective bias, this skill delivers consistent, scalable, and evidence-based critiques in near real-time, ensuring that only high-quality increments proceed through your pipeline.

Installation

To integrate this skill into your local environment, ensure you have the OpenClaw SDK installed. Execute the following command in your terminal:

clawhub install openclaw/skills/skills/anton-abyzov/sw-increment-quality-judge-v2

Once installed, the skill becomes available via your CLI and internal slash command interfaces. No further agent configuration is required.

Use Cases

Spec Validation: Ensure new increment specifications are exhaustive, unambiguous, and technically feasible before coding begins.
Task Completeness: Audit task lists to ensure all requirements are mapped to specific, actionable, and testable units of work.
Quality Gate Enforcement: Automate the assessment of pre-commit or pre-merge artifacts to prevent technical debt from entering the codebase.
Technical Due Diligence: Use the risk-scoring mechanism to evaluate the complexity and potential failure modes of an proposed architectural change.

Example Prompts

"Run a quality check on the current spec for increment 0042 to ensure it meets our documentation standards."
"/sw:qa 0089 --pre - Perform a deep dive evaluation on the task list and provide a risk report."
"Evaluate the proposed implementation plan for the new auth module; look specifically for edge-case coverage and security gaps."

Tips & Limitations

Context is Key: Always provide the full scope of your increment; the skill's accuracy improves with the richness of your input metadata.
Not an Agent: Do not attempt to spawn this as an autonomous agent. It is a utility skill. Using it via the provided CLI commands ensures the highest performance and stability.
Iterative Refinement: When the skill returns a CONCERNS verdict, treat the actionable recommendations as a checklist. You can re-run the assessment after implementing changes to verify the fix.
Human-in-the-loop: While highly accurate, the judge acts as an assistant. High-criticality decisions should still be reviewed by senior personnel where organizational policy dictates.

Read Full Documentation on GitHub

Metadata

Author@anton-abyzov

Stars1054

Updated2026-02-16

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-anton-abyzov-sw-increment-quality-judge-v2": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#qa#evaluation#llm-as-judge#quality-assurance#developer-tools

Safety Score: 5/5

Flags: file-read

Related Skills

network-engineer

Cloud network architect for VPC design, service mesh, zero-trust networking, load balancers, and CDN optimization. Use for network troubleshooting or connectivity issues.

anton-abyzov 1100

jira-multi-project-mapper

Expert in mapping SpecWeave specs to multiple JIRA projects with intelligent project detection and cross-project coordination. Use when syncing to multiple JIRA projects (project-per-team, component-based), or managing bidirectional sync across team boundaries.

anton-abyzov 1100

helm-chart-scaffolding

Design, organize, and manage Helm charts for templating and packaging Kubernetes applications with reusable configurations. Use when creating Helm charts, packaging Kubernetes applications, or implementing templated deployments.

anton-abyzov 1100

performance-optimization

React Native performance with Hermes V1, FlashList, expo-image v2, concurrent rendering. Use for slow app, memory leaks, or FPS issues.

anton-abyzov 1100

release-strategy-advisor

Release strategy advisor - detects brownfield patterns (tags, CI/CD, changelogs), recommends versioning strategy based on architecture. Creates release-strategy.md.

anton-abyzov 1100