Skill Quality Auditor

Overview

Evaluate a target skill with a consistent rubric and return a clear pass/fail-style verdict plus a multi-dimensional review. Prefer the bundled script for the first pass, then turn the raw findings into a concise human-readable assessment.

Workflow

Identify the target skill folder.
Run scripts/evaluate_skill.py <path-to-skill>.
Read the report and group findings into:
- final verdict
- strengths
- weaknesses
- critical blockers
- recommended fixes
If the script reports missing context or borderline results, inspect the target skill's SKILL.md and any referenced resources before writing the final judgment.
Keep the final answer decisive: say whether the skill is currently qualified, conditionally qualified, or not qualified.

Rubric

Score the skill across these dimensions:

structure: required files, frontmatter validity, naming, obvious TODO placeholders
triggering: whether description clearly explains what the skill does and when to use it
workflow: whether the body gives actionable steps instead of vague guidance
progressive_disclosure: whether detailed material is kept in scripts or references instead of bloating SKILL.md
resources: whether scripts, references, and assets are included only when useful and are mentioned in the body
examples_and_outputs: whether the skill helps the agent understand expected usage or output shape
maintainability: clarity, concision, stale metadata checks, and overall ease of iteration

Use references/rubric.md when you need the detailed scoring logic and interpretation rules.

Verdict Rules

Use these labels:

Qualified: no critical blockers and score is strong enough for immediate use
Borderline: usable but needs material fixes soon
Not Qualified: missing required structure or too weak to trust in repeated use

Treat these as critical blockers:

missing SKILL.md
invalid or missing YAML frontmatter
missing name or description
unresolved template placeholders such as TODO
description too weak to trigger reliably
instructions too incomplete to execute the core task safely

Output Shape

Prefer this response shape:

Verdict

State Qualified, Borderline, or Not Qualified in the first sentence and explain the main reason.

Score Summary

Include the total score and 3-5 highest-signal dimension notes.

What Works Well

List concrete strengths tied to files or sections.

What Needs Work

List concrete weaknesses tied to files or sections.

Next Fixes

List the smallest set of changes most likely to move the skill to Qualified.

Script

Run:

python3 scripts/evaluate_skill.py /absolute/path/to/skill

Optional JSON mode:

python3 scripts/evaluate_skill.py /absolute/path/to/skill --json

skill-quality-auditor

Install via CLI (Recommended)