github-actions-step-flake-audit
Detect flaky GitHub Actions job steps by finding mixed success/failure conclusions across runs.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/daniellummis/github-actions-step-flake-auditWhat This Skill Does
The github-actions-step-flake-audit skill is a sophisticated diagnostic tool designed to maintain CI/CD health by identifying non-deterministic behaviors in GitHub Actions. By analyzing historical JSON exports of workflow runs, the skill aggregates step outcomes across repository, workflow, job, and step identifiers. It calculates the failure rate for every unique step and applies configurable thresholds to flag 'flaky' behavior—specifically, steps that oscillate between success and failure across multiple executions. It outputs actionable data in either human-readable text or structured JSON, allowing for both manual investigation and programmatic CI gating.
Installation
To install this skill, use the OpenClaw CLI within your terminal:
clawhub install openclaw/skills/skills/daniellummis/github-actions-step-flake-audit
Ensure your project contains the necessary GitHub Actions run artifacts, which can be generated using the command: gh run view <run-id> --json databaseId,workflowName,headBranch,headSha,url,repository,jobs > artifacts/github-actions/run-<run-id>.json.
Use Cases
- CI Maintenance: Automatically break build pipelines when critical flakes (exceeding the 40% failure rate) are detected to prevent 'flaky noise' from desensitizing the engineering team.
- Regression Hunting: Compare flakiness before and after dependency updates by running the audit across different time-windowed artifacts.
- Post-Mortem Analysis: Audit why a specific suite is failing intermittently by isolating the most unstable steps across hundreds of historical workflow runs.
- Performance Optimization: Identify which slow-running steps are also unreliable, prioritizing them for refactoring or migration to more stable runners.
Example Prompts
- 'Run the step flake audit on all artifacts in my folder and show me the top 10 most unstable steps in JSON format.'
- 'Check if any GitHub Actions steps have a failure rate higher than 20% in the last 50 runs and print a summary.'
- 'Audit my CI artifacts and enable the fail gate; if any step is critical, stop the process so I can investigate.'
Tips & Limitations
- Data Quality: The accuracy of this skill is entirely dependent on the quality of your
gh run viewexports. Ensure you are exporting full JSON data rather than truncated views. - Threshold Tuning: Use
MIN_OCCURRENCESto avoid flagging steps that have only run once or twice; a minimum of 5-10 runs is recommended for statistical significance. - Scope: While excellent at detecting intermittent failures, it does not diagnose the root cause (e.g., resource contention, network timeouts, or race conditions). It is a detection tool, not a debugger.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-daniellummis-github-actions-step-flake-audit": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, code-execution
Related Skills
github-actions-recovery-latency-audit
Measure GitHub Actions failure recovery latency and unresolved incident age by workflow group.
github-actions-cache-hardening-audit
Audit GitHub Actions workflow cache usage for poisoning, keying, and secret-path risks.
render-env-guard
Preflight-check Render service environment variables before deploys; catches missing keys and placeholder/template values that commonly break production rollouts.
github-actions-trigger-health-audit
Audit GitHub Actions run health by trigger event and workflow so flaky or noisy automation sources are easy to prioritize.
github-actions-run-gap-audit
Detect GitHub Actions workflow groups that stopped running on their normal cadence using median run intervals and current inactivity gap.