ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified developer tools Safety 5/5

observability-designer

Observability Designer (POWERFUL)

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alirezarezvani/observability-designer
Or

What This Skill Does

The Observability Designer skill is a high-level engineering tool designed to help developers and SREs architect comprehensive monitoring and diagnostic strategies for production environments. It bridges the gap between raw data collection and actionable operational intelligence by implementing industry-standard frameworks like the Golden Signals (Latency, Traffic, Errors, Saturation), the RED method for microservices, and the USE method for infrastructure resource analysis. The skill assists in defining precise SLIs (Service Level Indicators) and SLOs (Service Level Objectives), calculating error budgets, and configuring multi-window burn rate alerts to ensure systems remain performant while managing reliability expectations.

Installation

To install this skill, run the following command in your OpenClaw terminal: clawhub install openclaw/skills/skills/alirezarezvani/observability-designer

Use Cases

  • Production Readiness Reviews: Automatically generate a monitoring checklist for new microservices to ensure they meet internal reliability standards.
  • Alert Noise Reduction: Analyze current alerting configurations and suggest optimizations to minimize false positives and fatigue.
  • Dashboard Prototyping: Design high-fidelity dashboard layouts that prioritize critical user journeys and operational visibility based on your specific architecture.
  • Incident Response Strategy: Map out tracing and logging requirements to accelerate root cause analysis for distributed system failures.

Example Prompts

  1. "Design an SLO framework for our order processing service, including suggested SLIs for latency and success rate, along with a strategy for burn rate alerting."
  2. "Review my current dashboard layout; I have 20 panels and I'm overwhelmed. Apply the 7±2 rule and suggest a hierarchy for my SRE team to improve cognitive load."
  3. "I am struggling with high cardinality in my logs. Design a structured logging and sampling strategy that reduces costs without losing visibility into critical error flows."

Tips & Limitations

  • Focus on Business Value: When designing SLIs, always start with the user experience rather than raw infrastructure metrics.
  • Iterate on Dashboards: Treat your dashboard as code. Start simple and add complexity only when a specific operational need is identified through incident data.
  • Limitations: The skill provides architectural design and strategy. It cannot directly modify your infrastructure or external monitoring tool APIs unless connected to relevant automation interfaces. It is most effective when provided with a clear description of your current technology stack (e.g., Kubernetes, CloudWatch, Datadog, Prometheus).

Metadata

Stars4473
Views3
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alirezarezvani-observability-designer": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#observability#sre#devops#monitoring#reliability
Safety Score: 5/5

Related Skills

intl-expansion

International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.

alirezarezvani 4473

marketing-strategy-pmm

Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.

alirezarezvani 4473

paid-ads

When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.

alirezarezvani 4473

qms-audit-expert

ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.

alirezarezvani 4473

code-reviewer

Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.

alirezarezvani 4473