observability-designer
Observability Designer (POWERFUL)
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alirezarezvani/observability-designerWhat This Skill Does
The Observability Designer skill is a high-level engineering tool designed to help developers and SREs architect comprehensive monitoring and diagnostic strategies for production environments. It bridges the gap between raw data collection and actionable operational intelligence by implementing industry-standard frameworks like the Golden Signals (Latency, Traffic, Errors, Saturation), the RED method for microservices, and the USE method for infrastructure resource analysis. The skill assists in defining precise SLIs (Service Level Indicators) and SLOs (Service Level Objectives), calculating error budgets, and configuring multi-window burn rate alerts to ensure systems remain performant while managing reliability expectations.
Installation
To install this skill, run the following command in your OpenClaw terminal:
clawhub install openclaw/skills/skills/alirezarezvani/observability-designer
Use Cases
- Production Readiness Reviews: Automatically generate a monitoring checklist for new microservices to ensure they meet internal reliability standards.
- Alert Noise Reduction: Analyze current alerting configurations and suggest optimizations to minimize false positives and fatigue.
- Dashboard Prototyping: Design high-fidelity dashboard layouts that prioritize critical user journeys and operational visibility based on your specific architecture.
- Incident Response Strategy: Map out tracing and logging requirements to accelerate root cause analysis for distributed system failures.
Example Prompts
- "Design an SLO framework for our order processing service, including suggested SLIs for latency and success rate, along with a strategy for burn rate alerting."
- "Review my current dashboard layout; I have 20 panels and I'm overwhelmed. Apply the 7±2 rule and suggest a hierarchy for my SRE team to improve cognitive load."
- "I am struggling with high cardinality in my logs. Design a structured logging and sampling strategy that reduces costs without losing visibility into critical error flows."
Tips & Limitations
- Focus on Business Value: When designing SLIs, always start with the user experience rather than raw infrastructure metrics.
- Iterate on Dashboards: Treat your dashboard as code. Start simple and add complexity only when a specific operational need is identified through incident data.
- Limitations: The skill provides architectural design and strategy. It cannot directly modify your infrastructure or external monitoring tool APIs unless connected to relevant automation interfaces. It is most effective when provided with a clear description of your current technology stack (e.g., Kubernetes, CloudWatch, Datadog, Prometheus).
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-alirezarezvani-observability-designer": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Related Skills
intl-expansion
International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.
marketing-strategy-pmm
Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.
paid-ads
When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.
qms-audit-expert
ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.
code-reviewer
Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.