sre-engineer
Use when defining SLIs/SLOs, managing error budgets, or building reliable systems at scale. Invoke for incident management, chaos engineering, toil reduction, capacity planning.
Why use this skill?
Enhance system reliability with the SRE Engineer skill. Automate toil, define effective SLOs, manage error budgets, and master incident response with AI.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/veeramanikandanr48/sre-engineerWhat This Skill Does
The SRE Engineer skill provides expert-level capabilities in building and maintaining highly reliable, scalable infrastructure. It acts as a senior consultant for Site Reliability Engineering practices, focusing on the intersection of software engineering and systems operations. This skill empowers users to define, measure, and improve service reliability through quantitative data, automated toil reduction, and structured incident management processes. By integrating SLO/SLI management with capacity planning and chaos engineering, it ensures that your systems remain resilient under load while maintaining velocity.
Installation
To install this skill, use the OpenClaw CLI:
clawhub install openclaw/skills/skills/veeramanikandanr48/sre-engineer
Use Cases
- Service Level Objectives: Create clear, actionable SLOs backed by SLIs that reflect actual user impact.
- Error Budget Management: Develop policies for managing error budgets and handling budget exhaustion with data-driven decision making.
- Automation of Toil: Analyze operational workflows to identify high-toil tasks and generate scripts or procedural improvements to eliminate manual intervention.
- Incident Management: Facilitate the creation of blameless postmortems and design remediation plans to prevent recurrence.
- Capacity Planning: Use historical telemetry data to forecast resource needs and prevent saturation-based outages.
- Chaos Engineering: Propose failure scenarios and design controlled experiments to test system resilience.
Example Prompts
- "Our user-facing API has been unstable lately. Help me define an SLI for latency and set a realistic SLO target based on our recent performance metrics."
- "We spend 20 hours a week manually restarting crashed services. Analyze this process and suggest an automation plan or architectural change to reduce this toil."
- "We just experienced a major database outage. Guide me through conducting a blameless postmortem and suggest specific action items to improve our recovery time objective (RTO)."
Tips & Limitations
To get the most out of this skill, ensure you provide concrete telemetry data when requesting analysis. The skill functions best when it has access to specific performance metrics, incident logs, or architectural diagrams.
Limitations: This skill is advisory and analytical; while it can generate infrastructure code (Terraform) or automation scripts, you should always review generated configurations in a staging environment before deploying to production. Do not rely on it as a substitute for human oversight during critical incident mitigation. Always prioritize safety, security, and human-in-the-loop validation for all automated operational changes.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-veeramanikandanr48-sre-engineer": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: code-execution, file-read
Related Skills
earnings-calendar
This skill retrieves upcoming earnings announcements for US stocks using the Financial Modeling Prep (FMP) API. Use this when the user requests earnings calendar data, wants to know which companies are reporting earnings in the upcoming week, or needs a weekly earnings review. The skill focuses on mid-cap and above companies (over $2B market cap) that have significant market impact, organizing the data by date and timing in a clean markdown table format. Supports multiple environments (CLI, Desktop, Web) with flexible API key management.
better-auth
Self-hosted auth for TypeScript/Cloudflare Workers with social auth, 2FA, passkeys, organizations, RBAC, and 15+ plugins. Requires Drizzle ORM or Kysely for D1 (no direct adapter). Self-hosted alternative to Clerk/Auth.js. Use when: self-hosting auth on D1, building OAuth provider, multi-tenant SaaS, or troubleshooting D1 adapter errors, session caching, rate limits, Expo crashes, additionalFields bugs.
dividend-growth-pullback-screener
Use this skill to find high-quality dividend growth stocks (12%+ annual dividend growth, 1.5%+ yield) that are experiencing temporary pullbacks, identified by RSI oversold conditions (RSI ≤40). This skill combines fundamental dividend analysis with technical timing indicators to identify buying opportunities in strong dividend growers during short-term weakness.
cli-developer
Use when building CLI tools, implementing argument parsing, or adding interactive prompts. Invoke for CLI design, argument parsing, interactive prompts, progress indicators, shell completions.
options-strategy-advisor
Options trading strategy analysis and simulation tool. Provides theoretical pricing using Black-Scholes model, Greeks calculation, strategy P/L simulation, and risk management guidance. Use when user requests options strategy analysis, covered calls, protective puts, spreads, iron condors, earnings plays, or options risk management. Includes volatility analysis, position sizing, and earnings-based strategy recommendations. Educational focus with practical trade simulation.