ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

devops-insight

This skill should be used when the user asks to "analyze incidents", "troubleshoot production issues", "investigate alerts", "create tickets", "root cause analysis", "check monitoring", or discusses DevOps/SRE automation, incident management, or observability integration.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/cafechen/devops-insight
Or

DevOps Insight - Intelligent DevOps Incident Management

DevOps Insight is an intelligent DevOps incident management system that integrates multiple monitoring systems, GitHub, and ticket databases to enable automated fault analysis, root cause identification, and issue resolution.

System Architecture

Core Components

  1. Monitoring Data Source Integration (via MCP)

    • Kubernetes: Cluster status, Pod logs, events
    • PostgreSQL: Database performance metrics
    • Redis: Cache status and performance
    • Neo4j: Graph database monitoring
    • Elasticsearch: Log platform
    • Metrics: General metrics collection
    • APM (Skywalking): Application performance monitoring
  2. Code Management

    • GitHub integration (via gitnexus Nexus-skill)
    • Code review and commits
    • Automated fix commits
  3. EvoMap Integration

    • Capsule creation and publishing
    • Gene + Capsule bundle publishing
    • Automated quality validation
    • Network reputation tracking
  4. AI Agent

    • Problem clue identification via LLM
    • Root cause analysis
    • Code review and fix suggestions
    • Index construction decisions

Workflow

1. Monitoring Data Collection

When receiving an alert or analyzing an issue:

# Retrieve Kubernetes monitoring data via MCP
# Assumes MCP server connections to each monitoring system are configured

Steps:

  • Retrieve Pod status, logs, and events from Kubernetes
  • Retrieve application performance traces from APM (Skywalking)
  • Retrieve relevant logs from Elasticsearch
  • Retrieve performance metrics from the Metrics system
  • Retrieve status information from databases (PostgreSQL/Redis/Neo4j)

2. Intelligent Analysis and Root Cause Identification

Perform multi-dimensional analysis using Claude:

Analysis Dimensions:

  1. Problem Clue Identification

    • Analyze alert information and monitoring data
    • Identify anomalous patterns and trends
    • Correlate with historical events
  2. Root Cause Analysis

    • Code level: Recent code changes
    • Configuration level: Configuration changes and environment differences
    • Infrastructure level: Resource usage and network issues
    • Dependency level: Third-party services and databases
  3. Impact Assessment

    • Affected services and users
    • Business impact severity
    • Urgency determination

3.

Metadata

Author@cafechen
Stars4097
Views0
Updated2026-04-14
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-cafechen-devops-insight": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.