Official Verified

cpu-gpu-performance

Establish CPU/GPU baselines before resource-intensive operations. Use for regression detection

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/athola/nm-conserve-cpu-gpu-performance

Night Market Skill — ported from claude-night-market/conserve. For the full experience with agents, hooks, and commands, install the Claude Code plugin.

When to Use
Required TodoWrite Items
Step 1: Establish Current Baseline
Step 2: Narrow the Scope
Step 3: Instrument Before You Optimize
Step 4: Throttle and Sequence Work
Step 5: Log Decisions and Next Steps
Output Expectations

CPU/GPU Performance Discipline

When To Use

At the beginning of every session (auto-load alongside token-conservation).
Whenever you plan to build, train, or test anything that could pin CPU cores or GPUs for more than a minute.
Before retrying a failing command that previously consumed significant resources.

When NOT To Use

Simple operations with no resource impact
Quick single-file operations

Required TodoWrite Items

cpu-gpu-performance:baseline
cpu-gpu-performance:scope
cpu-gpu-performance:instrument
cpu-gpu-performance:throttle
cpu-gpu-performance:log

Step 1: Establish Current Baseline

Capture current utilization:
- uptime
- ps -eo pcpu,cmd | head
- nvidia-smi --query-gpu=utilization.gpu,memory.used --format=csv
Note which hosts/GPUs are already busy.
Record any CI/cluster budgets (time quotas, GPU hours) before launching work.
Set a per-task CPU minute / GPU minute budget that respects those limits.

Step 2: Narrow the Scope

Avoid running "whole world" jobs after a small fix. Prefer diff-based or tag-based selective testing:
- pytest -k
- Bazel target patterns
- cargo test <module>
Batch low-level fixes so you can validate multiple changes with a single targeted command.
For GPU jobs, favor unit-scale smoke inputs or lower epoch counts before scheduling the full training/eval sweep.

Step 3: Instrument Before You Optimize

Pick the right profiler/monitor:
- CPU work:
  - perf
  - intel vtune
  - cargo flamegraph
  - language-specific profilers
- GPU work:
  - nvidia-smi dmon
  - nsys
  - nvprof
  - DLProf
  - framework timeline tracers
Capture kernel/ops timelines, memory footprints, and data pipeline latency so you have evidence when throttling or parallelizing.
Record hot paths + I/O bottlenecks in notes so future reruns can jump straight to the culprit.

Read Full Documentation on GitHub

Metadata

Author@athola

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-athola-nm-conserve-cpu-gpu-performance": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

extract

Analyze a codebase and build a knowledge base of business logic, architecture, data flow, and engineering patterns. The foundation for gauntlet challenges and agent integration

athola 4473

discourse

>- Scan community discussion channels (HN, Lobsters, Reddit, tech blogs) for experience reports and opinions on a topic

athola 4473

synthesize

>- Merge, deduplicate, rank, and format research findings from multiple channels into a coherent report. Use after research agents return their results

athola 4473

workflow-monitor

Detect workflow failures and inefficient patterns, then create GitHub issues for improvement via /fix-workflow

athola 4473

architecture-paradigm-hexagonal

Hexagonal (Ports and Adapters) architecture isolating domain logic from infrastructure

athola 4473

cpu-gpu-performance

Install via CLI (Recommended)

Table of Contents

CPU/GPU Performance Discipline

When To Use

When NOT To Use

Required TodoWrite Items

Step 1: Establish Current Baseline

Step 2: Narrow the Scope

Step 3: Instrument Before You Optimize

Metadata

Related Skills

extract

discourse

synthesize

workflow-monitor

architecture-paradigm-hexagonal