cpu-gpu-performance
Establish CPU/GPU baselines before resource-intensive operations. Use for regression detection
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/athola/nm-conserve-cpu-gpu-performanceNight Market Skill — ported from claude-night-market/conserve. For the full experience with agents, hooks, and commands, install the Claude Code plugin.
Table of Contents
- When to Use
- Required TodoWrite Items
- Step 1: Establish Current Baseline
- Step 2: Narrow the Scope
- Step 3: Instrument Before You Optimize
- Step 4: Throttle and Sequence Work
- Step 5: Log Decisions and Next Steps
- Output Expectations
CPU/GPU Performance Discipline
When To Use
- At the beginning of every session (auto-load alongside
token-conservation). - Whenever you plan to build, train, or test anything that could pin CPU cores or GPUs for more than a minute.
- Before retrying a failing command that previously consumed significant resources.
When NOT To Use
- Simple operations with no resource impact
- Quick single-file operations
Required TodoWrite Items
cpu-gpu-performance:baselinecpu-gpu-performance:scopecpu-gpu-performance:instrumentcpu-gpu-performance:throttlecpu-gpu-performance:log
Step 1: Establish Current Baseline
-
Capture current utilization:
uptimeps -eo pcpu,cmd | headnvidia-smi --query-gpu=utilization.gpu,memory.used --format=csv
Note which hosts/GPUs are already busy.
-
Record any CI/cluster budgets (time quotas, GPU hours) before launching work.
-
Set a per-task CPU minute / GPU minute budget that respects those limits.
Step 2: Narrow the Scope
- Avoid running "whole world" jobs after a small fix. Prefer diff-based
or tag-based selective testing:
pytest -k- Bazel target patterns
cargo test <module>
- Batch low-level fixes so you can validate multiple changes with a single targeted command.
- For GPU jobs, favor unit-scale smoke inputs or lower epoch counts before scheduling the full training/eval sweep.
Step 3: Instrument Before You Optimize
- Pick the right profiler/monitor:
- CPU work:
perfintel vtunecargo flamegraph- language-specific profilers
- GPU work:
nvidia-smi dmonnsysnvprof- DLProf
- framework timeline tracers
- CPU work:
- Capture kernel/ops timelines, memory footprints, and data pipeline latency so you have evidence when throttling or parallelizing.
- Record hot paths + I/O bottlenecks in notes so future reruns can jump straight to the culprit.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-athola-nm-conserve-cpu-gpu-performance": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
extract
Analyze a codebase and build a knowledge base of business logic, architecture, data flow, and engineering patterns. The foundation for gauntlet challenges and agent integration
discourse
>- Scan community discussion channels (HN, Lobsters, Reddit, tech blogs) for experience reports and opinions on a topic
synthesize
>- Merge, deduplicate, rank, and format research findings from multiple channels into a coherent report. Use after research agents return their results
workflow-monitor
Detect workflow failures and inefficient patterns, then create GitHub issues for improvement via /fix-workflow
architecture-paradigm-hexagonal
Hexagonal (Ports and Adapters) architecture isolating domain logic from infrastructure