rocm_vllm_deployment
Production-ready vLLM deployment on AMD ROCm GPUs. Combines environment auto-check, model parameter detection, Docker Compose deployment, health verification, and functional testing with comprehensive logging and security best practices.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alexhegit/rocm-vllm-deployment
ROCm vLLM Deployment Skill
Production-ready automation for deploying vLLM inference services on AMD ROCm GPUs using Docker Compose.
Features
- Environment Auto-Check - Detects and repairs missing dependencies
- Model Parameter Detection - Auto-reads config.json for optimal settings
- VRAM Estimation - Calculates memory requirements before deployment
- Secure Token Handling - Never writes tokens to compose files
- Structured Output - All logs and test results saved per-model
- Deployment Reports - Human-readable summary for each deployment
- Health Verification - Automated health checks and functional tests
- Troubleshooting Guide - Common issues and solutions
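The token-handling guarantee in the list above can be spot-checked from the shell. This is a minimal sketch and not part of the skill: the function name `compose_is_token_free` is hypothetical, and the pattern assumes a Hugging Face token appears as `hf_` followed by at least 20 alphanumeric characters. A reference such as `${HF_TOKEN}` passes, because no literal value is present in the file.

```shell
# Hypothetical helper (not shipped with the skill): succeed only if the
# given compose file contains no literal Hugging Face token.
compose_is_token_free() {
  file="$1"
  # Refuse to report "clean" for a file we could not read.
  [ -r "$file" ] || { echo "cannot read $file" >&2; return 2; }
  # Assumption: tokens look like hf_ plus 20 or more alphanumeric characters.
  if grep -Eq 'hf_[A-Za-z0-9]{20,}' "$file"; then
    echo "literal token found in $file" >&2
    return 1
  fi
  return 0
}
```

Running it against a generated compose file before committing or sharing the file gives a quick signal; the regular expression is a heuristic and may need adjusting for other token formats.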
Environment Prerequisites
Recommended (for production): Add to ~/.bash_profile:
# HuggingFace authentication token (required for gated models)
export HF_TOKEN="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"
# Model cache directory (optional)
export HF_HOME="$HOME/models"
# Apply changes
source ~/.bash_profile
Not required for testing: The skill will proceed without these set:
- HF_TOKEN: Optional. Public models work without it; gated models fail at download with a clear error.
- HF_HOME: Optional. Defaults to /root/.cache/huggingface/hub.
Environment Variable Detection
Priority Order:
- Explicit parameter (highest): provided in the task/request (e.g., hf_token: "xxx")
- Environment variable: already set in the shell or inherited from the parent process
- ~/.bash_profile: sourced to load variables
- Default value (lowest): HF_HOME defaults to /root/.cache/huggingface/hub
| Variable | Required | If Missing |
|---|---|---|
| HF_TOKEN | Conditional | Continue without token (public models work; gated models fail at download with a clear error) |
| HF_HOME | No | Warn and use the default, /root/.cache/huggingface/hub |
Philosophy: Fail fast on configuration errors; let authentication errors surface at download time.
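The priority order for HF_HOME can be sketched as a small shell function. This is an illustration under stated assumptions, not the skill's actual implementation: the function name `resolve_hf_home` and the `PROFILE_FILE` override are hypothetical, introduced only so the lookup can be exercised without touching a real ~/.bash_profile.

```shell
# Hypothetical sketch of the lookup order: explicit parameter, then the
# environment, then ~/.bash_profile, then the documented default.
# The body runs in a subshell, so sourcing the profile cannot alter the caller.
resolve_hf_home() (
  explicit="${1:-}"
  # PROFILE_FILE is an assumed override for testing, not a skill setting.
  profile="${PROFILE_FILE:-${HOME:-}/.bash_profile}"
  default="/root/.cache/huggingface/hub"

  if [ -n "$explicit" ]; then
    # 1. Explicit parameter wins.
    printf '%s\n' "$explicit"
  elif [ -n "${HF_HOME:-}" ]; then
    # 2. Value already present in the environment.
    printf '%s\n' "$HF_HOME"
  elif [ -f "$profile" ] &&
       from_profile="$( . "$profile" >/dev/null 2>&1; printf '%s' "${HF_HOME:-}" )" &&
       [ -n "$from_profile" ]; then
    # 3. Value defined by the profile file.
    printf '%s\n' "$from_profile"
  else
    # 4. Fall back to the default, with a warning on stderr.
    echo "warning: HF_HOME not set, using $default" >&2
    printf '%s\n' "$default"
  fi
)
```

For example, `resolve_hf_home /data/models` prints the explicit path regardless of what the environment or profile contains, while a call with no argument walks the remaining three steps.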
Helper Scripts
Location: <skill-dir>/scripts/
check-env.sh
Validate and load environment variables before deployment.
Usage:
# Basic check (HF_TOKEN optional, HF_HOME optional with default)
./scripts/check-env.sh
# Strict mode (HF_HOME required, fails if not set)
./scripts/check-env.sh --strict
# Quiet mode (minimal output, for automation)
./scripts/check-env.sh --quiet
# Test with environment variables
HF_TOKEN="hf_xxx" HF_HOME="/models" ./scripts/check-env.sh
Exit Codes:
| Code | Meaning |
|---|---|
| 0 | Environment check completed (variables loaded or defaulted) |
| 2 | Critical error (e.g., cannot source ~/.bash_profile) |
Note: This script is optional. You can also directly run source ~/.bash_profile.
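A caller can branch on the exit codes listed above. The wrapper below is a hedged sketch: `run_env_check` is a hypothetical name, the script path is passed as an argument so nothing about the install location is assumed, and any code other than 0 or 2 is treated as unexpected because the table documents only those two.

```shell
# Hypothetical wrapper: run the environment check in quiet mode and
# interpret the documented exit codes (0 and 2).
run_env_check() {
  script="${1:-./scripts/check-env.sh}"
  rc=0
  # Capture the exit code without aborting under `set -e`.
  "$script" --quiet || rc=$?
  case "$rc" in
    0) echo "environment check completed" ;;
    2) echo "critical environment error" >&2 ;;
    *) echo "unexpected exit code: $rc" >&2 ;;
  esac
  return "$rc"
}
```

In a deployment script this might be used as `run_env_check ./scripts/check-env.sh || exit 1`, so that deployment stops before any container is started.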
generate-report.sh
Generate a human-readable deployment report after a successful deployment.
Usage:
./scripts/generate-report.sh <model-id> <container-name> <port> <status> [model-load-time] [memory-used]
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-alexhegit-rocm-vllm-deployment": {
      "enabled": true,
      "auto_update": true
    }
  }
}