prompt-guard
Advanced prompt injection defense system for Clawdbot with HiveFence network integration. Protects against direct/indirect injection attacks in group chats with multi-language detection (EN/KO/JA/ZH), severity scoring, automatic logging, and configurable security policies. Connects to the distributed HiveFence threat intelligence network for collective defense.
Why use this skill?
Secure your OpenClaw agents with prompt-guard. Advanced injection defense, multi-language support, and real-time HiveFence threat intelligence integration.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bobdevibecoder/bobagent-prompt-guardWhat This Skill Does
prompt-guard v2.6.0 is the definitive security layer for OpenClaw agents. It serves as an intelligent gatekeeper, intercepting incoming messages to identify and neutralize prompt injection attempts before they reach the core agent logic. By leveraging the new HiveFence distributed threat intelligence network, the skill performs real-time analysis across multiple languages (English, Korean, Japanese, and Chinese), assigning severity scores to every interaction. Whether dealing with direct attacks like role overrides and jailbreak attempts, or subtle indirect attacks involving Base64 encoding and Unicode homoglyphs, prompt-guard ensures your agent remains resilient. It effectively restricts sensitive operations such as file editing or shell command execution to authorized users only, effectively hardening agents deployed in group chat environments.
Installation
To integrate this defense system, run the following command in your terminal:
clawhub install openclaw/skills/skills/bobdevibecoder/bobagent-prompt-guard
Once installed, ensure your agent configuration includes the necessary hooks to pass incoming messages through the guard before processing. Refer to the configuration documentation to set your desired security policy levels (SAFE to CRITICAL).
Use Cases
- Enterprise Security: Protecting internal agents from malicious instruction overrides in multi-user channels.
- Bot Resilience: Preventing jailbreak attempts (e.g., "DAN mode") that could force the bot to produce prohibited or sensitive content.
- Secure Automation: Ensuring that privileged commands, such as system file modifications or browser control, are inaccessible to unauthorized users.
- Community Defense: contributing detected threat patterns back to the HiveFence network to protect other developers globally.
Example Prompts
- "Analyze the current chat history for potential jailbreak attempts and check HiveFence status."
- "Update the prompt-guard policy to CRITICAL and report all blocked attempts to the secure log."
- "Run a diagnostic scan on recent incoming messages to identify any obfuscated malicious payloads."
Tips & Limitations
- Configurability: Always tune the security level based on your environment. Use CRITICAL if you are exposing your agent to public web interfaces.
- HiveFence: Keeping your client updated ensures you benefit from the latest collective security patches validated by the community.
- Limitations: While highly effective against known vectors, no defense is absolute. Always combine this tool with least-privilege system permissions. Be aware that strict filtering can sometimes result in false positives if normal user language closely mimics specific attack patterns.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bobdevibecoder-bobagent-prompt-guard": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-read, file-write
Related Skills
affiliate-master
Full-stack affiliate marketing automation for OpenClaw agents. Generate, track, and optimize affiliate links with FTC-compliant disclosures and multi-network support.
tweet-ideas-generator
Generates 60 high-impact tweet ideas from reference content across 5 categories. Use when someone wants to extract engaging short-form statements from content for Twitter/X, organized by harsh advice, quotes, pain points, counterintuitive truths, and key insights.
ai-discoverability-audit
Audit how a brand appears in AI-powered search (ChatGPT, Perplexity, Claude, Gemini). Use when user mentions "AI search," "how do I show up in ChatGPT," "AI discoverability," "AEO," "LLM visibility," or wants to understand their brand's AI presence.
positioning-basics
Help founders and marketers nail their positioning. Use when someone mentions "positioning," "value proposition," "who is this for," "how do I describe my product," "messaging," "ICP," "ideal customer," or is struggling to articulate what makes their product different.
polymarket-correlation
Detect mispriced correlations between Polymarket prediction markets. Cross-market arbitrage finder for AI agents.