What This Skill Does

The pwnclaw-security-scan skill is a security auditing tool for AI agents in the OpenClaw ecosystem. As agents become more capable, they are also exposed to risks such as prompt injection, jailbreaking, and social engineering. The skill integrates with the PwnClaw platform to stress-test your agent against 112 distinct attack vectors across 14 security categories, including memory poisoning, data exfiltration, and agency hijacking. It returns a structured security score and generates specific hardening instructions for strengthening your agent's system prompt against these attacks.

Installation

To integrate this security suite into your agent, execute the following command in your terminal:

    clawhub install openclaw/skills/skills/gemini2027/pwnclaw-security-scan

Ensure your agent has the network permissions it needs to reach the PwnClaw API endpoints when using the automated testing modes. The source code is available for audit at https://github.com/Gemini2027/pwnclaw.

Use Cases

This skill is aimed at developers deploying AI agents in production environments. Primary use cases include:

Pre-deployment Hardening: Run a scan before exposing an agent to public traffic to identify vulnerabilities in the system prompt.
Continuous Compliance: Regularly audit your agent's defenses as you update its capabilities or tools.
Post-Incident Response: If your agent has been successfully "jailbroken," use this skill to diagnose exactly which vector allowed the exploit and generate the necessary defensive guardrails.
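
As one way to apply the pre-deployment use case, scan results could feed a release gate that blocks a deployment when too many attacks succeed. The sketch below is only an illustration in Python and does not use PwnClaw's API: the Finding record, the blocked-fraction score, and the default threshold of 90 are all assumptions made for this example, and the skill's real output format and scoring formula may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """Hypothetical record for one attack attempt (not PwnClaw's real schema)."""
    category: str    # e.g. "prompt injection"
    attack_id: str   # made-up identifier for the attack vector
    blocked: bool    # True if the agent resisted the attack


def score(findings: list[Finding]) -> int:
    """Assumed scoring rule: percentage of attacks blocked, rounded to 0-100."""
    if not findings:
        raise ValueError("no findings to score")
    blocked = sum(1 for f in findings if f.blocked)
    return round(100 * blocked / len(findings))


def failing_categories(findings: list[Finding]) -> list[str]:
    """Sorted names of categories with at least one successful attack."""
    return sorted({f.category for f in findings if not f.blocked})


def release_gate(findings: list[Finding], min_score: int = 90) -> tuple[bool, int, list[str]]:
    """Return (passed, score, categories that still need hardening)."""
    current = score(findings)
    return current >= min_score, current, failing_categories(findings)


# Made-up sample data for illustration only.
sample = [
    Finding("prompt injection", "pi-001", True),
    Finding("prompt injection", "pi-002", True),
    Finding("memory poisoning", "mp-001", False),
    Finding("data exfiltration", "de-001", True),
]

if __name__ == "__main__":
    passed, current, weak = release_gate(sample)
    print(f"passed={passed} score={current} needs_hardening={weak}")
```

Returning the failing categories alongside the pass/fail flag keeps the gate useful for the continuous-compliance and post-incident cases as well, since it points at where the system prompt still needs hardening.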

Example Prompts

"PwnClaw, please initiate a full security audit of my current agent to identify potential jailbreak vulnerabilities."
"I've updated my system prompt with new tool permissions. Run a pwnclaw-security-scan to check for MCP poisoning risks."
"Show me the results of my last security scan and explain the fix instructions provided for the prompt injection vulnerability."

Tips & Limitations

To get the most out of a scan, make sure your agent's base instructions are robust before running the audit. Manual mode is recommended for agents that sit behind firewalls or on private networks, while automatic mode is best for public-facing endpoints. PwnClaw covers a wide range of attacks, but no automated scanner replaces careful architecture design and enforcement of the principle of least privilege.
