ai-sentinel
Prompt injection detection and security scanning for OpenClaw agents. Installs the ai-sentinel plugin via OpenClaw CLI, configures plugin settings, and offers local (Community) or remote (Pro) classification with dashboard reporting. All configuration changes require explicit user confirmation.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/amandiwakar/ai-sentinelWhat This Skill Does
AI Sentinel is a robust prompt injection firewall designed specifically for OpenClaw agents. It serves as a protective layer that monitors message traffic, tool calls, and tool outputs to identify malicious patterns before they can compromise your agent's integrity. By hooking into the OpenClaw lifecycle, it intercept threats ranging from standard jailbreak attempts to complex indirect prompt injection. Whether you are running a local-only community instance or a globally monitored production agent, AI Sentinel provides the visibility required to maintain a secure AI environment.
Installation
Installation is handled directly through the OpenClaw CLI. First, ensure you are in the root of your OpenClaw project. Run openclaw plugins install ai-sentinel. Once installed, the plugin will require an initialization step to register with your gateway. Use the AskUserQuestion flow to confirm the modification of your openclaw.config.ts. If you opt for the Pro tier, you will be prompted to inject API keys into your .env file, which requires explicit user authorization for file write operations.
Use Cases
- Enterprise Guardrails: Enforcing strict adherence to internal instructions by detecting jailbreak attempts that try to bypass system prompts.
- Safe Tool Execution: Preventing indirect prompt injection where malicious data inside tool outputs attempts to manipulate the agent's next action.
- Compliance & Auditing: Using the Pro dashboard to log security events and analyze the frequency and nature of injection attempts for regulatory reporting.
Example Prompts
- "Initialize AI Sentinel in Community mode and scan all inbound messages for jailbreak attempts."
- "Enable the Pro tier for my agent and configure the dashboard reporting for real-time threat analysis."
- "Check if there are any active prompt injection warnings in the current session log."
Tips & Limitations
- Performance: Community mode operates locally, ensuring zero latency impact. Pro mode adds minimal network overhead for logging.
- False Positives: Like all heuristic scanners, AI Sentinel may occasionally flag complex but benign prompts. Use 'monitor' mode before moving to 'enforce' to tune sensitivity.
- Privacy: Always review the Zetro.ai privacy policy before opting into the Pro tier if you handle sensitive user data, as scan logs will be transmitted externally.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-amandiwakar-ai-sentinel": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api