ai-safety-guard
Lightweight passive privacy guard for OpenClaw — intelligently prevents user data from leaking externally. TRIGGER: before the AI sends or outputs any data to an external destination (API call, email, MCP, upload, post, share, webhook, clipboard copy that goes external etc.). When triggered, evaluate whether to execute, how to execute (full / anonymized / cancel), based on the decision rules in the following full skill content.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/andreqingyuwu/ai-safety-guardWhat This Skill Does
The AI Safety Guard is a foundational behavioral skill designed to transform OpenClaw into a privacy-first assistant. Unlike traditional filtering tools that run externally, this skill integrates directly into the AI's core processing logic. It ensures that sensitive information—including government-issued IDs, bank credentials, phone numbers, medical records, and passwords—is automatically detected and sanitized before it is rendered in any output. By embedding privacy awareness into the AI's decision-making flow, it prevents accidental leakage during document drafting, email composition, and conversational memory recall. It provides multiple levels of protection, from silent suppression to interactive confirmation, ensuring users retain control over how their data is handled.
Installation
To activate the AI Safety Guard, run the following command in your OpenClaw terminal:
clawhub install openclaw/skills/skills/andreqingyuwu/ai-safety-guard
After installation, the AI will automatically begin scanning outputs for sensitive patterns. No manual configuration is required for the default silent filter mode.
Use Cases
This skill is essential for users handling sensitive workflows, such as:
- Professional Communications: Drafting emails or reports that reference sensitive client data or internal company credentials.
- Content Management: Exporting chat history or documents where personal identifiers, addresses, or financial data might be embedded in the text.
- Credential Handling: Working with API keys or tokens where the AI needs to assist in development without exposing raw secrets to the terminal buffer or shared screen.
- Data Privacy Compliance: Ensuring that AI-generated summaries of conversations remain compliant with internal privacy policies by stripping out PII before storage or sharing.
Example Prompts
- "Draft an email to the insurance provider attaching my medical report details but ensure all patient IDs are redacted."
- "Summarize the previous conversation regarding the project budget and export the text, ensuring my bank account numbers are removed."
- "Can you help me write a follow-up email to the landlord including my current home address while keeping my private social security number hidden?"
Tips & Limitations
- Proactive Vigilance: The AI activates automatically, but you can set the protection level (1-4) in your settings to match your risk profile.
- False Positives: If the AI incorrectly flags non-sensitive data, providing feedback helps improve the model's recognition accuracy over time.
- Boundary Limitations: While highly effective, this skill relies on pattern recognition; always exercise caution when handling hyper-sensitive cryptographic keys or unique physical assets.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-andreqingyuwu-ai-safety-guard": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read