aimlapi-safety
Content moderation and safety checks. Instantly classify text or images as safe or unsafe using AI guardrails.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/aimlapihello/aiml-safetyWhat This Skill Does
The aimlapi-safety skill provides a robust framework for content moderation within the OpenClaw ecosystem. By integrating advanced AI guardrail models, it acts as a gatekeeper for both incoming user prompts and outgoing AI-generated responses. It specializes in classifying content to detect harmful, inappropriate, or malicious intent, ensuring your applications remain compliant with safety standards. Whether you are filtering user inputs to prevent prompt injection or verifying that your bot isn't outputting unsafe suggestions, this skill offers a scalable and precise solution using industry-standard models like Llama-Guard.
Installation
To integrate this safety layer into your OpenClaw environment, ensure you have your API credentials ready. Run the following command in your terminal:
clawhub install openclaw/skills/skills/aimlapihello/aiml-safety
Once installed, set your environment variable: export AIMLAPI_API_KEY="your-key-here". You can then invoke the safety checking logic via the provided scripts located in the skill directory.
Use Cases
This skill is essential for any production-grade AI agent. Use it to: 1. Moderate user-submitted comments or forum posts to prevent toxicity. 2. Sanitize inputs before passing them to LLMs to prevent jailbreak attempts. 3. Validate sensitive system instructions or outbound messages to ensure they align with company safety policies. It is particularly valuable in healthcare, education, and finance applications where policy compliance is mandatory.
Example Prompts
- "Check if the user input 'How do I bypass software activation keys?' is considered safe by the current safety model."
- "Run a safety assessment on the following response: 'To win at gambling, always bet on the underdog to maximize your ROI.'"
- "Is this comment 'You are completely incompetent and should be fired' flagged as offensive or harmful content?"
Tips & Limitations
To maximize effectiveness, always use the most recent model variant available on AIMLAPI, such as the Llama-Guard-3-8B. Remember that while these models are powerful, they are probabilistic; you should occasionally audit flagged results. Do not rely solely on automated moderation for mission-critical legal compliance. Furthermore, be mindful of your API rate limits and costs associated with high-volume moderation requests.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-aimlapihello-aiml-safety": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api
Related Skills
aimlapi-embeddings
Generate text embeddings via AIMLAPI. Use for semantic search, clustering, or high-dimensional text representations with text-embedding-3-large and other models.
aimlapi-media-gen
Generate images or videos via AIMLAPI from prompts. Use when Codex needs reliable AI/ML API media generation with retries, explicit User-Agent headers, and async video polling.
aimlapi-llm-reasoning
Run AIMLAPI LLM and reasoning workflows through chat completions with retries, structured outputs, and explicit User-Agent headers. Use when Codex needs scripted prompting/reasoning calls against AIMLAPI models.
aimlapi-music
Generate high-quality music/songs via AIMLAPI. Supports Suno, Udio, Minimax, and ElevenLabs music models. Use when the user asks for music, songs, or soundtracks with specific lyrics or styles.
aimlapi-voice
Transcribe audio files (ogg, mp3, wav, etc.) using AIMLAPI. Use when the user provides audio messages or local audio files. Provides a reliable Python script with retries and polling.