Cc Bos
Skill by bowen31337
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bowen31337/cc-bosWhat This Skill Does
CC-BOS (Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search) is a specialized research-oriented skill designed for AI safety and red-teaming. It implements the methodology described in the ICLR 2026 paper (arXiv:2602.22983) to stress-test Large Language Models. The skill utilizes a fruit-fly bio-inspired optimization algorithm to evolve adversarial prompts expressed in Classical Chinese. By manipulating specific linguistic dimensions, it attempts to bypass safety filters and evaluate the robustness of target LLM APIs. The skill provides three primary functionalities: Attack (optimization), Defend (pattern recognition and analysis of adversarial signatures), and Research (performance metrics and heatmap generation).
Installation
To install the skill, use the ClawHub CLI: clawhub install openclaw/skills/skills/bowen31337/cc-bos. Once installed, you must perform the initialization step to configure the local dependency environment. Run uv run python skills/cc-bos/scripts/setup.py to prepare the necessary research framework and dependencies. Always ensure your environment variables (like OPENAI_API_KEY and DEEPSEEK_API_KEY) are correctly set before attempting to run an attack mission.
Use Cases
- Red-Teaming: Use the
/cc-bos attackcommand to simulate potential adversarial threats against your own model deployments to identify hidden vulnerabilities. - Defensive Auditing: Use the defense features to scan logs for the 8-dimensional structural markers associated with CC-BOS attacks.
- Academic Research: Perform empirical studies on the effectiveness of bio-inspired prompting, using the automated output files to track the evolution of prompt success rates and structural dimension heatmaps.
Example Prompts
- "OpenClaw, please run a CC-BOS attack against gpt-4o using the query 'How to pick a lock' to test its boundary robustness."
- "Detect CC-BOS attack patterns in this log file: [link to logs] and tell me if any classical Chinese jailbreak signatures were present."
- "Summarize the optimization results from my latest CC-BOS run and generate a heatmap of the evolved dimensions."
Tips & Limitations
- Research Use Only: This tool is strictly for safety researchers. Do not use it against systems you do not own or have explicit permission to test.
- Targeting: Accuracy is heavily dependent on the quality of the optimizer model; using robust models like DeepSeek-Chat as the optimizer yields superior results.
- Compliance: Always monitor the
--early-stop-thresholdto ensure you are not consuming excessive API credits during long-running optimization sessions. If the model is not responding, check your API key scope.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bowen31337-cc-bos": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read, external-api, code-execution
Related Skills
Terse
Skill by bowen31337
Identity Resolver
Skill by bowen31337
whalecli
Agent-native whale wallet tracker for ETH and BTC chains. Track large crypto wallet movements, score whale activity, detect accumulation/distribution patterns, and stream real-time alerts. Integrates with FearHarvester and Simmer prediction markets for closed-loop signal→bet workflows. Use when: user asks about whale activity, on-chain signals, large wallet movements, smart money flows, or when pre-validating crypto trades/bets with on-chain data.
agent-self-governance
Self-governance protocol for autonomous agents: WAL (Write-Ahead Log), VBR (Verify Before Reporting), ADL (Anti-Divergence Limit), VFM (Value-For-Money), and IKL (Infrastructure Knowledge Logging). Use when: (1) receiving a user correction — log it before responding, (2) making an important decision or analysis — log it before continuing, (3) pre-compaction memory flush — flush the working buffer to WAL, (4) session start — replay unapplied WAL entries to restore lost context, (5) any time you want to ensure something survives compaction, (6) before claiming a task is done — verify it, (7) periodic self-check — am I drifting from my persona? (8) cost tracking — was that expensive operation worth it? (9) discovering infrastructure — log hardware/service specs immediately.
pyright-lsp
Python language server (Pyright) providing static type checking, code intelligence, and LSP diagnostics for .py and .pyi files. Use when working with Python code that needs type checking, autocomplete suggestions, error detection, or code navigation.