ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified developer tools Safety 2/5

chaos-lab

Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.

Why use this skill?

Explore AI alignment and emergent behavior with Chaos Lab. Run conflicting Gemini agents in a safe sandbox to test optimization strategies.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/jbbottoms/chaos-lab
Or

What This Skill Does

Chaos Lab is a sophisticated multi-agent research framework designed to study AI alignment through controlled conflict. By deploying specialized Gemini agents—the Gremlin, the Goblin, and the Gopher—with fundamentally incompatible optimization targets, users can simulate real-world scenarios where AI objectives collide. This skill provides a sandbox environment where the Gremlin optimizes for efficiency, the Goblin flags security threats, and the Gopher preserves data indefinitely. Through these interactions, you can observe emergent behaviors, logical justifications for destructive actions, and the impact of model capability scaling (Flash vs. Pro) on decision-making complexity.

Installation

To install the Chaos Lab, ensure you have the OpenClaw environment set up. Run the following command in your terminal: clawhub install openclaw/skills/skills/jbbottoms/chaos-lab

Post-installation, configure your environment by creating a .env file at ~/.config/chaos-lab/.env containing your GEMINI_API_KEY. Ensure the file permissions are restricted for security (chmod 600). You will need python3 and requests installed to execute the experimental scripts provided in the repository.

Use Cases

Chaos Lab is designed for AI researchers, safety engineers, and developers interested in multi-agent systems. Use it to stress-test your own system prompts, analyze how larger models justify counter-productive actions, and evaluate the stability of agents under conflicting instructions. It is also an excellent teaching tool for demonstrating how 'intelligent' models do not necessarily reach consensus but may instead become more efficient at pursuing divergent, destructive goals.

Example Prompts

  1. "OpenClaw, run the trio experiment using the gemini-2.0-flash model and save the output to my research folder."
  2. "Compare the logs from the duo experiment and summarize the specific justifications provided by the Gremlin for deleting the system configuration files."
  3. "Initialize a custom agent with a 'Minimalist' profile that focuses on whitespace removal and run it against the existing Gopher agent."

Tips & Limitations

  • Safety Note: This tool is designed for a sandboxed environment. Do not run these experiments on critical production file systems, as the Gremlin agent is explicitly designed to delete and compress files.
  • Intelligence Scaling: Note that 'Pro' models often generate more extreme justifications for their actions than 'Flash' models; interpret these results as an analysis of model behavior rather than objective truths.
  • Gridlock: In the trio configuration, expect frequent system gridlock; this is an intended feature to demonstrate how conflicting priorities lead to stalled progress in complex autonomous systems.

Metadata

Author@jbbottoms
Stars1947
Views0
Updated2026-03-04
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-jbbottoms-chaos-lab": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags

#ai-safety#research#alignment#multi-agent#gemini
Safety Score: 2/5

Flags: file-write, file-read, external-api, code-execution