consensus-agent-action-guard
Pre-execution governance for high-risk agent actions. Uses persona-weighted consensus to decide ALLOW/BLOCK/REQUIRE_REWRITE before external or irreversible side effects occur, with board-native audit artifacts.
Why use this skill?
Implement robust, multi-persona governance for your AI agents. The consensus-agent-action-guard prevents high-risk, irreversible operations.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/kaicianflone/consensus-agent-action-guardWhat This Skill Does
The consensus-agent-action-guard serves as the primary defensive layer for autonomous systems, acting as a mandatory checkpoint before any irreversible or high-stakes operation is executed. Unlike simple rule-based filters, this skill employs a multi-persona weighted consensus engine to evaluate proposed agent actions. It scrutinizes the intent, potential for side effects, and risk profile of every action before allowing it to proceed to the system shell or network boundary.
By returning one of three states—ALLOW, BLOCK, or REQUIRE_REWRITE—it ensures that autonomous agents cannot drift into unauthorized behaviors. The skill integrates directly with your board-native audit trail, ensuring that every decision is logged with the corresponding persona weightings. This creates an immutable record of why an action was permitted or stopped, which is essential for compliance and post-mortem analysis.
Installation
To integrate this safety gate into your workflow, use the standard ClawHub CLI to pull the package from the official source repository. Ensure your environment has node and tsx installed to support the runtime binary requirements.
Command:
clawhub install openclaw/skills/skills/kaicianflone/consensus-agent-action-guard
Once installed, the skill resides within your agent's dependency tree, utilizing consensus-guard-core to maintain architectural consistency across your other security modules. No external credentials are required, as the skill operates entirely within the scope of your board's local filesystem artifacts.
Use Cases
- Destructive File Operations: Prevent agents from executing
rm -rfor other mass-deletion commands without explicit consensus from multiple, specialized, pre-configured personas. - External Messaging: Guard communication tools so that an agent cannot post to public channels or send emails without reaching a predefined 'human-in-the-loop' or 'senior-agent' consensus threshold.
- Network Configuration: Protect critical system infrastructure by requiring a high-trust score for any tool call attempting to modify proxy settings, firewall rules, or DNS entries.
- Financial Transactions: Gate API calls involving fund transfers or wallet interactions to ensure that even if an agent is compromised, it cannot execute high-value movements without multi-signature approval logic.
Example Prompts
- "Guard the execution of this script: verify the file path /etc/production-config and ensure no deletion flags are present before allowing execution."
- "Before I push these changes to the production branch, run the consensus-agent-action-guard to check if my persona weights are satisfied."
- "Evaluate the pending proposal to update external messaging; if the risk profile is above 0.7, trigger a REQUIRE_REWRITE and ask the agent to justify the content."
Tips & Limitations
- Persona Management: The effectiveness of this skill is directly tied to the quality of your persona sets. Define diverse personas to avoid echo-chamber logic.
- Performance: While the evaluation is deterministic and local, heavy usage of deep-chain consensus in a tight loop may introduce minor latency. Batch your critical action requests where possible.
- Scope: Note that this skill does not monitor the internal thought process of the agent; it monitors the 'tool-call' contract boundary. Ensure that your agent is configured to pass all relevant context to the
invokeentrypoint for accurate risk assessment. - Modes: Use
external_agentmode when integrating human-in-the-loop signals to ensure your human team has the final say in the consensus calculation.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-kaicianflone-consensus-agent-action-guard": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, code-execution
Related Skills
consensus-persona-generator
Generate and persist reusable persona panels (persona_set artifacts) for consensus decision workflows. This skill initializes evaluator diversity for downstream guards; ongoing reputation updates are owned by consensus-persona-engine.
Consensus Interact
Skill by kaicianflone
consensus-send-email-guard
Persona-weighted pre-send email governance for AI systems. Produces APPROVE/BLOCK/REWRITE decisions, writes decision artifacts to the board ledger, updates persona reputation over time, and returns strict machine-parseable JSON.
consensus-publish-guard
Persona-weighted governance for outbound publishing (blog, social, announcements). Prevents unsafe public claims via hard-block checks, weighted consensus, rewrite paths, and board-native audit artifacts.
consensus-persona-respawn
Ledger-informed persona lifecycle management. Replaces low-performing personas with successor personas derived from mistake patterns in board decision history, preserving adaptive governance over long-running automation. Reputation updates are computed by consensus-persona-engine.