agent-browser
Headless browser automation CLI optimized for AI agents with accessibility tree snapshots and ref-based element selection
Why use this skill?
Automate web tasks with the agent-browser skill for OpenClaw. Features headless accessibility snapshots, ref-based element selection, and session isolation.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/nicoataiza/agent-browser-clawdbot-bak-2026-01-28t18-01-09-10-30What This Skill Does
The agent-browser skill is a high-performance, headless browser automation tool designed specifically for AI agents. It prioritizes efficiency and deterministic behavior by utilizing accessibility tree snapshots, which allow the agent to "see" the page structure in a machine-readable format. By assigning unique reference IDs (refs) to interactive elements, the tool eliminates the flakiness often associated with standard CSS or XPath selectors in dynamic single-page applications (SPAs). It supports full session isolation, state persistence, and granular network control, making it an essential utility for complex, multi-step web automation workflows.
Installation
To install this skill, use the ClawHub CLI command within your OpenClaw environment:
clawhub install openclaw/skills/skills/nicoataiza/agent-browser-clawdbot-bak-2026-01-28t18-01-09-10-30
Once installed, you can trigger browser commands directly through your agent interface. Ensure your environment has the necessary dependencies for headless browser rendering.
Use Cases
- Automated Web Testing: Run regression tests for SPAs where UI components change dynamically.
- Data Extraction & Scraping: Efficiently navigate through paginated content using ref-based selectors to extract data points consistently.
- Workflow Automation: Automate complex form filling, dashboard monitoring, or multi-step account management tasks that require session persistence.
- Network Interception: Debugging frontend applications by mocking API responses or blocking unnecessary tracking scripts.
Example Prompts
- "Open https://portal.example.com, log in using the saved credentials, navigate to the dashboard, and extract the latest status count from the element labeled 'active-tasks'."
- "Go to the contact page, fill out the form using the provided details for the name and email fields, and wait for the success message to appear before closing the session."
- "Monitor the pricing page for updates, blocking all advertisement requests to speed up load times, and notify me if the price element value drops below 50."
Tips & Limitations
- Snapshot Best Practices: Always use the
-i --jsonflags with the snapshot command to ensure you receive the most relevant interactive elements and structured data for your agent. - Session Management: Leverage the
--sessionflag to maintain multiple independent browser states, which is vital for testing multi-user scenarios simultaneously. - Performance: For large pages, use the
-d(depth) flag in snapshots to limit the accessibility tree size and reduce context window token usage. - Limitations: This skill is strictly for automation and interaction. If you require deep visual analysis or pixel-perfect screenshots for debugging layout issues, fall back to the built-in browser tool instead.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-nicoataiza-agent-browser-clawdbot-bak-2026-01-28t18-01-09-10-30": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read
Related Skills
plan-my-day
Generate an energy-optimized, time-blocked daily plan
gno
Search local documents, files, notes, and knowledge bases. Index directories, search with BM25/vector/hybrid, get AI answers with citations. Use when user wants to search files, find documents, query notes, look up information in local folders, index a directory, set up document search, build a knowledge base, needs RAG/semantic search, or wants to start a local web UI for their docs.
tldr
Simplified man pages from tldr-pages. Use this to quickly understand CLI tools.
openai-tts
Text-to-speech via OpenAI Audio Speech API.
clawdhub
Use the ClawdHub CLI to search, install, update, and publish agent skills from clawdhub.com. Use when you need to fetch new skills on the fly, sync installed skills to latest or a specific version, or publish new/updated skill folders with the npm-installed clawdhub CLI.