Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
Why use this skill?
Power your AI agents with Agent Browser. A high-performance Rust-based headless automation CLI for web navigation, interaction, and data extraction.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/sakaen736jih/agent-browser-ufymjtykwuasWhat This Skill Does
Agent Browser is a high-performance, Rust-based headless browser automation engine designed to empower AI agents with the ability to interact with the web just like a human. By leveraging structured commands, the agent can navigate complex websites, analyze DOM structures via accessibility trees, and perform precise UI interactions. Unlike traditional automation scripts, this skill provides an 'interactive' mode that identifies elements with specific references (e.g., @e1, @e2), allowing the agent to perform granular tasks such as form filling, clicking, and complex drag-and-drop operations without needing to know the underlying HTML tag structures.
Installation
To begin, ensure you have the OpenClawProvider installed as per your OS requirements. Once the provider is active, install the CLI via npm: npm install -g agent-browser. For advanced users or those contributing to the project, the source can be cloned and built directly using pnpm. Always run agent-browser install to ensure browser dependencies are correctly configured for your environment.
Use Cases
- Automated Research: Have your agent browse target websites to aggregate data or monitor price changes.
- Headless Testing: Perform end-to-end testing of web applications by scripting user journeys.
- Content Submission: Automate the repetitive process of filling out forms, uploading files, or navigating through multi-step account setups.
- Dynamic Data Extraction: Use snapshots to extract specific elements from pages that require authentication or complex JavaScript rendering.
Example Prompts
- "Open https://example.com/login, fill the username field with 'my_user' and the password field with 'secure123', then click the login button."
- "Go to the contact page, snapshot the elements to find the submit button, and press Enter to submit the inquiry form."
- "Navigate to the latest news section, scroll down to load more items, and snapshot all interactive links available on the page."
Tips & Limitations
- Snapshot Depth: When dealing with massive pages, use
agent-browser snapshot -cor set a specific depth limit to avoid token exhaustion or irrelevant data. - Stability: Always re-snapshot after a navigation command, as DOM changes render old element references invalid.
- Compliance: This skill provides significant power to interact with the web; ensure all automated requests comply with the website's
robots.txtand terms of service. - OS Security: Note the specific installation requirements for MacOS, which uses a setup-wizard for driver configuration; always verify the source of your provider files.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-sakaen736jih-agent-browser-ufymjtykwuas": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, code-execution
Related Skills
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
auto-updater
Automatically update Clawdbot and all installed skills once daily. Runs via cron, checks for updates, applies them, and messages the user with a summary of what changed.
Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
nano-pdf
Edit PDFs with natural-language instructions using the nano-pdf CLI.
bird
X/Twitter CLI for reading, searching, and posting via cookies or Sweetistics.