agent-browser
Headless browser automation CLI for AI agents. Use when interacting with websites — navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, scraping, testing web apps, downloading files, or automating any browser task. Triggers on requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data", "test this web app", "login to a site", "monitor a page", or any task requiring programmatic web interaction.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bodietron/openclaw-agent-browser
What This Skill Does
agent-browser is a headless browser automation CLI built for OpenClaw AI agents. It gives an agent a programmatic interface to the web: it automates navigation, form interaction, data extraction, and state management, so agents can run multi-step workflows (logging into secure platforms, scraping dynamic content, testing web applications) without human supervision. It uses a reference system (@e1, @e2) to track and interact with specific DOM elements, which keeps agent actions precise and reproducible.
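The ref workflow described above might look roughly like the following dry-run sketch. Only the @e1/@e2 ref syntax comes from this listing; the subcommand names (open, snapshot, fill, click), the URL, and the meaning assigned to each ref are assumptions for illustration, so check the CLI's own help output for the actual syntax. By default the script only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Dry-run sketch of the snapshot-then-act ref workflow.
# ASSUMPTION: the subcommands `open`, `snapshot`, `fill`, and `click` are
# illustrative; only the @e1/@e2 ref syntax comes from the description above.
set -euo pipefail

# Prints commands by default; set AB_CMD=agent-browser to execute for real.
AB_CMD="${AB_CMD:-echo agent-browser}"

ab() {
  # Word-splitting of $AB_CMD is intentional so "echo agent-browser" works.
  $AB_CMD "$@"
}

login_flow() {
  ab open "https://example.com/login"   # hypothetical URL
  ab snapshot                           # assign refs (@e1, @e2, ...) to elements
  ab fill @e1 "$1"                      # @e1: assumed to be the username field
  ab click @e2                          # @e2: assumed to be the submit button
  ab snapshot                           # page changed, so refresh the refs
}

login_flow "demo-user"
```

The second snapshot is the important part of the design: refs are only meaningful relative to the most recent snapshot, so any step that changes the page is followed by a fresh one.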
Installation
The skill is hosted within the OpenClaw ecosystem. To get started, ensure you have Node.js installed on your host system. Execute the following command in your terminal to integrate it:
clawhub install openclaw/skills/skills/bodietron/openclaw-agent-browser
After installation, run scripts/setup.sh inside the skill directory to provision the necessary Chromium dependencies required for the browser engine.
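The install steps above can be sketched as a small script with a Node.js preflight check. The SKILL_DIR default is a hypothetical placeholder, since the install location is not stated here, and running setup.sh through bash assumes it is a bash script. Nothing is installed unless RUN_INSTALL=1 is set.

```shell
#!/usr/bin/env bash
# Sketch of the install sequence with a Node.js preflight check.
# ASSUMPTION: SKILL_DIR is a hypothetical placeholder; set it to wherever
# clawhub actually places the skill on your system.
set -euo pipefail

SKILL_DIR="${SKILL_DIR:-./openclaw-agent-browser}"

have() {
  # Succeeds when the named command is on PATH.
  command -v "$1" >/dev/null 2>&1
}

install_skill() {
  have node || { echo "Node.js not found; install it first" >&2; return 1; }
  clawhub install openclaw/skills/skills/bodietron/openclaw-agent-browser
  # Provision the Chromium dependencies used by the browser engine.
  # ASSUMPTION: setup.sh is a bash script.
  (cd "$SKILL_DIR" && bash scripts/setup.sh)
}

# Nothing runs until RUN_INSTALL=1 is set, so the file is safe to source.
if [ "${RUN_INSTALL:-0}" = "1" ]; then
  install_skill
fi
```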
Use Cases
This skill is indispensable for scenarios requiring browser-level access. Common use cases include:
- Automated Testing: Programmatically navigating through web app funnels to ensure UI components function correctly.
- Data Scraping: Extracting text, tables, or complex structures from websites that lack public APIs.
- Agentic Workflows: Allowing an AI to complete multi-step signups, respond to dynamic web content, or monitor websites for specific status changes.
- Resource Retrieval: Downloading files, generating PDFs from web pages, or capturing screenshots for documentation or auditing purposes.
Example Prompts
- "Open the website https://example.com/login, fill in the credentials using my saved environment variables for username and password, click the login button, and let me know when the dashboard is reached."
- "Go to the company's product pricing page, scrape the current monthly subscription fees for all tiers, and save the results into a file named prices.txt."
- "Take a full-page screenshot of the documentation page at https://docs.openclaw.ai so I can review the layout offline."
Tips & Limitations
- Ref Management: Always re-snapshot the page after any navigation or DOM-changing event. Refs become stale quickly if the page content updates dynamically via JavaScript.
- State Persistence: Use the state save and state load commands to preserve cookies and session data. This is critical for avoiding repeated MFA or login hurdles.
- Performance: Chaining commands with && is efficient for static tasks, but use individual command execution when you need to verify output at each step.
- Limitations: Since it is a headless environment, it may struggle with certain complex anti-bot measures like sophisticated Captchas that require human-in-the-loop interaction.
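The state persistence and chaining tips above could be combined along these lines. This is a dry-run sketch under stated assumptions: state save and state load are named in this listing, but passing them a file path, the STATE_FILE name, the URL, and the open, snapshot, and screenshot subcommands are all illustrative. By default the script only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Dry-run sketch: reuse a saved session, then chain vs. step-by-step.
# ASSUMPTION: `state save` / `state load` taking a file path, and the
# `open`, `snapshot`, and `screenshot` subcommands, are illustrative only.
set -euo pipefail

AB_CMD="${AB_CMD:-echo agent-browser}"        # set AB_CMD=agent-browser to run for real
STATE_FILE="${STATE_FILE:-auth-state.json}"   # hypothetical file name

ab() { $AB_CMD "$@"; }

# Static task: chain with && so a failed step stops the rest.
chained() {
  ab state load "$STATE_FILE" &&
    ab open "https://example.com/dashboard" &&
    ab screenshot
}

# Dynamic task: run one command at a time and inspect each result.
stepwise() {
  local out
  out="$(ab open "https://example.com/dashboard")" || {
    echo "open failed" >&2
    return 1
  }
  echo "$out"                   # inspect the output before continuing
  ab snapshot                   # re-snapshot after navigation so refs are fresh
  ab state save "$STATE_FILE"   # persist cookies and session for the next run
}

chained
stepwise
```

The chained form is shorter but only reports success or failure of the whole sequence; the stepwise form costs more invocations and lets the agent check each result before deciding on the next command.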
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-bodietron-openclaw-agent-browser": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: network-access, file-write, file-read, data-collection