agent-browser
使用 agent-browser CLI 进行浏览器自动化。用于签到、填表、截图、信息抓取等需要控制浏览器的任务。触发条件:(1) 用户要求自动化浏览器操作 (2) 需要签到、填表、点击按钮 (3) 需要抓取网页内容作为研究素材
Why use this skill?
Use the agent-browser skill for efficient web automation, form filling, and data scraping. Built on Playwright, it offers reliable browser control.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/joshhuang123/agent-browser-cliWhat This Skill Does
The agent-browser skill provides an advanced command-line interface for browser automation powered by Playwright. Designed for the OpenClaw AI agent, it allows the agent to navigate the web, interact with DOM elements, and extract information with high precision. By leveraging accessibility tree snapshots, it offers a more robust way to target web elements compared to traditional coordinate-based scraping, making it highly effective for complex, dynamic web applications.
Installation
To integrate this skill into your environment, use the OpenClaw CLI tool:
clawhub install openclaw/skills/skills/joshhuang123/agent-browser-cli
Ensure that Node.js is installed on your system. After installation, verify the environment by running the dependency setup command provided by the agent-browser package:
npm install -g agent-browser
agent-browser install
Use Cases
This tool is ideal for repetitive tasks and automated research. Use it for automated check-ins on attendance or loyalty platforms, filling out complex web forms across multiple pages, scraping specific data points from websites that require JavaScript execution, and conducting automated visual regression or documentation tasks through page snapshots and annotated screenshots.
Example Prompts
- "Open https://example.com/login, fill in the username field with 'my_user', enter 'secret_pass' in the password field, and click the submit button."
- "Go to the company dashboard, capture a snapshot to identify the latest report link, click it, and save a screenshot of the results to my workspace."
- "Check for new items on the product page and alert me if the 'Buy Now' button appears."
Tips & Limitations
- Snapshot First: Always run
agent-browser snapshotbefore performing actions to ensure your references (@ref) are current and valid. - Handle Latency: Web pages are dynamic. Always include
sleepcommands or explicit waits after navigation to allow for elements to render. - Resource Management: Keep your browser sessions clean; always execute
agent-browser closeto free up system memory and terminate underlying browser processes after your task is finished. - Complexity: While powerful, this tool requires the agent to be aware of the DOM structure. If a website changes its layout, you may need to re-verify your targeting logic.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-joshhuang123-agent-browser-cli": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read, code-execution