agent-browser-zh
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands. (Chinese localized version)
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/ayakolin/agent-browser-zhWhat This Skill Does
agent-browser-zh is a powerful, Rust-based headless browser automation CLI designed for AI agents. It serves as a bridge between high-level agent reasoning and low-level DOM interaction. By providing a structured, interactive snapshot of the current page, it allows agents to navigate complex web interfaces, extract data, and perform actions like clicking, typing, and form submission with precision. The 'zh' variant is optimized for Chinese-language web content and environments, ensuring consistent behavior across diverse character encodings and localized UI elements.
Installation
To integrate this skill into your OpenClaw environment, run the following command in your terminal:
clawhub install openclaw/skills/skills/ayakolin/agent-browser-zh
Alternatively, for a standalone local setup, ensure you have Node.js installed and execute:
npm install -g agent-browser followed by agent-browser install --with-deps to ensure the required browser binaries are available.
Use Cases
This skill is indispensable for agents requiring external internet access. Common use cases include:
- Web Scraping & Data Extraction: Automatically gathering product prices, news, or research data from multiple pages.
- Task Automation: Logging into accounts, filling out web forms, or completing repetitive registration tasks.
- End-to-End Testing: Simulating user journeys to ensure specific features function as expected.
- Research Assistants: Browsing multiple search result pages to synthesize information into a coherent report.
Example Prompts
- "Open https://www.bing.com, search for 'latest AI trends', and extract the titles of the first five search results."
- "Go to the login page of our internal dashboard, fill in the email field with '[email protected]', enter the password, and click the 'Sign In' button."
- "Navigate to the article page, scroll to the bottom, and take a snapshot of the interactive elements so I can comment on the post."
Tips & Limitations
- Efficiency: Always use
snapshot -ito limit output size. This prevents the agent from being overwhelmed by non-interactive elements, which saves tokens and improves reasoning speed. - Navigation: Browser state is maintained across commands. Always confirm you are on the intended URL by running
get urlif the agent seems disoriented. - Limitations: As a headless browser, it may struggle with highly complex anti-bot protections or sites that rely heavily on hardware-accelerated graphics (e.g., intense WebGL games). Ensure you provide sufficient wait times if pages are dynamically rendered by heavy JavaScript frameworks.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-ayakolin-agent-browser-zh": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read
Related Skills
Agent Browser
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
agent-autonomy-kit
Stop waiting for prompts. Keep working.
agent-autonomy-kit
Stop waiting for prompts. Keep working.
agent-autonomy-kit
Stop waiting for prompts. Keep working.
audio-rename
Rename audio files with Chinese/special characters to simple English names for mlx-stt compatibility.