agent-browser-core
OpenClaw skill for the agent-browser CLI (Rust-based with Node.js fallback) enabling AI-friendly web automation with snapshots, refs, and structured commands.
Why use this skill?
Master web automation with agent-browser-core. A Rust-based CLI skill for OpenClaw that delivers deterministic, DOM-aware web navigation for AI agents.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/lion504/agent-browser-core-1-0-1What This Skill Does
The agent-browser-core skill acts as the primary interface for the OpenClaw agent to interact with the web. Built on a Rust-based CLI with a robust Node.js fallback, it provides a deterministic, machine-readable way to perform web automation. Instead of relying on brittle pixel-based screen scraping, this skill utilizes DOM-based snapshots and unique element references (refs). This approach allows the AI to perceive the browser content as a structured JSON object, making it significantly more reliable for complex workflows like form filling, data extraction, and multi-step navigation. It is designed for production-grade automation where predictability and auditability are critical.
Installation
To integrate this skill into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/lion504/agent-browser-core-1-0-1
Ensure that you have a compatible browser runtime installed (such as Chromium or Playwright) as the CLI requires a browser engine to render web pages and generate snapshots.
Use Cases
- Automated Data Research: The agent can navigate to search engines or news portals, extract text data, and compile it into a structured summary.
- Form Submission Automation: Navigate through complex web forms, interact with specific input fields using refs, and submit data securely.
- Regression Testing: Capture before-and-after snapshots of web elements to verify that UI updates have not negatively impacted functionality.
- Account Management: Log into authorized services, perform routine tasks, and handle authentication workflows with session-based persistence.
Example Prompts
- "Open the browser and go to github.com/openclaw, find the 'Latest Releases' section using the latest snapshot, and list the versions."
- "Navigate to the registration page for our internal dashboard, fill in the fields with the provided credentials, and confirm the submission succeeds."
- "Search for the latest documentation on agent-browser core, extract the main installation steps, and format them into a JSON output."
Tips & Limitations
To maximize effectiveness, always trigger a snapshot immediately after a navigation event or DOM change. This ensures the AI is making decisions based on the most current state of the page. Use the --json flag consistently to ensure the agent receives clean input.
Limitations: This skill is not designed for streaming heavy media or executing arbitrary JavaScript on untrusted sites. It is strictly constrained by safe-mode defaults to prevent unauthorized network access or file system modification. Avoid heavy tasks that require long-running browser sessions, as this can exhaust system resources and lead to performance degradation.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-lion504-agent-browser-core-1-0-1": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, data-collection
Related Skills
Code
Coding workflow with planning, implementation, verification, and testing for clean software development.
language-learning
AI language tutor for learning ANY language through conversation, vocab drills, grammar lessons, flashcards, and immersive practice. Use when the user wants to: learn a new language, practice vocabulary, study grammar, do flashcard drills, translate phrases, practice conversation, prepare for travel, learn slang/idioms, or improve pronunciation. Supports ALL languages including Spanish, French, German, Japanese, Chinese (Mandarin/Cantonese), Korean, Arabic, Hindi, Bengali/Bangla, Portuguese, Russian, Italian, Turkish, Vietnamese, Thai, Swahili, Hebrew, Polish, Dutch, Greek, and 100+ more.
gemini-deep-research
Perform complex, long-running research tasks using Gemini Deep Research Agent. Use when asked to research topics requiring multi-source synthesis, competitive analysis, market research, or comprehensive technical investigations that benefit from systematic web search and analysis.
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.