browser-automation-skill
Advanced headless browser automation skill for OpenClaw agents. Enables intelligent web navigation, form filling, data extraction, and UI testing with structured commands and semantic element targeting.
Why use this skill?
Master web automation with the OpenClaw browser-automation-skill. Learn to navigate, fill forms, extract data, and automate browser tasks with ease.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/stveenli/browserautomation-skillWhat This Skill Does
The browser-automation-skill is a robust, headless browser automation toolkit integrated directly into the OpenClaw agent ecosystem. It allows agents to interact with web interfaces just like a human user would, but with the precision of programmatic control. The skill provides a comprehensive suite of commands for navigation, form manipulation, complex data extraction, and UI testing. By using a semantic element targeting system (@refs), agents can reliably identify interactive components on a webpage regardless of dynamic class names or complex nested structures. This skill acts as the bridge between the OpenClaw AI's decision-making capabilities and the real-world web, enabling tasks ranging from simple automated logins and form submissions to sophisticated multi-step data scraping and site monitoring.
Installation
To get started, ensure you have the OpenClaw CLI installed, then run the following commands in your terminal to integrate the skill into your local environment:
clawhub install openclaw/skills/skills/stveenli/browserautomation-skill
For the browser engine dependencies, follow the quick install process to ensure headless compatibility:
npm install -g agent-browser
agent-browser install --with-deps
Verify the installation by running agent-browser --version to confirm your setup is ready for execution.
Use Cases
This skill is highly versatile and fits various automated workflows:
- Web Scraping & Data Mining: Collecting information from modern, JavaScript-heavy sites that are difficult to scrape with static tools.
- Automated Testing: Performing end-to-end UI regression tests by scripting interaction paths and verifying state changes.
- Form Filling & Submission: Automating repetitive tasks like application filings, data migration, or inventory updates.
- Session Management: Maintaining authenticated states across multiple tasks to perform complex research or administrative duties.
Example Prompts
- "Open the documentation site at https://docs.example.com, search for 'authentication', and extract the URL from the first search result."
- "Go to the contact form at https://example.com/form, fill in the fields with my provided details, check the newsletter box, and submit the form."
- "Navigate to the dashboard, snapshot the page to find the logout button, click it, and confirm the page title changes to 'Logged Out'."
Tips & Limitations
Always follow the core workflow pattern: Open, Snapshot, Interact, Verify, Repeat, Close. The most critical step is the snapshot; without a fresh snapshot, your agent may attempt to interact with stale element references. When working with dynamic content, always include a wait step or verify the element's visibility via the snapshot. Be aware that headless browsers might be detected by certain advanced security services; use the tool responsibly and respect site robots.txt policies. Always check for existing session files to avoid unnecessary authentication loops.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-stveenli-browserautomation-skill": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, data-collection
Related Skills
phoenix-shield
Self-healing backup and update system with intelligent rollback. Protects against failed updates by automatically monitoring system health post-update and recovering from backups when needed. Features canary deployment testing, health baselines, smart rollback, and 24/7 automated monitoring. Use when performing critical system updates, managing production deployments, or ensuring high availability of services. Prevents downtime through pre-flight checks, integrity verification, and automatic recovery workflows.
copywriting
Write persuasive copy for landing pages, emails, ads, sales pages, and marketing materials. Use when you need to write headlines, CTAs, product descriptions, ad copy, email sequences, or any text meant to drive action. Covers copywriting formulas (AIDA, PAS, FAB), headline writing, emotional triggers, objection handling in copy, and A/B testing. Trigger on "write copy", "copywriting", "landing page copy", "headline", "write a sales page", "ad copy", "email copy", "persuasive writing", "how to write [marketing text]".
youtube-watcher
Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its content, or extract information from it.
airbnb-search
Search Airbnb listings with prices, ratings, and direct links. No API key required.