browser-automation
Browser automation for AI agents using PinchTab. Control Chrome programmatically for testing, scraping, and interaction. Features token-efficient text extraction, multi-instance orchestration, headless/headed modes, and MCP integration. Use when automating browser tasks, extracting web data, testing web apps, or validating sites in real browsers.
Why use this skill?
Efficiently automate Chrome tasks with the PinchTab-powered browser-automation skill for OpenClaw. Achieve token-efficient web scraping, testing, and UI interaction.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/huamu668/browser-automation-pinWhat This Skill Does
The browser-automation skill provides a robust interface for controlling Chrome programmatically via PinchTab. Designed for AI agents, it bridges the gap between high-level task objectives and low-level browser execution. By interacting directly with the accessibility tree rather than relying on bloated visual screenshots, the skill achieves massive token efficiency, allowing agents to navigate complex web apps while keeping context windows clean. Whether you need to scrape dynamic content, validate form submissions, or automate multi-step workflows, this skill handles the heavy lifting of process orchestration, headless/headed state management, and tab lifecycle control.
Installation
To install this skill, run the following command in your terminal: clawhub install openclaw/skills/skills/huamu668/browser-automation-pin
Ensure you have the PinchTab server installed on your system or running via Docker, as the agent relies on the local HTTP API at localhost:9867 to bridge commands into the browser environment.
Use Cases
- Web Scraping: Extracting data from dynamic, JavaScript-heavy sites that standard crawlers cannot interpret.
- Quality Assurance: Automated end-to-end testing for web applications by simulating user clicks, text input, and navigation.
- Agentic Research: Browsing the internet to gather news, monitor site changes, or verify information as part of a larger research task.
- Form Automation: Programmatically filling out registrations, surveys, or internal business tools that lack APIs.
Example Prompts
- "Open a headless browser, navigate to example.com, wait for the page to load, and extract all the links found in the main content area."
- "Log into the portal using my work profile, fill out the user email field, click the submit button, and let me know if the confirmation text appears."
- "Start a new headed instance, go to the tech news site, and summarize the top three headlines from the homepage."
Tips & Limitations
- The 3-Second Rule: Always allow a 3-second buffer after navigation. Chrome needs this time to render the accessibility tree; attempting to snap or extract elements immediately will result in an empty response.
- Token Efficiency: Use the suggested JQ filtering patterns to strip out non-essential elements like navigation bars or footers before passing the data to the LLM.
- Persistence: Use browser profiles for tasks requiring authentication so you do not have to perform login sequences every time the instance restarts.
- Headless vs. Headed: Use headless mode for background tasks to save system resources; use headed mode only when you need to visually debug or interact with sites requiring MFA or complex CAPTCHAs.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-huamu668-browser-automation-pin": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write
Related Skills
Writing Assistant Pro
Skill by huamu668
web-quality-audit
Comprehensive web quality audit covering performance, accessibility, SEO, best practices, and browser automation testing. Supports automated testing with PinchTab for headless/headed browser control, multi-instance orchestration, and token-efficient content extraction.
memos-recall
Recall memories from MemOS Cloud before responding. Use this skill when you need context about user's previous conversations, preferences, or decisions.
openclaw-deploy
OpenClaw/NEUXSBOT 全平台部署指南 - 支持 Mac、Windows、Linux 三平台一键安装。包含环境准备、安装配置、AI模型设置、消息平台对接、故障排除等完整流程。
Huamu668 Openclaw Security
Skill by huamu668