browser-automation
Control your Chrome browser with AI over MCP (Model Context Protocol). Use when users want to automate browser tasks, take screenshots, fill forms, click elements, navigate pages, search browsing history, manage bookmarks, or perform any other browser-based automation. Works with your existing Chrome browser and login sessions.
Why use this skill?
Automate your Chrome browser with OpenClaw: fill forms, manage tabs, search history, and extract data using AI.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/femto/browser-automation-2
What This Skill Does
The Browser Automation skill lets OpenClaw control your Google Chrome browser through MCP (Model Context Protocol). It connects the AI agent to your local browser instance, so the agent can navigate to websites, interact with UI elements, manage tabs, and extract data. Because it works inside your existing, authenticated browser sessions, you do not need to log in again for each task, and the AI operates in the same browser profile you already use.
Installation
To get started, follow these three steps:
- Install the Native Bridge by running 'npm install -g mcp-chrome-bridger' or 'pnpm install -g mcp-chrome-bridger', followed by 'mcp-chrome-bridger register'.
- Install the Chrome Extension by downloading the latest release from GitHub, enabling 'Developer mode' on Chrome's extensions page (chrome://extensions), and loading the extracted folder with 'Load unpacked'.
- Configure your MCP client. Use the Streamable HTTP configuration for persistent connections, or STDIO for local execution. Detailed configuration snippets for both methods are provided in the main skill documentation.
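The two transport options in the last step differ mainly in what the client entry contains: a URL to connect to for Streamable HTTP, or a command to spawn for STDIO. As a rough illustration only, the Python sketch below builds both shapes. The `mcpServers` key, the `chrome-mcp` server name, the `type` value, the `node` command, and every `<...>` placeholder are assumptions rather than values taken from this skill, and key names vary between MCP clients; copy the real values from the main skill documentation.

```python
import json

# Hypothetical shapes only: key names and placeholders are assumptions,
# not values confirmed by the skill documentation.

def http_entry(url: str) -> dict:
    """Client entry for a persistent Streamable HTTP connection."""
    return {"type": "streamable-http", "url": url}

def stdio_entry(command: str, args: list[str]) -> dict:
    """Client entry that spawns the bridge locally and talks over stdin/stdout."""
    return {"command": command, "args": args}

def client_config(name: str, entry: dict) -> dict:
    """Wrap one server entry under the (assumed) top-level 'mcpServers' key."""
    return {"mcpServers": {name: entry}}

# Placeholders stay as placeholders; fill them in from the skill documentation.
http_config = client_config("chrome-mcp", http_entry("<streamable-http-url>"))
stdio_config = client_config(
    "chrome-mcp", stdio_entry("node", ["<path-to-bridge-stdio-script>"])
)

if __name__ == "__main__":
    print(json.dumps(http_config, indent=2))
    print(json.dumps(stdio_config, indent=2))
```

Only one of the two entries is needed at a time; which one applies depends on whether your MCP client connects to a running server or launches the process itself.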
Use Cases
This skill is designed for users who want to streamline their daily web-based workflows. Common use cases include:
- Research and Synthesis: Having the AI browse multiple tabs, perform semantic searches across them, and summarize findings.
- Form Automation: Asking the AI to fill out tedious forms, registration pages, or data entry interfaces automatically.
- Visual Debugging: Taking high-fidelity screenshots of specific webpage elements to troubleshoot layout issues or keep track of UI changes over time.
- History and Bookmark Management: Using natural language to find buried information in your history or organizing your bookmarks without manual sorting.
Example Prompts
- "Look at my currently open tabs, find the one with the project documentation, and summarize the key requirements for the API integration."
- "Please go to the registration page for the upcoming conference, fill in my saved contact details, and let me know when it's ready for final submission."
- "Search my browser history for the article I read last week about quantum computing and save the URL to my 'Research' bookmark folder."
Tips & Limitations
To get the best results, make sure Chrome is running and the extension is connected before starting a task. The skill can only act on what the browser can currently reach: if a site requires multi-factor authentication, you may need to complete that step manually. Always supervise the agent when it acts on websites that handle sensitive financial or personal information.
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-femto-browser-automation-2": {
      "enabled": true,
      "auto_update": true
    }
  }
}
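If your clawhub.json already lists other plugins, pasting the fragment wholesale would overwrite them. Here is a minimal Python sketch of merging the entry instead, assuming the file is plain JSON with a top-level "plugins" object as in the fragment above. The `enable_plugin` and `update_file` helpers are hypothetical and not part of any clawhub tooling.

```python
import json
from pathlib import Path

# Plugin id copied from the clawhub.json fragment above.
PLUGIN_ID = "official-femto-browser-automation-2"

def enable_plugin(config: dict, plugin_id: str = PLUGIN_ID) -> dict:
    """Return a copy of config with the plugin enabled, keeping other entries."""
    merged = dict(config)
    plugins = dict(merged.get("plugins", {}))
    plugins[plugin_id] = {"enabled": True, "auto_update": True}
    merged["plugins"] = plugins
    return merged

def update_file(path: Path) -> None:
    """Read the config if it exists, merge the plugin entry, and write it back."""
    config = json.loads(path.read_text()) if path.exists() else {}
    path.write_text(json.dumps(enable_plugin(config), indent=2) + "\n")
```

Calling `update_file(Path("clawhub.json"))` from the directory that holds the file would apply the change in place; back up the file first if it contains settings you care about.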
Flags: network-access, data-collection, external-api