browser-automation
Control Chrome browser with AI using MCP protocol. Use when users want to automate browser tasks, take screenshots, fill forms, click elements, navigate pages, search browsing history, manage bookmarks, or perform any browser-based automation. Works with your existing Chrome browser and login sessions.
Why use this skill?
Use the browser-automation skill to let OpenClaw control your Chrome browser. Automate clicks, forms, screenshots, and history searches seamlessly via MCP.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/femto/mcp-chromeWhat This Skill Does
The browser-automation skill is a powerful bridge that connects OpenClaw to your local Google Chrome environment via the MCP (Model Context Protocol). By leveraging this skill, your AI agent gains the ability to interact with the web just like a human user. It can navigate to specific URLs, manipulate web page elements, extract text content, manage browser sessions, and even capture visual representations of sites through screenshots. Because it integrates directly with your existing browser instance, it retains your login sessions, cookies, and browsing context, making it ideal for automating recurring web-based tasks that would otherwise require manual intervention.
Installation
Installation follows a three-part process to ensure secure communication between OpenClaw and your browser:
- Native Bridge: Install the bridge via npm or pnpm using
npm install -g mcp-chrome-bridgerfollowed bymcp-chrome-bridger registerto enable system-level communication. - Chrome Extension: Download the latest MCP Chrome Extension from GitHub releases. Install it by navigating to
chrome://extensions/in Chrome, enabling "Developer mode," and selecting "Load unpacked" to point to the downloaded directory. - MCP Client Configuration: Depending on your environment, add the server configuration to your OpenClaw settings using either the Streamable HTTP method (recommended for flexibility) or the STDIO method (for local, process-isolated communication).
Use Cases
This skill is perfect for users who want to save time on repetitive browser workflows. Common use cases include auto-filling complex forms, scraping structured data from websites into spreadsheets, managing long-term research by searching your history and bookmarks, or performing regression testing by simulating user clicks across multiple pages. It is also excellent for monitoring specific web elements or network requests when debugging applications.
Example Prompts
- "Open my Gmail tab, find the latest email from 'Support', and take a screenshot of the body content."
- "Go to the project management dashboard, add a bookmark for the current page, and list all my existing project-related bookmarks."
- "Search my browser history for pages I visited last Tuesday about 'OpenClaw documentation' and summarize the key information from them."
Tips & Limitations
For the best results, ensure your browser is kept in a stable state. Complex dynamic sites (like those using heavy shadow DOMs) may occasionally require retries. The tool works best when you keep the browser window focused or visible. Be mindful that excessive automation may trigger bot-detection mechanisms on some websites; consider setting reasonable delays between automated actions to maintain a natural browsing profile.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-femto-mcp-chrome": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, data-collection
Related Skills
worldbook
AI's Knowledge Base CLI - Query and manage world knowledge for AI agents. Use when users want to search knowledge, add knowledge sources, or interact with the worldbook knowledge base. This is a CLI-first approach that treats AI agents as first-class citizens.
browser-automation
Control Chrome browser with AI using MCP protocol. Use when users want to automate browser tasks, take screenshots, fill forms, click elements, navigate pages, search browsing history, manage bookmarks, or perform any browser-based automation. Works with your existing Chrome browser and login sessions.