Playwright Scraper
Skill by 3coco3
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/3coco3/playwright-scraper
What This Skill Does
The Playwright Scraper by 3coco3 is an MCP skill that lets AI agents work with complex, JavaScript-heavy websites. Traditional HTTP request libraries fetch only the initial HTML and often miss content that is rendered client-side. This skill instead drives a real browser through Playwright Extra with the stealth plugin, which allows OpenClaw to execute JavaScript, wait for client-side frameworks such as React to finish rendering, and simulate human-like interaction patterns. By managing browser fingerprints, rotating User-Agent strings, and automating scroll behavior, the skill reduces the likelihood of being blocked by common anti-bot measures such as Cloudflare or PerimeterX, so your agent can gather data from sites that would otherwise reject automated scrapers. It does not defeat every protection; see Tips & Limitations.
Installation
To integrate this skill into your environment, ensure you have Playwright and its Chromium dependencies installed within the target skill directory. Run the following command in your terminal:
clawhub install openclaw/skills/skills/3coco3/playwright-scraper
Ensure that your OpenClaw environment has adequate system permissions to spawn headless browser processes.
Use Cases
This skill is well suited to tasks such as:
- Extracting pricing data from e-commerce platforms that load inventory dynamically.
- Gathering research data from sites that require full JavaScript execution to show results.
- Monitoring job boards or listing services that use aggressive anti-scraping protections.
- Archiving content from single-page applications (SPAs) that do not provide raw HTML feeds.
Example Prompts
- "Scrape the product details and price from this URL: https://example-store.com/item/123, making sure to capture the final rendered output."
- "Use the Playwright Scraper to visit this tech blog and extract all article headers and their respective publication dates."
- "Go to this job search page, run a search for 'AI Engineer', and list the first five job titles found on the results page."
Tips & Limitations
- Resource Usage: Because this tool spins up a browser instance, it consumes significantly more RAM and CPU than a standard HTTP request. Avoid running multiple heavy scraping tasks simultaneously.
- Rate Limiting: Even with stealth features, excessive scraping can trigger IP-based rate limiting. Consider rotating proxies if you are planning to perform high-volume data collection.
- Headless Limitations: While the stealth plugin is effective, some advanced detection systems (like Turnstile or Proof-of-Work challenges) may still occasionally challenge the agent. If you encounter a block, try adding a brief delay or adjusting your request frequency.
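The advice above about adding delays and lowering request frequency can be sketched as a retry wrapper with exponential backoff and random jitter. This is a minimal illustration, not part of the skill itself: `call_with_backoff`, `backoff_delays`, and the default values for `retries`, `base`, and `cap` are hypothetical names and numbers chosen for the example, and `task` stands in for whatever call triggers a scrape in your setup.

```python
import random
import time
from typing import Callable, List, Optional, TypeVar

T = TypeVar("T")


def backoff_delays(retries: int, base: float = 2.0, cap: float = 60.0,
                   rng: Optional[random.Random] = None) -> List[float]:
    """Return one randomized delay in seconds per retry ("full jitter")."""
    rng = rng or random.Random()
    # Delay i is drawn uniformly from [0, min(cap, base * 2**i)], so the
    # upper bound doubles on each retry until it reaches the cap.
    return [rng.uniform(0.0, min(cap, base * 2 ** i)) for i in range(retries)]


def call_with_backoff(task: Callable[[], T], retries: int = 4,
                      base: float = 2.0, cap: float = 60.0,
                      sleep: Callable[[float], None] = time.sleep) -> T:
    """Run task(); if it raises, wait and retry up to `retries` more times."""
    delays = backoff_delays(retries, base, cap)
    for attempt in range(retries + 1):
        try:
            return task()
        except Exception:
            if attempt == retries:
                raise  # out of retries: surface the last error to the caller
            sleep(delays[attempt])
    raise AssertionError("unreachable when retries >= 0")
```

The jitter keeps repeated attempts from arriving at a fixed rhythm, and the cap bounds the longest wait. The `sleep` parameter exists only so the waiting can be swapped out in tests. Because each attempt may launch a browser, running such calls one at a time is consistent with the resource-usage tip.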
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-3coco3-playwright-scraper": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: network-access, code-execution
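If your clawhub.json already lists other plugins, pasting the fragment above by hand risks overwriting them. The sketch below merges the entry in instead. It assumes only the structure shown in that fragment (a top-level "plugins" object keyed by plugin name); the `enable_plugin` helper and the file location you pass to it are hypothetical, so adjust them to your installation.

```python
import json
from pathlib import Path

# Key and settings copied from the clawhub.json fragment shown above.
PLUGIN_KEY = "official-3coco3-playwright-scraper"
PLUGIN_SETTINGS = {"enabled": True, "auto_update": True}


def enable_plugin(config_path: Path) -> dict:
    """Add the plugin entry to config_path, keeping any existing settings."""
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}  # start from an empty config if the file is missing
    # setdefault leaves other entries under "plugins" untouched.
    config.setdefault("plugins", {})[PLUGIN_KEY] = dict(PLUGIN_SETTINGS)
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, `enable_plugin(Path("clawhub.json"))` would update a config file in the current directory, creating it if it does not exist.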