clawpod
Fetch any web page via Massive's Unblocker REST API. Handles JavaScript rendering, anti-bot protection, CAPTCHAs, paywalls, and geo-restrictions server-side — returns clean extracted markdown. Use for any web fetching, scraping, or content extraction task where standard HTTP requests would be blocked.
Why use this skill?
Easily scrape JavaScript-heavy, geo-restricted, or anti-bot protected websites. Get clean markdown content directly from the web with Massive Unblocker.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/codeninja23/clawpod-mainWhat This Skill Does
The clawpod skill acts as a powerful web-scraping interface for the Massive Unblocker REST API. It bypasses complex web barriers—such as Cloudflare challenges, CAPTCHAs, paywalls, and geo-blocking—by utilizing a headless browser environment to render and extract page content. Whether a site requires heavy JavaScript execution or is protected by sophisticated anti-bot systems, clawpod processes the page server-side and returns the information as clean, structured markdown. This makes it an essential tool for turning inaccessible web content into usable data for your AI agent.
Installation
To install this skill, use the command: clawhub install openclaw/skills/skills/codeninja23/clawpod-main. Before running it, ensure the MASSIVE_UNBLOCKER_TOKEN environment variable is set. If you lack a token, visit clawpod.joinmassive.com/waitlist to register. The skill also benefits from the node-html-markdown package for superior formatting; while optional, it is recommended to have it installed globally via npm install -g node-html-markdown to ensure optimal readability of the retrieved content.
Use Cases
- Market Research: Extracting product pricing, reviews, or stock availability from e-commerce sites that block standard crawlers.
- Content Aggregation: Converting dynamic, JS-heavy news articles into clean markdown for summarization or analysis.
- Automated Testing: Verifying the presence of specific UI elements on websites that employ anti-bot protections.
- Data Mining: Gathering research data from academic or government portals that require browser-like headers and wait-times to access.
Example Prompts
- "Fetch the latest pricing and technical specifications for the new GPU release from this URL: https://example-tech-store.com/new-release. Please provide the output in a markdown table."
- "Go to this news article, bypass any consent walls, and provide a 3-bullet point summary of the author's argument: https://news-site-protected.com/deep-dive-analysis."
- "Search for the current residential real estate listings on this regional site: https://regional-property-listing.com. Extract the property names and prices and save them to a file."
Tips & Limitations
Note that each fetch operation can take up to two minutes to complete as it handles retries and CAPTCHA resolution. Use the delay parameter if you are targeting pages that load content asynchronously after the initial DOM load. If you require raw data for parsing tools, set format=raw to bypass markdown conversion. Be mindful of your API usage, as the complexity of the target site directly impacts the latency of the returned data. Always prefer residential IPs for high-security sites to minimize the likelihood of detection.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-codeninja23-clawpod-main": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-read, file-write, external-api
Related Skills
hubspot
Query and manage HubSpot CRM data via the HubSpot API. Use when you need to search or manage contacts, companies, deals, tickets, or pipelines. Supports creating, updating, and associating CRM objects. Calls api.hubapi.com directly with no third-party proxy.
sentry
Read Sentry issues, events, and production errors via the Sentry REST API. Use when the user wants to inspect errors, list recent issues, get stack traces, or summarize production health. Requires SENTRY_AUTH_TOKEN with read-only scopes.
airtable
Read Airtable bases, tables, and records directly via the Airtable API. Use when you need spreadsheet/database data from Airtable. Calls api.airtable.com directly with no third-party proxy.
clawpod
Read any website or search Google, even when sites block bots or are geo-restricted. Handles CAPTCHAs, JavaScript rendering, and anti-bot protection server-side via residential proxies. Returns HTML or structured JSON.
stripe
Query and manage Stripe data via the Stripe API. Use when you need to list charges, customers, invoices, subscriptions, payment intents, refunds, products, or prices. Supports filtering, pagination, and creating/updating customers and refunds. Calls api.stripe.com directly with no third-party proxy.