Web Fetch Vx
Skill by 3511815125
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/3511815125/web-fetch-vx
What This Skill Does
Web Fetch Vx is a content extraction engine built for the OpenClaw ecosystem. It processes web pages, from WeChat public account articles and news sites to personal blogs, and strips away clutter such as advertisements, navigation menus, and sidebars. Backed by three extraction engines (Readability, Firecrawl, and Defuddle), it returns clean, formatted Markdown that is ready for LLM consumption, summarization, or archival. Beyond the body text, it extracts metadata including author, publication date, images, and word count, so your data pipelines receive structured output.
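To make the shape of that output concrete, here is a minimal Python sketch of a record holding the fields listed above. The class name `ExtractedPage` and its field names are hypothetical and are not the skill's actual schema. The word count is a plain whitespace split, which is an assumption and undercounts unspaced CJK text such as WeChat articles.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ExtractedPage:
    """Hypothetical container for one extracted page (not the skill's real schema)."""

    url: str
    markdown: str
    author: Optional[str] = None
    published: Optional[str] = None  # publication date as found on the page
    images: List[str] = field(default_factory=list)

    @property
    def word_count(self) -> int:
        # Whitespace-delimited count; an assumption, the skill may count differently.
        return len(self.markdown.split())


page = ExtractedPage(
    url="https://example.com/post",
    markdown="# Title\n\nHello clean Markdown world.",
    author="Jane Doe",
)
print(page.word_count)
```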
Installation
To integrate this skill into your environment, run the following command in your OpenClaw terminal:
clawhub install openclaw/skills/skills/3511815125/web-fetch-vx
Ensure that your OpenClaw instance is updated to version 2.0.0 or higher to leverage the full multi-engine capabilities and stability improvements included in this release.
Use Cases
- Research and Knowledge Management: Automate the collection of technical blog posts and research papers into your personal Markdown-based knowledge base.
- Content Curation: Extract full articles from WeChat or newsletters to summarize trends for your team without navigating distracting web layouts.
- Data Analysis: Collect raw text from multiple news sources simultaneously using batch processing to feed into sentiment analysis or trend tracking models.
- Offline Reading: Transform dynamic web articles into clean, readable Markdown files for offline storage or e-reader consumption.
Example Prompts
- "Use Web Fetch Vx to extract the content from this URL [link] and save it as a markdown file in my 'Articles' folder, including the author and publish date."
- "Please fetch the latest news from this link [link], give me a summary, and extract all the images used in the post."
- "Go through these 5 URLs [url1, url2, url3, url4, url5], extract the main text content, and provide a consolidated report with the word count for each."
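The batch prompt above amounts to mapping an extraction step over a list of URLs and tabulating a word count for each. A minimal Python sketch of that pattern follows. It assumes a caller-supplied `fetch_markdown` function, a hypothetical stand-in for whatever actually invokes the skill; `fake_fetch` exists only so the example runs offline.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def word_count_report(
    urls: List[str],
    fetch_markdown: Callable[[str], str],
    max_workers: int = 5,
) -> Dict[str, int]:
    """Fetch each URL concurrently and return {url: word count}.

    `fetch_markdown` is a hypothetical callable returning extracted Markdown.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with `urls`.
        texts = list(pool.map(fetch_markdown, urls))
    return {url: len(text.split()) for url, text in zip(urls, texts)}


def fake_fetch(url: str) -> str:
    # Stand-in for real extraction so the sketch runs without network access.
    return f"Article body for {url}"


report = word_count_report(["https://a.example", "https://b.example"], fake_fetch)
for url, count in report.items():
    print(f"{url}: {count} words")
```

Passing the fetch function in as a parameter keeps the reporting logic testable without touching the network.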
Tips & Limitations
For best results, use the default 'markdown' extraction mode. On highly interactive websites or single-page applications (SPAs) that rely on client-side rendering, results may vary: the tool attempts to fall back to its internal browser engine, but it is not designed to bypass complex anti-bot measures, paywalls, or login-protected content. Make sure URLs are publicly accessible. If a fetch returns empty content, consider increasing the 'timeout' parameter for pages with heavy scripts.
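The advice about empty returns can be sketched as a retry loop that raises the timeout step by step. This is a minimal Python illustration, not the skill's own behavior: the `fetch` callable, its `(url, timeout)` signature, the timeout values, and the assumption that timeouts are in seconds are all hypothetical.

```python
from typing import Callable, List, Optional, Sequence


def fetch_with_growing_timeout(
    url: str,
    fetch: Callable[[str, float], str],
    timeouts: Sequence[float] = (10.0, 30.0, 60.0),
) -> Optional[str]:
    """Retry `fetch` with longer timeouts until it returns non-empty content."""
    for timeout in timeouts:
        content = fetch(url, timeout)
        if content.strip():
            return content
    return None  # every attempt came back empty


calls: List[float] = []


def slow_page(url: str, timeout: float) -> str:
    # Hypothetical fetcher: simulates a script-heavy page needing at least 30 s.
    calls.append(timeout)
    return "# Loaded" if timeout >= 30.0 else ""


result = fetch_with_growing_timeout("https://example.com/heavy", slow_page)
print(result, calls)
```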
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-3511815125-web-fetch-vx": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: network-access, data-collection
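If your clawhub.json already lists other plugins, the fragment above has to be merged into it rather than pasted over it. The following Python sketch shows one way to do that merge on an in-memory dictionary. The plugin key and its two settings are copied from the fragment; the helper name `enable_plugin` and the `some-other-plugin` entry are hypothetical.

```python
import json

PLUGIN_ID = "official-3511815125-web-fetch-vx"  # key copied from the fragment above


def enable_plugin(config: dict) -> dict:
    """Return a copy of `config` with the plugin entry added or updated."""
    merged = dict(config)
    # Copy the plugins mapping so the caller's dictionary is left untouched.
    plugins = dict(merged.get("plugins", {}))
    plugins[PLUGIN_ID] = {"enabled": True, "auto_update": True}
    merged["plugins"] = plugins
    return merged


# Hypothetical existing configuration with one unrelated plugin.
existing = {"plugins": {"some-other-plugin": {"enabled": True}}}
updated = enable_plugin(existing)
print(json.dumps(updated, indent=2))
```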