web-fetcher
Smart web content fetcher - articles and videos from WeChat, Feishu, Bilibili, Zhihu, Toutiao, YouTube, etc. Triggers: '抓取文章', '下载网页', '保存文章', 'fetch URL', '下载视频', '抓取飞书文档', '抓取微信文章', '把这个链接内容保存下来', '下载B站视频', 'download video', 'scrape article'.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alexxxiong/web-fetcherWhat This Skill Does
The web-fetcher is an advanced, intelligent agent skill designed to bridge the gap between unstructured web content and actionable data for OpenClaw. It automatically detects the source platform of a provided URL—ranging from professional social platforms like WeChat and Feishu to multimedia giants like Bilibili and YouTube—and selects the most efficient extraction strategy. By leveraging specialized tools like 'scrapling' for article parsing, 'yt-dlp' for high-quality video acquisition, and 'camoufox' for bypassing sophisticated anti-bot protections on sites like Xiaohongshu, the skill ensures high success rates in data retrieval.
Installation
To integrate this skill into your environment, use the OpenClaw command-line interface:
clawhub install openclaw/skills/skills/alexxxiong/web-fetcher
Ensure your local environment meets the runtime requirements. Based on your specific needs, you may need to install supplemental libraries via pip. For standard article scraping, ensure scrapling is installed. For video tasks, yt-dlp is mandatory. For platforms requiring stealth browsing (e.g., Weibo), the camoufox toolkit must be installed and initialized correctly. Detailed dependency mappings are available in the skill's local documentation.
Use Cases
This skill is perfect for researchers, content curators, and developers who need to aggregate information from diverse web sources into a local knowledge base. It is particularly powerful for:
- Creating offline archives of long-form articles from WeChat and Zhihu.
- Batch-processing video tutorials from Bilibili or YouTube for local viewing.
- Scraping complex documentation from Feishu docs into clean Markdown files.
- Automating the collection of media content from platforms that typically employ strict anti-crawling measures.
Example Prompts
- "抓取这篇文章 https://mp.weixin.qq.com/s/example123 并保存到我的本地笔记文件夹中。"
- "下载这个B站视频 https://b23.tv/abcxyz 到我的视频库,请确保画质选择最高。"
- "我想把这份飞书文档里的内容整理成Markdown,你可以用fetch URL帮我抓取一下吗?"
Tips & Limitations
- Platform-Specific Needs: Always ensure that if you are targeting platforms like Xiaohongshu, your environment has
camoufoxproperly configured to handle the stealth-mode browser requirements. - Data Privacy: Be mindful of platform Terms of Service regarding automated scraping. The fetcher is designed for personal knowledge management.
- Performance: For batch operations, use the
--urls-fileargument to process multiple links efficiently, reducing overhead and improving the speed of your data collection workflows.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-alexxxiong-web-fetcher": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, file-write, file-read, code-execution
Related Skills
inspirai-apispec
API 规范管理工具 - 跨项目 API 文档的初始化、更新、查询与搜索。Triggers: 'API文档', 'API规范', '接口文档', '路由解析', 'apispec', 'API lookup', 'API search'.
inspirai-evo
技能自我进化 - 检测流程问题信号(重复试错、流程中断、代码翻动),生成分析报告,引导改进。Triggers: '流程优化', '技能进化', 'skill evolution', '自我改进', '流程问题', 'workflow analysis'.
web-reader
智能网页阅读器 - 抓取文章/下载视频并归档,支持分析、摘要、衍生。Triggers: '下载这篇文章', '抓取文章', '保存文章', 'fetch URL', '分析这篇文章', '摘要', '总结文章', '下载视频', '抓取微信文章', '抓取飞书文档', '把这个链接保存下来', '下载B站视频', 'download article', 'analyze article', 'summarize'.
inspirai-project
多 Agent 项目管理 - 从想法到立项,自动在 Discord 频道创建项目 Thread 并分配 Agent 任务。支持快速讨论路由、批量立项、项目状态追踪。Triggers: '立项', '创建项目', '新项目', '讨论一下', '项目状态', 'create project', 'discuss', 'project status'
inspirai-deploy
智能部署工具 - 自动检测部署策略,预检查、发布、监控一体化。支持 K8s/Helm、Docker Compose、Vercel、Fly.io。Triggers: '部署', 'deploy', '发布', '上线', '预检查', '部署监控', 'helm upgrade', 'docker compose up'.