deep-scraper
High-performance containerized web scraper (Docker + Crawlee + Playwright). Use when user mentions any of these: 爬虫, 爬取, 抓取, 采集, 数据采集, 爬数据, 抓数据, 获取数据, scrape, crawl, extract, fetch data, pull data, 亚马逊, Amazon, ASIN, BSR, Best Sellers, 畅销榜, 热销榜, 新品榜, 飙升榜, 排行榜, 选品, 竞品分析, 竞品调研, 市场调研, 品类分析, 类目分析, 产品调研, 月销量, bought in past month, 销量, 评论数, 价格对比, YouTube, 视频字幕, 转录, transcript, 网页内容, 网站数据, 页面抓取, 动态页面, TikTok, Twitter, X, 社交媒体数据, 帖子内容, 关键词搜索, 搜索结果, search results, 产品详情, 产品信息, listing数据, listing分析, top 100, top sellers, 热门产品, 爆款, 跑量款, 价格带, 评分分布, review分析, 评论分析
Why use this skill?
Use Deep Scraper for automated Amazon data collection, YouTube transcript extraction, and general web scraping in a containerized environment.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/jiafar/deep-scraper-amazon
What This Skill Does
Deep Scraper is a containerized web data extraction skill for the OpenClaw ecosystem, built on Docker, Crawlee, and Playwright. It provides specialized logic for high-traffic platforms like Amazon and YouTube, plus a general-purpose scraper for other websites. Whether you need to analyze market trends, gather competitor pricing, or extract video transcripts, Deep Scraper handles dynamic rendering, anti-scraping measures, and request orchestration, and returns the page content as structured data.
Installation
To integrate this skill, run the following command in your terminal:
clawhub install openclaw/skills/skills/jiafar/deep-scraper-amazon
Ensure your system has Docker installed and running. Before first use, build the local image:
docker build -t clawd-crawlee skills/deep-scraper/
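Before running the build, it can help to confirm that Docker is actually installed and that its daemon is reachable. The following is a minimal sketch of such a check, not part of the skill itself; the function name docker_ready is hypothetical, and the only external command it relies on is the standard docker info.

```python
import shutil
import subprocess


def docker_ready(timeout_s: float = 10.0) -> bool:
    """Return True if the docker CLI is on PATH and the daemon responds."""
    if shutil.which("docker") is None:
        return False  # CLI not installed, or not on PATH
    try:
        # `docker info` exits non-zero when the daemon cannot be reached.
        result = subprocess.run(
            ["docker", "info"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0
```

If docker_ready() returns False, start Docker first and then run the docker build command above.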
Use Cases
- E-commerce Intelligence: Automate your market research by tracking Amazon Best Sellers, product listings, pricing fluctuations, and competitor sales performance (bought-in-past-month metrics).
- Content Processing: Quickly extract YouTube video transcripts or descriptions for summarization, content repurposing, or SEO keyword research.
- Web Data Aggregation: Scrape raw text data from arbitrary websites, social media posts, or news portals, effectively turning messy HTML pages into clean, usable JSON.
- Trend Analysis: Identify emerging products or high-growth items by scraping the Amazon Movers & Shakers or New Releases categories.
Example Prompts
- "Check the current pricing and monthly sales volume for this product: https://www.amazon.com/dp/B001TQ6IHS"
- "Can you extract the full transcript for this YouTube video? https://www.youtube.com/watch?v=ExampleID"
- "Run a market survey on the top 100 electronics best sellers on Amazon and tell me the average price and rating distribution."
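The last prompt asks for an average price and a rating distribution. As a rough illustration of that aggregation step, the sketch below works on a list of product records. The record shape (price and rating keys) is an assumption made for this example; the skill's actual output schema is not documented here.

```python
from collections import Counter
from statistics import mean


def summarize(products):
    """Return (average price, products per whole-star rating bucket)."""
    # Skip records where the scraper found no price or no rating.
    prices = [p["price"] for p in products if p.get("price") is not None]
    avg_price = round(mean(prices), 2) if prices else None
    # Bucket by whole stars: 4.0 to 4.9 counts as 4, and so on.
    buckets = Counter(
        int(p["rating"]) for p in products if p.get("rating") is not None
    )
    return avg_price, dict(sorted(buckets.items()))


# Hypothetical records, for illustration only (not real Amazon data).
sample = [
    {"asin": "EXAMPLE0001", "price": 18.0, "rating": 4.7},
    {"asin": "EXAMPLE0002", "price": 24.5, "rating": 4.3},
    {"asin": "EXAMPLE0003", "price": None, "rating": 3.9},
    {"asin": "EXAMPLE0004", "price": 17.5, "rating": None},
]

print(summarize(sample))  # (20.0, {3: 1, 4: 2})
```

Records with a missing price or rating are skipped rather than counted as zero, so a partially scraped page does not drag the average down.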
Tips & Limitations
- Strategy for Amazon: Note that 'Best Seller' rank data and 'Monthly Sales' data are often on different URL structures. If you need both, run the scraper against a Category page first, then use a secondary search query for specific product depth.
- Performance: Always include the --pages flag if you need deep data aggregation beyond the initial page load.
- Data Privacy: This tool is intended for ethical data gathering. Always respect robots.txt files and the terms of service of the target websites when scaling your operations.
The generic scraper is best used for text-heavy content extraction and may have limitations with highly complex SPAs (Single Page Applications) that require specific cookies or auth tokens.
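One way to act on the robots.txt advice is to check each target URL against the site's rules before queueing it. The sketch below uses Python's standard urllib.robotparser and parses an inline example file so that it runs offline. The helper name allowed and the example rules are assumptions for illustration; whether the skill performs such a check itself is not stated here.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Example rules, for illustration only.
EXAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

print(allowed(EXAMPLE_ROBOTS, "https://example.com/private/data"))  # False
print(allowed(EXAMPLE_ROBOTS, "https://example.com/products"))      # True
```

Against a live site, the same parser can load the file directly with set_url() followed by read() instead of parse().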
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-jiafar-deep-scraper-amazon": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: network-access, data-collection, code-execution
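If clawhub.json already lists other plugins, pasting the fragment by hand risks overwriting them. The sketch below merges the plugin entry into an existing file instead. The helper name enable_plugin is hypothetical, and the file location is left to the caller because it is not specified here.

```python
import json
from pathlib import Path

PLUGIN_ID = "official-jiafar-deep-scraper-amazon"


def enable_plugin(config_path: Path) -> dict:
    """Add or update the plugin entry, keeping any other settings intact."""
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}  # start from an empty config if the file is missing
    plugins = config.setdefault("plugins", {})
    plugins[PLUGIN_ID] = {"enabled": True, "auto_update": True}
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, enable_plugin(Path("clawhub.json")) updates the file in the current directory and returns the resulting configuration.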