ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified data analysis Safety 4/5

wechat-article-spider

微信公众号文章爬虫 - 将微信公号文章转换为 Markdown + 本地图片

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/chenchaoqun/wechat-article-spider
Or

What This Skill Does

The wechat-article-spider is a specialized OpenClaw skill designed to streamline the archival and content extraction process for WeChat official account articles. It acts as an automated bridge between the closed ecosystem of WeChat's mobile-first interface and your local development environment. By inputting a standard WeChat article URL, the skill initiates a targeted scraping sequence that parses the raw HTML content, preserves the article formatting, and transforms the structure into a clean, portable Markdown file. A critical component of this process is the automatic harvesting of all embedded images, which are downloaded and stored locally in an organized subdirectory, ensuring that your archives remain self-contained and readable even if the original source is deleted or restricted.

Installation

To integrate this skill into your OpenClaw environment, execute the following command in your terminal: clawhub install openclaw/skills/skills/chenchaoqun/wechat-article-spider

Ensure that you have Python 3.x installed on your system. Once installed, navigate to the scripts directory of the skill folder, and execute pip install -r requirements.txt to install essential dependencies, including requests, beautifulsoup4, and lxml. These libraries are vital for handling HTTP requests, parsing complex DOM structures, and efficient data processing.

Use Cases

This skill is perfect for content curators, researchers, and developers who need to aggregate information from WeChat for long-term storage or analysis. It is frequently used for creating personal knowledge bases, drafting archival backups of industry reports, or migrating legacy WeChat content into static site generators like Jekyll, Hugo, or Obsidian. Additionally, writers can use it to repurpose research materials into professional documentation without manual copy-pasting.

Example Prompts

  1. "@openclaw please download the WeChat article at https://mp.weixin.qq.com/s/example-link and save it to my current working directory."
  2. "@openclaw use the wechat-article-spider to archive this article: https://mp.weixin.qq.com/s/xyz123 into a folder named /archive/projects/research."
  3. "@openclaw, can you convert the WeChat article at https://mp.weixin.qq.com/s/abc456 to markdown? Please make sure to save the images locally."

Tips & Limitations

Be mindful that WeChat implements robust anti-scraping measures. If a request fails, the server may be temporarily throttling traffic; it is advised to wait a few minutes before retrying. Furthermore, the tool primarily focuses on static content. Complex layouts, lazy-loaded images, or interactive widgets may not render perfectly in the final Markdown output. Always verify that the outputted directory contains the expected image assets before integrating them into other projects.

Metadata

Stars3840
Views1
Updated2026-04-06
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-chenchaoqun-wechat-article-spider": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#wechat#crawler#markdown#archival#scraper
Safety Score: 4/5

Flags: network-access, file-write, file-read