Official Verified data analysis Safety 4/5

content-parser

Extract and parse content from URLs. Triggers when a user provides a URL to extract content from, when another skill needs to parse source material, or on phrases such as "parse this URL", "extract content", "解析链接" ("parse link"), and "提取内容" ("extract content").

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/0xfango/content-parser

What This Skill Does

The content-parser skill is an OpenClaw agent tool that turns raw web pages into clean, structured data for AI workflows. Given a valid HTTP(S) URL, it extracts the page's text, metadata, and references. The skill keeps you in control: it asks for manual confirmation before any data is processed or saved to your system.
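
To make "text, metadata, and references" concrete, here is a minimal Python sketch of that kind of extraction using only the standard library. It is an illustration under stated assumptions, not the skill's actual implementation: the `TextExtractor` class, the `parse_html` helper, the choice of skipped tags, and the output keys are all hypothetical.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect the title, visible text, and link targets of an HTML page."""

    # Hypothetical choice of tags whose content is treated as clutter.
    SKIPPED_TAGS = {"script", "style", "nav"}

    def __init__(self):
        super().__init__()
        self.title = ""
        self.text_parts = []
        self.links = []
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1
        elif self._skip_depth:
            return  # ignore everything nested inside a skipped tag
        elif tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        text = data.strip()
        if not text or self._skip_depth:
            return
        if self._in_title:
            self.title += text
        else:
            self.text_parts.append(text)


def parse_html(html: str) -> dict:
    """Return a structured view of the page (hypothetical output shape)."""
    extractor = TextExtractor()
    extractor.feed(html)
    extractor.close()
    return {
        "title": extractor.title,
        "text": " ".join(extractor.text_parts),
        "links": extractor.links,
    }
```

A static parser like this never executes JavaScript, so it only sees what is present in the HTML as served.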

Installation

To add this skill to your OpenClaw environment, execute the following command in your terminal:

clawhub install openclaw/skills/skills/0xfango/content-parser

Once installed, the skill runs a mandatory API key check against your shared configuration. On first use, it prompts you to create a local configuration file at .listenhub/content-parser/config.json, where you can set your preferences for automatic file handling.
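
As a rough illustration of the first-run step described above, the following Python sketch creates the configuration file if it is missing and then reads it back. Only the path .listenhub/content-parser/config.json comes from this page. The `auto_save` key and the `load_or_create_config` helper are hypothetical placeholders, since the real config schema is not documented here.

```python
import json
from pathlib import Path

# Path taken from the description above, resolved relative to a base directory.
CONFIG_RELATIVE_PATH = Path(".listenhub") / "content-parser" / "config.json"

# Hypothetical default; the skill's real config keys are not listed on this page.
DEFAULT_CONFIG = {"auto_save": False}


def load_or_create_config(base_dir: Path) -> dict:
    """Return the stored config, writing the defaults first if the file is missing."""
    config_path = base_dir / CONFIG_RELATIVE_PATH
    if not config_path.exists():
        config_path.parent.mkdir(parents=True, exist_ok=True)
        config_path.write_text(json.dumps(DEFAULT_CONFIG, indent=2), encoding="utf-8")
    return json.loads(config_path.read_text(encoding="utf-8"))
```

Calling `load_or_create_config(Path.cwd())` would place the file under the current project directory. Whether the skill resolves the path against the project or the home directory is an assumption to check in your own setup.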

Use Cases

  • Academic & Research: Quickly extract text from long-form articles or journals to summarize key findings without the clutter of advertisements or navigation menus.
  • Content Aggregation: Fetch material from multiple links to prepare summaries or reports for team documentation.
  • Automated Workflows: Use the parser as a prerequisite for other agent tasks, such as feeding parsed web content into an LLM for sentiment analysis or translation.
  • Archiving: Automatically download and store relevant page content locally to ensure you retain a copy of the information even if the source URL becomes unavailable.

Example Prompts

  1. "I need to parse this URL: https://example-blog.com/tech-trends-2024, extract the main body text, and summarize it for my morning meeting."
  2. "Can you extract the content from this link so I can use it as a reference for my research report?"
  3. "解析链接 https://news-source.com/article-123 并提取所有主要内容。" (English: "Parse the link https://news-source.com/article-123 and extract all the main content.")

Tips & Limitations

  • Configuration: Always use the AskUserQuestion flow to verify settings. If you find yourself changing the same options repeatedly, update your config.json through the setup flow instead.
  • Hard Constraints: This skill is designed strictly for content extraction and is optimized for standard URL-based retrieval. Do not use it to scrape dynamic, JavaScript-heavy web apps that require a full browser engine.
  • Safety: Verify URLs before submitting them to avoid malicious redirects or unauthorized data collection. The skill requires an active internet connection.
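
One way to act on the safety tip is a quick local check before handing a URL to the skill. The Python sketch below accepts only http and https URLs that include a host. The `is_http_url` helper is hypothetical and is not part of the skill. It also does not follow or inspect redirects, so it is a first filter rather than a full safety check.

```python
from urllib.parse import urlparse


def is_http_url(url: str) -> bool:
    """Return True if url uses the http or https scheme and names a host."""
    try:
        parsed = urlparse(url.strip())
    except ValueError:
        # urlparse raises ValueError on malformed input such as a broken IPv6 host.
        return False
    return parsed.scheme in ("http", "https") and bool(parsed.hostname)
```

For example, `is_http_url("https://example-blog.com/tech-trends-2024")` passes, while `is_http_url("ftp://example.com/file")` and a bare `"example.com"` are rejected.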

Metadata

Author: @0xfango
Stars: 4473
Views: 0
Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-0xfango-content-parser": {
      "enabled": true,
      "auto_update": true
    }
  }
}
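
If your clawhub.json already holds other settings, pasting the snippet by hand risks overwriting them. The Python sketch below merges the entry in instead. The plugin ID and the two flags are copied from the snippet above, while the `enable_plugin` helper and the assumption that clawhub.json is plain JSON with a top-level "plugins" object are illustrative only.

```python
import json
from pathlib import Path

# Plugin ID copied from the configuration snippet above.
PLUGIN_ID = "official-0xfango-content-parser"


def enable_plugin(config_path: Path) -> dict:
    """Add the content-parser entry to clawhub.json, preserving other settings."""
    config = {}
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    # Create the "plugins" object if it is missing, then set this plugin's entry.
    plugins = config.setdefault("plugins", {})
    plugins[PLUGIN_ID] = {"enabled": True, "auto_update": True}
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Running `enable_plugin(Path("clawhub.json"))` from the directory that holds the file leaves any existing plugin entries untouched.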

Tags (AI)

#web-scraping #data-extraction #automation #parser
Safety Score: 4/5

Flags: network-access, file-write, file-read, external-api