ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified utilities Safety 4/5

Url Fetcher

Skill by johstracke

Why use this skill?

Download web content to Markdown without API keys or dependencies. Secure, path-validated, and built for OpenClaw research and aggregation tasks.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/johstracke/url-fetcher
Or

What This Skill Does

The URL Fetcher is a lightweight, dependency-free OpenClaw skill designed to retrieve web content directly from your agent interface. By leveraging Python’s built-in urllib library, it ensures that your environment remains uncluttered by third-party packages or complex API dependencies. The tool is purpose-built for scenarios where speed and simplicity are prioritized over rendering complex JavaScript-heavy applications. It provides a robust suite of safety checks, including automated URL validation to prevent access to internal networks or localhost, and path validation to ensure file writes are restricted to authorized directories like your workspace or temporary folders. Beyond simple text retrieval, it includes a regex-based engine to convert HTML into readable Markdown, making it ideal for processing web articles, documentation, or static pages into formats easily consumed by LLMs or stored for later review.

Installation

To integrate this utility into your OpenClaw environment, use the following CLI command:

clawhub install openclaw/skills/skills/johstracke/url-fetcher

Ensure that you have sufficient write permissions in your target workspace directory before attempting to save fetched content, as the internal security layer strictly enforces path constraints to protect your system files.

Use Cases

  • Research Automation: Rapidly gather content from multiple URLs and convert them into organized Markdown files for knowledge base indexing.
  • Content Curation: Aggregate text from various static web pages to build local datasets for analysis without needing browser automation or heavy scraping frameworks.
  • Documentation Offline-Use: Download project documentation or technical specifications directly to your machine for local reading or AI processing.
  • Utility Scripting: Use as a building block in larger pipelines where web data must be pulled without triggering anti-scraping mechanisms that often block large, headless browser instances.

Example Prompts

  1. "Fetch the content from https://example-research-blog.com/post1 and save it as a markdown file in my workspace/research folder."
  2. "Go to https://news-site.org and extract the text content so I can summarize the main points."
  3. "Please grab the page at https://documentation.project.io/api-reference and format the output to a new file named api_docs.md in my home directory."

Tips & Limitations

Because the URL Fetcher utilizes a regex-based parser rather than a full DOM-rendering engine, it cannot execute JavaScript. Consequently, it will not be able to retrieve content from Single Page Applications (SPAs) or websites that require dynamic client-side rendering. For best results, use this tool on static content, news articles, or technical documentation. Additionally, while it handles basic errors and timeouts, be aware that it does not feature an advanced request-throttling mechanism; if you are fetching hundreds of pages, ensure you manually implement pauses to avoid being flagged by server-side rate limits or security filters.

Metadata

Stars1947
Views1
Updated2026-03-04
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-johstracke-url-fetcher": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#web-scraping#markdown#utility#automation#research
Safety Score: 4/5

Flags: network-access, file-write