jina-reader
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.
Why use this skill?
Learn how to use the Jina Reader skill in OpenClaw to extract markdown, search the web, and fact-check data while protecting your server IP address.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/ericsantos/jina-readerWhat This Skill Does
The jina-reader skill for OpenClaw is a powerful web-interfacing tool that bridges the gap between unstructured web content and actionable LLM data. By leveraging the Jina AI Reader API, this skill transforms complex websites into clean, token-efficient markdown, text, or even screenshots. Its primary strength lies in its ability to abstract web requests through Jina's proxy infrastructure, ensuring that your local server IP address remains hidden and secure. Whether you need to scrape specific DOM elements using CSS selectors, perform broad web searches to stay updated, or fact-check specific claims through the grounding mode, this tool provides a highly configurable interface for advanced web data retrieval.
Installation
To integrate this skill into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/ericsantos/jina-reader
Once installed, ensure you have your API key ready to maximize performance. You can obtain a free-tier key by visiting the Jina AI portal. Once you have your key, export it to your environment variables:
export JINA_API_KEY="your_key_here"
This setup allows the script to authenticate requests to the Jina API, enabling high-performance extraction and search capabilities directly from your CLI or agent workflows.
Use Cases
- Research & Content Aggregation: Quickly turn deep-link articles into structured markdown to feed into summarization agents.
- Automated Fact-Checking: Use the grounding mode to verify claims against the latest web data, perfect for validating output generated by models prone to hallucinations.
- Market Intelligence: Use the search mode to pull the top 5 results for industry-specific queries, scraping their content simultaneously for comparative analysis.
- Dynamic Site Scraping: Bypass traditional limitations of static scrapers by using the headless Chrome rendering engine to handle JavaScript-heavy sites.
Example Prompts
- "OpenClaw, use jina-reader to search for the latest advancements in quantum computing research from January 2025 and summarize the top three breakthroughs."
- "Extract the main content from this URL [link] but make sure to remove all navigation elements and footer links using the removal flags."
- "Is it true that the Eiffel Tower grows in the summer? Please verify this claim using the grounding mode and explain why."
Tips & Limitations
- CSS Selectors: Always inspect the page source beforehand; using
--selectorsignificantly reduces token usage and noise in your markdown output. - Geo-Proxies: Utilize the
--proxyflag (e.g., 'br', 'us') to access region-locked content that might otherwise be unavailable from your current network location. - Costs: While the basic read mode is highly affordable, be mindful that the 'grounding' mode has higher token costs and latency (up to 30s) due to the verification logic involved. For high-frequency data collection, consider caching strategies or the
--nocacheflag only when absolute freshness is required.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-ericsantos-jina-reader": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api
Related Skills
serpapi
Search Google via SerpAPI (Google Search, Google News, Google Local). Use when you need to search the web, find news articles, or look up local businesses. Supports country/language targeting for region-specific results.
relay-to-agent
Relay messages to AI agents on any OpenAI-compatible API. Supports multi-turn conversations with session management. List agents, send messages, reset sessions.
messaging
NexusMessaging Protocol client — ephemeral agent-to-agent sessions. Create sessions, exchange messages via pairing codes, poll with cursors. Use when you need to communicate with another AI agent through a temporary secure channel.
perplexity
Deep search via Perplexity API. Three modes: search (quick facts), reason (complex analysis), research (in-depth reports). Returns AI-grounded answers with citations.
confidant
Secure secret handoff and credential setup wizard for AI agents. Use when you need sensitive information from the user (API keys, passwords, tokens) or need to save credentials to config files. Never ask for secrets via chat — use Confidant instead.