Official Verified

RAG

Build, optimize, and debug RAG pipelines with chunking strategies, retrieval tuning, evaluation metrics, and production monitoring.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/ivangdavila/rag

When to Use

Use this skill when the user wants to implement, improve, or troubleshoot a Retrieval-Augmented Generation (RAG) system.

Quick Reference

| Topic | File |
|---|---|
| Pipeline components & architecture | architecture.md |
| Implementation patterns & code | implementation.md |
| Evaluation metrics & debugging | evaluation.md |
| Security & compliance | security.md |

Core Capabilities

  1. Architecture design: Select embedding models, vector DBs, and chunking strategies based on requirements
  2. Implementation: Write ingestion pipelines, query handlers, and update logic
  3. Retrieval optimization: Tune top-k, reranking, and hybrid search parameters
  4. Evaluation: Build test datasets, measure recall/precision, diagnose failures
  5. Production ops: Monitor quality drift, set up alerts, debug degradation
  6. Security: Handle PII detection, access control, and compliance requirements
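
The evaluation capability above mentions measuring recall and precision. A minimal sketch of precision@k and recall@k over a small labeled set might look like the following. The function name `precision_recall_at_k`, the document IDs, and the `eval_set` structure are assumptions made for illustration and are not part of the skill itself.

```python
def precision_recall_at_k(retrieved, relevant, k):
    """Return (precision@k, recall@k) for a single query.

    retrieved: ranked list of document IDs returned by the retriever
    relevant:  set of document IDs labeled as relevant for the query
    """
    top_k = retrieved[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant)
    # Precision divides by k, so a retriever that returns fewer than k docs is penalized.
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical eval dataset: each entry pairs retriever output with human labels.
eval_set = [
    {"retrieved": ["d1", "d4", "d2", "d9"], "relevant": {"d1", "d2", "d3"}},
    {"retrieved": ["d7", "d8", "d5", "d6"], "relevant": {"d5"}},
]

scores = [precision_recall_at_k(q["retrieved"], q["relevant"], k=3) for q in eval_set]
mean_precision = sum(p for p, _ in scores) / len(scores)
mean_recall = sum(r for _, r in scores) / len(scores)
print(f"precision@3={mean_precision:.3f} recall@3={mean_recall:.3f}")
```

Tracking both numbers on a fixed dataset gives a baseline to compare against before changing top-k, chunk size, or reranking.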

Decision Checklist

Before recommending an architecture, ask:

  • What document types and volume?
  • Latency requirements (real-time chat vs batch)?
  • Update frequency (how often do docs change)?
  • Access control needs (who can see what)?
  • Compliance constraints (GDPR, HIPAA, SOC2)?
  • Budget (managed vs self-hosted, embedding costs)?

Critical Rules

  • Never skip access control: filter by permissions at retrieval time, not after results are retrieved
  • Always overlap chunks: a 10-20% overlap reduces context loss at chunk boundaries
  • Evaluate before optimizing: build the eval dataset first, then tune
  • Same embedding model: queries and documents must be embedded with the identical model
  • Monitor similarity scores: a falling average signals drift or other retrieval issues
  • Plan for deletion: GDPR erasure requires re-embedding capability
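
Two of the rules above lend themselves to a short sketch: overlapping chunks, and filtering by permissions before ranking rather than after. The code below is an illustration under stated assumptions, not the skill's implementation. The names `chunk_with_overlap` and `retrieve_with_acl`, the `allowed_groups` field, and the in-memory list standing in for a vector database are all hypothetical.

```python
def chunk_with_overlap(tokens, chunk_size=200, overlap_ratio=0.15):
    """Split a token list into chunks that share `overlap_ratio` of their length."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap_ratio < 1:
        raise ValueError("overlap_ratio must be in [0, 1)")
    overlap = int(chunk_size * overlap_ratio)
    step = chunk_size - overlap  # always >= 1 because overlap < chunk_size
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last chunk already reaches the end of the document
    return chunks


def dot(a, b):
    """Plain dot product, used here as the similarity score."""
    return sum(x * y for x, y in zip(a, b))


def retrieve_with_acl(query_vec, index, user_groups, top_k=3):
    """Rank only the documents the caller may see.

    index: list of dicts with 'id', 'vector', and 'allowed_groups' (a set).
    Filtering happens BEFORE ranking, so restricted documents never occupy
    a top-k slot and never reach the prompt.
    """
    visible = [doc for doc in index if doc["allowed_groups"] & user_groups]
    ranked = sorted(visible, key=lambda doc: dot(query_vec, doc["vector"]), reverse=True)
    return [doc["id"] for doc in ranked[:top_k]]


# Hypothetical data for illustration only.
chunks = chunk_with_overlap(list(range(20)), chunk_size=10, overlap_ratio=0.2)
index = [
    {"id": "a", "vector": [1.0, 0.0], "allowed_groups": {"eng"}},
    {"id": "b", "vector": [0.9, 0.1], "allowed_groups": {"hr"}},
    {"id": "c", "vector": [0.0, 1.0], "allowed_groups": {"eng", "hr"}},
]
visible_ids = retrieve_with_acl([1.0, 0.0], index, user_groups={"eng"}, top_k=2)
```

With a real vector database, the same idea would usually be expressed as a metadata filter passed with the query, so the permission check runs inside the search rather than in application code.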

Common Failure Patterns

| Symptom | Likely Cause | Fix |
|---|---|---|
| Wrong docs retrieved | Query too vague, poor chunks | Query expansion, smaller chunks |
| Relevant doc missed | Not indexed, low similarity | Check ingestion, hybrid search |
| Hallucinated answers | Context too short | Increase top-k, better reranking |
| Slow responses | Large chunks, no caching | Optimize chunk size, cache embeddings |
| Inconsistent results | Non-deterministic reranking | Set seeds, use stable sorting |
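
For the "Inconsistent results" row, one way to make ordering repeatable is to sort by score and break ties on a stable key such as the document ID. The sketch below assumes candidates arrive as `(doc_id, score)` pairs. The function name `rerank_deterministic` and the rounding precision are illustrative choices, not something the skill prescribes.

```python
def rerank_deterministic(candidates, precision=6):
    """Sort (doc_id, score) pairs by score descending, then doc_id ascending.

    Scores are rounded before comparison so tiny floating-point jitter
    between runs does not reorder documents that are effectively tied.
    Scores that straddle a rounding boundary can still swap places.
    """
    return sorted(candidates, key=lambda c: (-round(c[1], precision), c[0]))


# The same candidates in two different arrival orders, with a tied pair.
run_1 = [("b", 0.9), ("a", 0.9000000001), ("c", 0.95)]
run_2 = [("c", 0.95), ("a", 0.9000000001), ("b", 0.9)]

order_1 = [doc_id for doc_id, _ in rerank_deterministic(run_1)]
order_2 = [doc_id for doc_id, _ in rerank_deterministic(run_2)]
```

Both runs produce the same ordering regardless of arrival order, which makes downstream answers easier to reproduce and debug.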

Metadata

Stars: 2102
Views: 0
Updated: 2026-03-06

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-ivangdavila-rag": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety Note: ClawKit audits metadata but not runtime behavior. Use with caution.