ClawKit Reliability Toolkit
Official · Verified Developer Tools · Safety 5/5

rag-engineer

Expert in building Retrieval-Augmented Generation systems. Masters embedding models, vector databases, chunking strategies, and retrieval optimization for LLM applications. Use when: building RAG, vector search, embeddings, semantic search, document retrieval.

Why use this skill?

Design and optimize Retrieval-Augmented Generation systems with expert guidance. Improve your LLM's accuracy through professional chunking, hybrid search, and retrieval strategies.


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/mupengi-bot/rag-engineer

What This Skill Does

The RAG Engineer skill serves as the architectural backbone for Retrieval-Augmented Generation (RAG) systems. It turns raw, unstructured data into a high-fidelity knowledge retrieval system, ensuring that your LLM applications provide accurate, context-aware, and reliable responses. By focusing on the critical phases of the RAG pipeline (chunking, embedding, vector database management, and retrieval optimization), this skill mitigates the common pitfalls of hallucination and poor grounding in source material. It acts as a specialized consultant, guiding you through the complex trade-offs between performance, cost, and accuracy in your search infrastructure.
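The pipeline phases above can be sketched end-to-end in a few lines. This is a toy illustration, not the skill's actual implementation: `embed()` is a hashed bag-of-words stand-in for a real embedding model, and the "index" is a plain Python list rather than a vector database.

```python
# Toy RAG retrieval pipeline: chunk -> embed -> index -> retrieve.
# embed() is a hashed bag-of-words stand-in for a real embedding model.
import hashlib
import math

def chunk(text, max_words=40):
    """Split text into fixed-size word chunks (the naive baseline strategy)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def embed(text, dim=64):
    """Toy embedding: hash each token into a bucket of a fixed-size vector."""
    vec = [0.0] * dim
    for tok in text.lower().split():
        h = int(hashlib.md5(tok.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def retrieve(query, index, top_k=2):
    """Rank stored chunks by cosine similarity to the query embedding."""
    q = embed(query)
    scored = [(sum(a * b for a, b in zip(q, v)), c) for c, v in index]
    return [c for _, c in sorted(scored, reverse=True)[:top_k]]

docs = ["Vector databases store embeddings for semantic search.",
        "Chunking splits documents into retrievable passages."]
index = [(c, embed(c)) for d in docs for c in chunk(d)]
print(retrieve("store embeddings for semantic search", index))
```

In a production system each of these stages is where the skill's advice applies: the chunker becomes semantic-aware, the embedding comes from a trained model, and the list is replaced by a vector database index.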

Installation

To install this skill, run the following command in your terminal: clawhub install openclaw/skills/skills/mupengi-bot/rag-engineer

Use Cases

  • Building enterprise-grade knowledge bases for customer support chatbots.
  • Designing document retrieval systems for legal or medical research assistants.
  • Implementing semantic search functionality for internal corporate wikis.
  • Optimizing existing RAG pipelines that suffer from low accuracy or outdated information.
  • Creating hybrid search engines that combine keyword-based precision with vector-based semantic understanding.
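As a concrete illustration of the hybrid-search use case, one common fusion technique is reciprocal rank fusion (RRF), which merges a keyword result list and a vector result list without requiring their raw scores to be comparable. The document IDs below are made up.

```python
# Reciprocal rank fusion (RRF): merge ranked lists from keyword and
# vector search into one hybrid ranking. Document IDs are hypothetical.
def rrf(rankings, k=60):
    """Fuse several ranked ID lists; k damps the influence of top ranks."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

keyword_hits = ["doc3", "doc1", "doc7"]   # e.g. from BM25
vector_hits = ["doc1", "doc5", "doc3"]    # e.g. from a vector index
print(rrf([keyword_hits, vector_hits]))   # → ['doc1', 'doc3', 'doc5', 'doc7']
```

Documents that appear near the top of both lists (like `doc1` and `doc3` here) outrank documents that appear in only one, which is exactly the precision/recall balance hybrid search aims for.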

Example Prompts

  1. "I am struggling with my RAG pipeline; documents are being split mid-sentence, causing retrieval errors. Can you help me implement a semantic chunking strategy that respects paragraph boundaries?"
  2. "Compare the pros and cons of using HNSW index vs. IVF-FLAT in my vector database for a collection of 500,000 technical manuals."
  3. "My system is getting high retrieval scores but the LLM isn't using the data correctly. Could you help me design a reranking strategy using Cross-Encoders to improve precision?"

Tips & Limitations

  • Quality over Quantity: Focus on clean data preprocessing. Garbage in leads to garbage out regardless of your embedding model.
  • Metadata is Key: Always use metadata filtering for your vector searches; pure semantic search often fails on ambiguous terminology.
  • Continuous Evaluation: Treat retrieval evaluation as a separate task from LLM output evaluation. Use tools like RAGAS to measure faithfulness and answer relevance.
  • Don't Over-embed: Avoid embedding everything. Strategic indexing of critical sections often yields better results than naive ingestion.
  • System Complexity: This skill provides architecture advice but requires integration with your existing vector database (like Pinecone, Weaviate, or Milvus).
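The metadata tip above can be illustrated with a pre-filter step: restrict candidates by metadata equality before any vector scoring. Field names like `dept` are hypothetical; real vector databases such as Pinecone, Weaviate, and Milvus expose equivalent filter clauses in their query APIs.

```python
# Metadata-first retrieval: filter candidates by metadata constraints
# before vector scoring, so ambiguous terms are disambiguated by hard
# filters. Record layout and field names are hypothetical.
def filtered_search(query_vec, records, where, top_k=3):
    """records: dicts with 'vector' and 'meta'; where: metadata equality filter."""
    candidates = [r for r in records
                  if all(r["meta"].get(k) == v for k, v in where.items())]
    return sorted(candidates,
                  key=lambda r: sum(a * b for a, b in zip(query_vec, r["vector"])),
                  reverse=True)[:top_k]

records = [
    {"id": 1, "vector": [1.0, 0.0], "meta": {"dept": "legal"}},
    {"id": 2, "vector": [0.9, 0.1], "meta": {"dept": "medical"}},
]
hits = filtered_search([1.0, 0.0], records, where={"dept": "legal"})
print([r["id"] for r in hits])  # only matching-department documents are scored
```

Note that record 2 is semantically very close to the query vector but is excluded outright, which is the point: the filter encodes a constraint that similarity alone cannot express.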

Metadata

Stars: 1,335
Views: 1
Updated: 2026-02-23
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-mupengi-bot-rag-engineer": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags (AI)

#rag #vector-search #embeddings #nlp #llm-architecture