ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

Search Engine

Design and build any search engine with robust indexing, retrieval logic, relevance controls, and evaluation workflows for production systems.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/ivangdavila/search-engine
Or

Setup

On first use, read setup.md and establish activation behavior, system scope, and data constraints before proposing implementation steps.

When to Use

User needs to create, redesign, or scale a search engine for applications, documentation, products, or internal knowledge bases. Agent handles architecture planning, indexing strategy, retrieval design, relevance controls, evaluation loops, and rollout safety.

Architecture

Memory lives in ~/search-engine/. See memory-template.md for baseline structure and status values.

~/search-engine/
|-- memory.md              # Persistent context, constraints, and active priorities
|-- requirements.md        # Retrieval goals, latency targets, and relevance expectations
|-- experiments.md         # Offline experiments and tuning decisions
`-- incidents.md           # Production issues, root cause, and remediation notes

Quick Reference

Use the smallest relevant file for the task.

TopicFile
Setup and activation behaviorsetup.md
Memory template and status modelmemory-template.md
Architecture options and component choicesarchitecture-blueprint.md
Retrieval and ranking strategy patternsretrieval-patterns.md
Quality measurement and evaluation loopsevaluation-metrics.md
Delivery and rollout gatesimplementation-checklist.md

Data Storage

Local notes stay in ~/search-engine/:

  • requirements and relevance objectives
  • data source assumptions and indexing decisions
  • experiment outcomes and deployment safeguards

Core Rules

1. Start with a Retrieval Contract, Not with Tools

Before selecting engines, define the contract:

  • query types to support (keyword, phrase, semantic, hybrid)
  • response format, latency budget, and freshness target
  • error tolerance and fallback behavior

A search engine without a contract becomes an untestable collection of features.

2. Design Ingestion and Indexing as a Deterministic Pipeline

Every document should pass explicit stages:

  • ingestion source validation and deduplication
  • normalization and field extraction
  • chunking policy with stable identifiers
  • indexing with repeatable transforms

Deterministic pipelines reduce drift between environments and simplify debugging.

3. Separate Recall Layers from Precision Layers

Treat retrieval as a staged system:

  • broad candidate retrieval first (lexical, vector, or hybrid)
  • reranking and business rules second
  • formatting and explanation last

Mixing all concerns in one step hides failures and makes tuning unpredictable.

4. Define Relevance Features as Versioned Policy

Relevance changes must be tracked as policy versions:

  • feature weights and boosts
  • typo tolerance and synonym policy
  • filtering, faceting, and tie-break rules

Never ship silent relevance changes without versioned notes and measured deltas.

Metadata

Stars2102
Views1
Updated2026-03-06
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-ivangdavila-search-engine": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.