vector-databases
Deep vector database workflow—embedding choice, index algorithms, recall/latency trade-offs, hybrid search, filtering, operational tuning, and cost. Use when selecting or optimizing Pinecone, Milvus, Qdrant, Weaviate, pgvector, OpenSearch kNN, etc.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/clawkk/vector-databasesWhat This Skill Does
The vector-databases skill provides a structured, engineering-focused framework for implementing and optimizing production-grade vector search systems. It transitions users from basic similarity searches to scalable, high-performance RAG and recommendation architectures. The skill covers the end-to-end lifecycle: defining similarity metrics, selecting embedding models and chunking strategies, tuning index types (HNSW, IVF, PQ), implementing metadata filtering, and managing cost-efficient infrastructure. It serves as an expert advisor for selecting the right backend—whether you are deciding between Pinecone, Milvus, Qdrant, Weaviate, pgvector, or OpenSearch kNN—by calculating the trade-offs between recall, latency (p95), and operational overhead.
Installation
To install this skill, run the following command in your terminal: clawhub install openclaw/skills/skills/clawkk/vector-databases
Use Cases
- RAG Architecture: Designing and optimizing document retrieval for LLM applications to minimize hallucinations through improved context selection.
- Recommendation Systems: Configuring high-throughput vector storage for real-time item similarity and user preference matching.
- Hybrid Search Strategy: Combining dense vector embeddings with sparse keyword search (BM25) to solve for exact keyword match limitations.
- Performance Tuning: Resolving recall drop-offs or latency spikes in production vector databases by re-tuning index parameters like M, efConstruction, and nlist.
- Database Selection: Comparing managed cloud vector services versus self-hosted or extension-based (e.g., pgvector) solutions based on total cost of ownership and scale requirements.
Example Prompts
- "I am seeing a significant drop in recall when moving from 10k to 1M vectors in my Qdrant instance. Can you help me review my HNSW index configuration?"
- "Compare the pros and cons of using pgvector on RDS versus deploying a dedicated Milvus cluster for a RAG system with 5 million documents."
- "How should I design my metadata schema to support multi-tenant filtering while keeping latency under 100ms?"
Tips & Limitations
- Model Stability: Always version your embedding models. A change in the embedding model requires a full re-indexing of your data, which is a major operational task.
- Evaluation First: Avoid premature optimization. Establish a ground truth dataset or a set of proxy evaluation tasks before tweaking index parameters.
- Hybrid is King: Almost every production search system requires metadata filtering or keyword-based re-ranking to be useful; don't rely solely on dense vector retrieval.
- Resource Awareness: HNSW indexes are memory-intensive. Always monitor RAM usage during index building to prevent OOM errors.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-clawkk-vector-databases": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Related Skills
data-move
Deep data migration workflow—scope, mapping, validation, batching and ordering, dual-write and cutover, rollback, and reconciliation. Use when moving tenants, bulk backfills, or changing stores without losing trust in data correctness.
data-model
Deep data modeling workflow—grain, facts and dimensions, keys, slowly changing dimensions, normalization trade-offs, and analytics query patterns. Use when designing warehouse/analytics models or reviewing star/snowflake schemas.
guard
Deep AI safety guardrails workflow—policy definition, input/output filtering, monitoring, escalation, and false-positive handling. Use when reducing harmful outputs, misuse, or policy violations in LLM products.
prompts
Deep prompt engineering workflow—task spec, constraints, examples, evaluation sets, iteration protocol, regression testing, and safety alignment. Use when improving LLM outputs, shipping prompt changes, or building reusable prompt templates.
cost-opt
Cloud cost review: rightsizing, reservations, waste. Use when reducing infra spend.