ai-engineer
AI/ML engineering specialist for building intelligent features, RAG systems, LLM integrations, data pipelines, vector search, and AI-powered applications. Use when building anything involving: LLMs, embeddings, vector databases, RAG, fine-tuning, prompt engineering, AI agents, ML pipelines, or deploying models to production. NOT for general web dev (use rapid-prototyper) or simple API calls.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/bullkis1/ai-engineerAI Engineer
Build practical AI systems that work in production. Data-driven, systematic, performance-focused.
Core Capabilities
- LLM Integration: OpenAI, Anthropic, local models (Ollama, llama.cpp), LiteLLM
- RAG Systems: Chunking, embeddings, vector search, retrieval, re-ranking
- Vector DBs: Chroma (local), Pinecone (managed), Weaviate, FAISS, Qdrant
- Agents & Tools: Tool-calling, multi-step agents, OpenClaw sub-agents
- Data Pipelines: Ingestion, cleaning, transformation, feature engineering
- MLOps: Model versioning (MLflow), monitoring, drift detection, A/B testing
- Evaluation: Benchmark construction, bias testing, performance metrics
Decision Framework
Which LLM provider?
- Prototyping/speed: OpenAI GPT-4o or Anthropic Claude Sonnet
- Local/private: Ollama + Qwen 2.5 32B or Llama 3.3 70B
- Multi-provider abstraction: LiteLLM (swap models without code changes)
- Embeddings: text-embedding-3-small (OpenAI) or nomic-embed-text (local)
Which vector DB?
- Local/dev: Chroma (zero setup)
- Production managed: Pinecone
- Self-hosted production: Qdrant or Weaviate
- Already in Postgres: pgvector extension
RAG or fine-tuning?
- RAG first — always try RAG before fine-tuning. 90% of cases RAG is enough.
- Fine-tune only when: style/tone change needed, domain vocab is highly specialized, latency must be minimal
RAG Workflow
1. Ingest
# Chunk documents (rule of thumb: 512 tokens, 50 overlap)
from langchain.text_splitter import RecursiveCharacterTextSplitter
splitter = RecursiveCharacterTextSplitter(chunk_size=512, chunk_overlap=50)
chunks = splitter.split_documents(docs)
2. Embed + store
import chromadb
from chromadb.utils.embedding_functions import OpenAIEmbeddingFunction
client = chromadb.PersistentClient(path="./chroma_db")
ef = OpenAIEmbeddingFunction(api_key=os.environ["OPENAI_API_KEY"], model_name="text-embedding-3-small")
collection = client.get_or_create_collection("docs", embedding_function=ef)
collection.add(documents=[c.page_content for c in chunks], ids=[str(i) for i in range(len(chunks))])
3. Retrieve + generate
results = collection.query(query_texts=[user_query], n_results=5)
context = "\n\n".join(results["documents"][0])
response = client.chat.completions.create(
model="gpt-4o",
messages=[
{"role": "system", "content": f"Answer based on this context:\n{context}"},
{"role": "user", "content": user_query},
]
)
See references/rag-patterns.md for advanced patterns: re-ranking, hybrid search, HyDE, eval.
LLM Tool Calling (Agents)
tools = [{
"type": "function",
"function": {
"name": "search_docs",
"description": "Search internal documentation",
"parameters": {
"type": "object",
"properties": {"query": {"type": "string"}},
"required": ["query"]
}
}
}]
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-bullkis1-ai-engineer": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
howtoletmyagent_secure_gmail_access
Teach an OpenClaw agent the recommended Gmail OAuth2 setup, scope choices, and safety guardrails from this guide.
growth-hacker
Rapid user acquisition, viral loops, conversion optimization, and growth experiments. Use when working on: getting first users, improving signup/activation rates, building referral mechanics, A/B testing, distribution strategy, or figuring out why growth is stuck. Specializes in early-stage and indie product growth (0→1 and 1→10k users). NOT for brand strategy (use brand-guardian) or content creation (use content-creator).
rapid-prototyper
Ultra-fast proof-of-concept and MVP development. Use when building new web apps, prototypes, or MVPs from scratch where speed matters over perfection. Specializes in the canonical fast-stack: Next.js 14 + Supabase + Clerk + shadcn/ui + Prisma. Triggers when user asks to "build", "prototype", "create a quick app", "spin up an MVP", or wants a working thing fast. NOT for small one-liner fixes or edits to existing codebases.
howtoletmyagent_installer
Install companion OpenClaw skills from howtoletmyagent.xyz article URLs or skill manifests.