senior-ml-engineer
ML engineering skill for productionizing models, building MLOps pipelines, and integrating LLMs. Covers model deployment, feature stores, drift monitoring, RAG systems, and cost optimization. Use when the user asks about deploying ML models to production, setting up MLOps infrastructure (MLflow, Kubeflow, Kubernetes, Docker), monitoring model performance or drift, building RAG pipelines, or integrating LLM APIs with retry logic and cost controls. Focused on production and operational concerns rather than model research or initial training.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alirezarezvani/senior-ml-engineerWhat This Skill Does
The Senior ML Engineer skill transforms OpenClaw into a production-grade machine learning architect. It bridges the gap between experimental Jupyter notebook models and robust, scalable production systems. The skill provides codified workflows for model serialization, containerization, and the integration of MLOps best practices. It manages the complexities of feature store orchestration using tools like Feast or Tecton, experiment tracking via MLflow or Weights & Biases, and high-performance inference serving. Furthermore, it excels at building RAG (Retrieval-Augmented Generation) pipelines, handling document indexing, vector database selection, and prompt optimization for LLMs. By leveraging this skill, you ensure that your ML lifecycle follows industry standards, from automated canary deployments to granular drift monitoring and cost-efficient cloud resource management.
Installation
To integrate this expert-level capability into your environment, run the following command within your terminal or OpenClaw interface:
clawhub install openclaw/skills/skills/alirezarezvani/senior-ml-engineer
Ensure your local development environment has the necessary access permissions for your container registry and cloud infrastructure providers (AWS, GCP, or Azure) to allow the agent to manage deployments effectively.
Use Cases
- Productionizing PyTorch or TensorFlow models: Convert experimental code into production-ready REST APIs using FastAPI or Triton Inference Server.
- Building MLOps Pipelines: Automate the retraining and model registry process for iterative model improvements.
- Designing RAG Architectures: Create scalable knowledge retrieval systems for LLM-based assistants using vector databases.
- Cost Optimization: Analyze inference workloads to select the most cost-effective serving infrastructure.
- Model Lifecycle Management: Implement automatic drift detection and alerts to identify when models need retraining based on real-world data distribution changes.
Example Prompts
- "Build a Dockerfile for a FastAPI-based model server that uses GPU acceleration for a PyTorch classification model."
- "Draft a Python snippet for a Feast feature view that aggregates user purchase history from our raw Parquet files."
- "Design a RAG system architecture for a customer support bot, including recommendations for embedding models and vector database latency optimizations."
Tips & Limitations
- Tip: Always enable canary deployments for new models to minimize production impact.
- Tip: Use MLflow versioning consistently to ensure reproducibility across experimental and production environments.
- Limitation: The skill requires pre-configured cloud credentials; it cannot provision raw infrastructure without existing IAM permissions.
- Limitation: Large-scale MLOps setups often require external database connections that must be configured in your environment variables.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-alirezarezvani-senior-ml-engineer": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: code-execution, file-read, file-write, external-api
Related Skills
intl-expansion
International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.
marketing-strategy-pmm
Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.
paid-ads
When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.
qms-audit-expert
ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.
code-reviewer
Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.