mlops-dag-builder
Design DAG-based MLOps pipeline architectures with Airflow, Dagster, Kubeflow, or Prefect. Activates for DAG orchestration, workflow automation, pipeline design patterns, CI/CD for ML. Use for platform-agnostic MLOps infrastructure - NOT for SpecWeave increment-based ML (use ml-pipeline-orchestrator instead).
Why use this skill?
Use the mlops-dag-builder to design platform-agnostic ML pipelines with Airflow, Dagster, and Kubeflow. Automate CI/CD and orchestration today.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/anton-abyzov/sw-mlops-dag-builderWhat This Skill Does
The mlops-dag-builder skill is a specialized architectural engine designed for constructing robust, production-grade DAG-based MLOps pipelines. It supports major orchestration frameworks including Apache Airflow, Dagster, Kubeflow, and Prefect. Instead of focusing on simple script execution, this skill operates at the systems architecture level, helping you define component dependencies, data flow logic, error handling strategies, and retry policies for complex, multi-stage machine learning workflows. It empowers users to transition from local notebook experimentation to scalable, platform-agnostic infrastructure.
Installation
To install this skill, run the following command in your terminal: clawhub install openclaw/skills/skills/anton-abyzov/sw-mlops-dag-builder
Use Cases
This skill is ideal for building scalable data pipelines that include ingestion, validation, feature engineering, distributed training, and automated model deployment. It is particularly useful for platform engineers who need to design reusable CI/CD templates for ML projects, ensuring consistent environments across cloud services like AWS SageMaker, Google Vertex AI, or Azure ML. It serves as the bridge between raw code and automated production delivery.
Example Prompts
- "Build a DAG in Dagster that orchestrates a daily feature engineering job, trains a PyTorch model on the new data, and pushes the artifact to an S3 bucket only if performance metrics improve."
- "I need a Kubeflow pipeline design that includes data validation steps using Great Expectations followed by a distributed training job with fault-tolerant retry logic."
- "Compare airflow and prefect for managing high-frequency inference workflows; provide a boilerplate implementation for a blue-green deployment strategy using this skill."
Tips & Limitations
When designing your pipelines, always prioritize idempotency in your task definitions to ensure that retries do not corrupt stateful data. Ensure that you have a clearly defined data versioning strategy before implementing the training phase, as pipeline reliability is heavily dependent on data lineage. Note that this skill is strictly for DAG orchestration architecture and platform-agnostic workflows. If you are working specifically with SpecWeave increment-based ML experiments or require tight experiment tracking hooks, please utilize the 'ml-pipeline-orchestrator' instead to avoid configuration conflicts. Always keep your pipeline code decoupled from the orchestration logic to ensure easier testing and porting between different providers.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-anton-abyzov-sw-mlops-dag-builder": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: code-execution
Related Skills
network-engineer
Cloud network architect for VPC design, service mesh, zero-trust networking, load balancers, and CDN optimization. Use for network troubleshooting or connectivity issues.
jira-multi-project-mapper
Expert in mapping SpecWeave specs to multiple JIRA projects with intelligent project detection and cross-project coordination. Use when syncing to multiple JIRA projects (project-per-team, component-based), or managing bidirectional sync across team boundaries.
helm-chart-scaffolding
Design, organize, and manage Helm charts for templating and packaging Kubernetes applications with reusable configurations. Use when creating Helm charts, packaging Kubernetes applications, or implementing templated deployments.
performance-optimization
React Native performance with Hermes V1, FlashList, expo-image v2, concurrent rendering. Use for slow app, memory leaks, or FPS issues.
release-strategy-advisor
Release strategy advisor - detects brownfield patterns (tags, CI/CD, changelogs), recommends versioning strategy based on architecture. Creates release-strategy.md.