ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified developer tools Safety 3/5

senior-data-engineer

Data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, implementing data governance, or troubleshooting data issues.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/alirezarezvani/senior-data-engineer
Or

What This Skill Does

The Senior Data Engineer skill is a specialized OpenClaw agent designed to act as a production-grade data architect. It handles the full lifecycle of data infrastructure, from initial requirement gathering and schema modeling to deployment and performance tuning. This skill leverages industry-standard patterns to help you build, maintain, and scale data pipelines. Whether you are dealing with massive historical datasets that require batch ETL processing, or high-velocity event streams that necessitate sub-second latency, this skill provides the architectural oversight and code-generation capabilities to execute effectively. It integrates seamlessly with modern data stack components including Apache Spark, Airflow, dbt, and cloud-native warehousing solutions like Snowflake and BigQuery.

Installation

To add this skill to your OpenClaw environment, execute the following command in your terminal:

clawhub install openclaw/skills/skills/alirezarezvani/senior-data-engineer

Ensure that you have the appropriate environment variables configured for your cloud provider (e.g., AWS, GCP, or Azure) to allow the skill to interface with your data infrastructure components.

Use Cases

  • Pipeline Orchestration: Architecting complex Airflow DAGs with dependency management, backfilling logic, and task retries.
  • Modern Data Stack Implementation: Building and managing dbt models for modular and documented SQL transformations.
  • Performance Tuning: Diagnosing slow-running Spark jobs by analyzing execution plans and partition strategies.
  • Data Quality Assurance: Implementing proactive data contracts and validation checks to prevent data drift and downstream failures.
  • Architectural Migration: Assessing the trade-offs between Lambda and Kappa architectures for real-time streaming requirements.

Example Prompts

  1. "I have a 5TB daily log file in S3. Design an ETL pipeline using Spark and Airflow to aggregate this data into a star schema in Snowflake, including error handling for late-arriving records."
  2. "Can you review my dbt project structure and suggest improvements for modularity and incremental processing of our user_activity table?"
  3. "My current Kafka-to-BigQuery streaming pipeline is experiencing latency spikes. How should I approach monitoring to identify whether the issue is at the consumer group or the load step?"

Tips & Limitations

  • Proactive Monitoring: Always define your data quality requirements before building the pipeline to ensure that freshness and schema validation are baked in from day one.
  • Environment Safety: This skill generates code that interacts with your data infrastructure. While highly effective, always review generated configurations in a development or staging environment before applying them to production systems.
  • Cost Awareness: Be mindful that high-throughput streaming architectures often incur significantly higher cloud costs than batch-based alternatives; use the built-in Decision Framework provided by this skill to evaluate your cost-to-value ratio.

Metadata

Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-alirezarezvani-senior-data-engineer": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#data-engineering#etl#pipeline#airflow#data-architecture
Safety Score: 3/5

Flags: file-read, file-write, code-execution

Related Skills

intl-expansion

International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.

alirezarezvani 4473

marketing-strategy-pmm

Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.

alirezarezvani 4473

paid-ads

When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.

alirezarezvani 4473

qms-audit-expert

ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.

alirezarezvani 4473

code-reviewer

Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.

alirezarezvani 4473