senior-data-engineer
Data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, implementing data governance, or troubleshooting data issues.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/alirezarezvani/senior-data-engineerWhat This Skill Does
The Senior Data Engineer skill is a specialized OpenClaw agent designed to act as a production-grade data architect. It handles the full lifecycle of data infrastructure, from initial requirement gathering and schema modeling to deployment and performance tuning. This skill leverages industry-standard patterns to help you build, maintain, and scale data pipelines. Whether you are dealing with massive historical datasets that require batch ETL processing, or high-velocity event streams that necessitate sub-second latency, this skill provides the architectural oversight and code-generation capabilities to execute effectively. It integrates seamlessly with modern data stack components including Apache Spark, Airflow, dbt, and cloud-native warehousing solutions like Snowflake and BigQuery.
Installation
To add this skill to your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/alirezarezvani/senior-data-engineer
Ensure that you have the appropriate environment variables configured for your cloud provider (e.g., AWS, GCP, or Azure) to allow the skill to interface with your data infrastructure components.
Use Cases
- Pipeline Orchestration: Architecting complex Airflow DAGs with dependency management, backfilling logic, and task retries.
- Modern Data Stack Implementation: Building and managing dbt models for modular and documented SQL transformations.
- Performance Tuning: Diagnosing slow-running Spark jobs by analyzing execution plans and partition strategies.
- Data Quality Assurance: Implementing proactive data contracts and validation checks to prevent data drift and downstream failures.
- Architectural Migration: Assessing the trade-offs between Lambda and Kappa architectures for real-time streaming requirements.
Example Prompts
- "I have a 5TB daily log file in S3. Design an ETL pipeline using Spark and Airflow to aggregate this data into a star schema in Snowflake, including error handling for late-arriving records."
- "Can you review my dbt project structure and suggest improvements for modularity and incremental processing of our user_activity table?"
- "My current Kafka-to-BigQuery streaming pipeline is experiencing latency spikes. How should I approach monitoring to identify whether the issue is at the consumer group or the load step?"
Tips & Limitations
- Proactive Monitoring: Always define your data quality requirements before building the pipeline to ensure that freshness and schema validation are baked in from day one.
- Environment Safety: This skill generates code that interacts with your data infrastructure. While highly effective, always review generated configurations in a development or staging environment before applying them to production systems.
- Cost Awareness: Be mindful that high-throughput streaming architectures often incur significantly higher cloud costs than batch-based alternatives; use the built-in Decision Framework provided by this skill to evaluate your cost-to-value ratio.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-alirezarezvani-senior-data-engineer": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, file-write, code-execution
Related Skills
intl-expansion
International market expansion strategy. Market selection, entry modes, localization, regulatory compliance, and go-to-market by region. Use when expanding to new countries, evaluating international markets, planning localization, or building regional teams.
marketing-strategy-pmm
Product marketing skill for positioning, GTM strategy, competitive intelligence, and product launches. Use when the user asks about product positioning, go-to-market planning, competitive analysis, target audience definition, ICP definition, market research, launch plans, or sales enablement. Covers April Dunford positioning, ICP definition, competitive battlecards, launch playbooks, and international market entry. Produces deliverables including positioning statements, battlecard documents, launch plans, and go-to-market strategies.
paid-ads
When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.
qms-audit-expert
ISO 13485 internal audit expertise for medical device QMS. Covers audit planning, execution, nonconformity classification, and CAPA verification. Use for internal audit planning, audit execution, finding classification, external audit preparation, or audit program management.
code-reviewer
Code review automation for TypeScript, JavaScript, Python, Go, Swift, Kotlin. Analyzes PRs for complexity and risk, checks code quality for SOLID violations and code smells, generates review reports. Use when reviewing pull requests, analyzing code quality, identifying issues, generating review checklists.