csv-wizard
交互式数据清洗 CLI,支持自动类型推断、缺失值处理、重复检测
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/antonia-sz/csv-wizardWhat This Skill Does
CSV Wizard is a comprehensive, interactive command-line interface designed to streamline the data preparation workflow directly within the OpenClaw environment. Data cleaning is often the most time-consuming phase of any data science or administrative project; this skill automates the heavy lifting by performing intelligent type inference, identifying data quality issues, and applying standardized cleaning procedures. Whether you are dealing with messy spreadsheet exports, logs, or raw datasets, CSV Wizard acts as a virtual data steward. It detects data types automatically, handles missing values through statistical imputation, scrubs duplicate records, and ensures that column headers meet best practices for programmatic consumption. Its interactive mode transforms the cleaning process into a guided experience, allowing you to make granular decisions about your data without needing to write complex Python scripts or manual Excel formulas.
Installation
To add this skill to your OpenClaw agent, execute the following command in your terminal or command-line interface:
clawhub install openclaw/skills/skills/antonia-sz/csv-wizard
Once installed, you can immediately invoke the /clean-csv command to begin processing your data files located within your workspace.
Use Cases
- Data Science Preparation: Quickly sanitize training datasets by imputing missing values and removing redundant rows before passing the data to machine learning models.
- Administrative Data Consolidation: Standardize inconsistent naming conventions in CSV exports from multiple legacy systems, ensuring uniform column headers like
snake_case. - Report Preparation: Generate quick statistical summaries and previews to verify file integrity before importing data into visualization tools or dashboards.
- Data Migration: Ensure that incoming CSV files are properly formatted and encoded (UTF-8) before importing them into databases or cloud storage solutions.
Example Prompts
- "/clean-csv customer_leads.csv --info" - This prompt will provide an immediate high-level summary of your customer data, highlighting column types and identifying where missing values are concentrated.
- "/clean-csv raw_sales.csv --drop-duplicates --fill-missing mean --output processed_sales.csv" - This command cleans a sales file by stripping out redundant entries and filling in gaps in numerical data with the column mean, saving the clean result to a new file.
- "/clean-csv legacy_data.csv --interactive" - This launches the recommended interactive mode, allowing the agent to prompt you for specific cleaning decisions as it parses each column.
Tips & Limitations
- Safety First: Always use the
--outputflag to save your cleaned data to a new file. Avoid overwriting your original source files until you have verified the results. - Encoding: CSV Wizard automatically standardizes files to UTF-8. If your legacy data uses unconventional encodings, you may need to verify the output contents after processing.
- Interactive Mode: If your dataset is exceptionally large (multi-gigabyte), the interactive mode may be slower than batch processing. Use batch commands for high-volume automated pipelines.
- Imputation: Be mindful of the statistical method chosen for missing values; using 'mean' or 'median' on categorical data will result in errors, so ensure the column type is numeric when selecting these strategies.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-antonia-sz-csv-wizard": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read
Related Skills
content-automation
内容创作自动化工具 Skill。支持社交媒体内容生成、视频脚本创作、定时发布任务管理。当用户需要批量生成内容、自动化社交媒体运营或创建视频脚本时触发。
style-cloner
提供1-5篇参考文章 + 原始素材,AI 分析参考文章的风格特征, 将素材改写成同风格的成品文章,输出3个版本供选择,支持强度调节和迭代优化。
project-evaluator
描述一个项目想法,AI 从市场/技术/商业/风险四个维度系统评估, 输出评估报告、竞品速查、MVP建议,帮你决策「值不值得做」。
maybe-finance
Personal finance management skill using Maybe Finance OS. Use when users need to track expenses, analyze budgets, monitor net worth, or manage personal finances through the Maybe Finance self-hosted platform. Supports transaction tracking, account management, budget analysis, and financial reporting.
notebooklm
Google NotebookLM 非官方 Python API 的 OpenClaw Skill。支持内容生成(播客、视频、幻灯片、测验、思维导图等)、文档管理和研究自动化。当用户需要使用 NotebookLM 生成音频概述、视频、学习材料或管理知识库时触发。