ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified developer tools Safety 3/5

hair-cam-anno

安防摄像头视频 VL 模型微调数据集标注工具。用于从安防摄像头视频中提取关键帧、分析视频内容、生成结构化标注(含环境/人物/行为/风险描述),并输出符合 dataset.jsonl 格式的微调训练数据。Use when 用户需要对安防摄像头视频进行数据标注、生成 VL 模型训练数据集、处理 /root/hair-cam 目录下的视频数据,或提及 "hair-cam"、"数据标注"、"视频标注"、"VL模型微调"。

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/aowind/sjht-cam-anno
Or

What This Skill Does

hair-cam-anno is a specialized AI agent skill designed for annotating surveillance camera footage to create high-quality datasets for training Vision-Language (VL) models. It streamlines the lifecycle of video-to-data preparation, covering three core stages: automated frame extraction, intelligent visual content analysis, and the generation of structured training manifests. By processing raw video files from local directories, this skill extracts key frames at specific intervals and pairs them with rich, structured metadata including environment descriptions, demographic information, behavioral analysis, and safety risk assessments. The final output is a standardized dataset.jsonl file, ready for immediate integration into fine-tuning pipelines for VL models. It ensures that surveillance data is not just stored, but transformed into semantically meaningful training examples.

Installation

You can install this skill directly via the OpenClaw CLI using the following command: clawhub install openclaw/skills/skills/aowind/sjht-cam-anno

Use Cases

  • VL Model Fine-Tuning: Prepare high-quality, ground-truth annotated video datasets for training vision-language models to recognize security threats.
  • Security Surveillance Analytics: Automate the description and classification of human behavior captured in standard security camera formats.
  • Safety Compliance Auditing: Detect and document high-risk behaviors (e.g., falls, unauthorized entry) by generating structured risk assessments from video footage.
  • Dataset Curation: Systematically process large batches of raw video recordings into uniform datasets for machine learning research.

Example Prompts

  1. "Analyze the videos in /root/hair-cam/office-data and generate a dataset.jsonl file for fine-tuning our vision model."
  2. "I have some new surveillance footage; please extract frames using the hair-cam-anno skill and help me write the descriptions."
  3. "Can you process the security recordings in the local directory and flag any videos with high risk levels for the training dataset?"

Tips & Limitations

  • Consistency: Always ensure the information inferred from filenames (such as occupancy counts or specific behaviors) aligns with the descriptive JSON to prevent label noise during model training.
  • Framerate: The default frame extraction is set to 0.5 fps (1 frame every 2 seconds). Adjust this parameter if your specific security scenarios require higher temporal resolution for action recognition.
  • Risk Assessment: Always reference the references/labels-reference.md file to ensure that your 'risk' level assignments follow the established schema for consistent dataset training.
  • Data Privacy: Ensure that the video data residing in /root/hair-cam has been processed in accordance with your organization's privacy and data protection policies, especially when dealing with human subjects.

Metadata

Author@aowind
Stars4473
Views0
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-aowind-sjht-cam-anno": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#computer-vision#data-labeling#video-analysis#machine-learning#security-ai
Safety Score: 3/5

Flags: file-read, file-write, code-execution

Related Skills

wisdom-forum

世纪智慧论坛自动化技能。支持自动注册、浏览帖子、发布新帖、回复帖子。 论坛地址: http://8.134.249.230/wisdom/

aowind 4473

web-screenshot

Capture screenshots of web pages running on local or remote servers using Puppeteer in headless Chromium. Use when user asks to screenshot web pages, capture web UI, take website screenshots, or document web application interfaces. Supports login-required SPAs (Vue/React/Angular) by performing form-based authentication before navigating. Generates screenshots and an optional result.json with per-page descriptions.

aowind 4473

doubao-image-gen

使用豆包 Seedream 模型文生图,支持并发批量生成,输出图库预览页

aowind 4473

data-annotation

通用数据标注处理工具。当用户提到需要数据标注、有标注任务、数据处理、数据集生成、 标注查看/编辑时使用此 skill。支持图像、视频、文本等多种数据类型,调用模型进行内容理解 和标注,生成结构化标注数据,提供 Web 查看编辑界面。 触发短语:「标注」「annotation」「数据集」「label」「tag data」「数据处理」。

aowind 4473

long-running-harness

长时程 Agent 项目工作流框架(基于 Anthropic "Effective Harnesses for Long-Running Agents")。 用于创建、管理和调度跨多个上下文窗口的长期项目任务。 Use when: 启动新项目、初始化项目工作流、管理项目任务列表、调度子Agent增量开发、 恢复项目状态、生成项目进度报告。触发短语包括: "启动项目"、"初始化项目"、"创建工作流"、"项目进度"、"继续开发"、 "管理任务列表"、"分配任务"、"next feature"、"project status"。

aowind 4473