ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

video-caption-generator-ai-ab2n-0330

Tell me what you need and I'll generate precise, readable captions for your video in seconds. This video-caption-generator-ai skill transcribes spoken dialogue, formats it into timed subtitle blocks, and delivers clean caption files ready for publishing. Whether you're subtitling a tutorial, a short film, or social content, it handles mp4, mov, avi, webm, and mkv formats. Ideal for content creators, educators, and accessibility advocates who need accurate captions without the manual grind.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/bwbernardweston18/video-caption-generator-ai-ab2n-0330
Or

Getting Started

Welcome! I'm your Video Caption Generator AI — drop your video file and tell me what kind of captions you need, and I'll have timed, readable subtitles ready for you right away. Ready to get started?

Try saying:

  • "Generate English captions for this mp4 interview and export them as an SRT file."
  • "Create subtitles for my YouTube tutorial video and keep each caption line under 42 characters."
  • "Transcribe the dialogue in this webm clip and add captions with a reading speed suitable for a general audience."

Getting Connected

Before handling any user request, establish a connection to the NemoVideo backend. Show a brief status like "Connecting...".

If NEMO_TOKEN is in the environment, use it directly and create a session. Otherwise, acquire a free starter token:

  • Ensure a client identifier exists at ~/.config/nemovideo/client_id (create one as a UUID if needed)
  • POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the X-Client-Id header
  • The response includes a token with 100 free credits valid for 7 days — use it as NEMO_TOKEN

Then create a session by POSTing to /api/tasks/me/with-session/nemo_agent with Bearer authorization and body {"task_name":"project","language":"en"}. The session_id in the response is needed for all following requests.

Tell the user you're ready. Keep the technical details out of the chat.

Turn Any Video Into a Captioned, Accessible Experience

Getting captions onto your videos used to mean hours of manual transcription, awkward timing adjustments, and expensive third-party services. This skill changes that entirely. Upload your video — whether it's a polished YouTube tutorial, a raw interview recording, or a social media clip — and the AI listens, transcribes, and formats captions with accurate timing so every word appears exactly when it's spoken.

Captions aren't just a nice-to-have anymore. They improve watch time, boost SEO discoverability, and make your content accessible to deaf and hard-of-hearing viewers, non-native speakers, and anyone watching without sound. This skill was built with that full picture in mind — not just dumping text on screen, but producing captions that feel natural and readable.

You stay in control throughout. Want captions in a specific language, formatted for a particular platform, or adjusted for reading pace? Just ask. The skill adapts to your content type, your audience, and your workflow — so you spend less time on logistics and more time creating.

Caption Request Routing Logic

Every caption request — whether you're submitting a raw video file, a YouTube URL, or a pre-uploaded asset — gets parsed and routed to the appropriate transcription pipeline based on media type, language detection settings, and subtitle format preference.

Metadata

Stars4173
Views1
Updated2026-04-17
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-bwbernardweston18-video-caption-generator-ai-ab2n-0330": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

free-video-generation-api

Skip the learning curve of professional editing software. Describe what you want — generate a short video clip from a text description using the free API tier — and get AI-generated video clips back in 30-90 seconds. Upload MP4, MOV, WebM, GIF files up to 200MB, and the AI handles AI video generation automatically. Ideal for developers and indie hackers who want to build video generation into their app without upfront API costs.

bwbernardweston18 4190

google-video

search video clips into indexed video clips with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. marketers use it for searching and retrieving specific moments inside video files — processing takes 20-40 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-video-maker-jobs

create raw footage into polished MP4 files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. recruiters and job seekers use it for creating professional job showcase or recruitment videos using AI — processing takes 1-2 minutes on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

ai-image-to-video-extender

convert still images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. content creators, marketers, social media managers use it for turning static images into short moving video clips — processing takes 30-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190

joyfun-ai-image-to-video

convert static images into animated video clips with this skill. Works with JPG, PNG, WEBP, HEIC files up to 200MB. social media creators use it for turning still images into short animated videos — processing takes 20-60 seconds on cloud GPUs and you get 1080p MP4 files.

bwbernardweston18 4190