volcengine-ata-subtitle
Generate subtitles with automatic time alignment using Volcengine ATA API. Use when the user wants to: (1) add time-aligned subtitles to videos, (2) convert text + audio to SRT/ASS format, or (3) automate subtitle creation workflow.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/blackeight4752/doubao-ata-subtitleWhat This Skill Does
The volcengine-ata-subtitle skill is a powerful automation tool designed to generate time-aligned subtitles (SRT or ASS formats) by leveraging the Volcengine Automatic Time Alignment (ATA) API. This skill simplifies the tedious process of manual timing by synchronizing plain text transcripts with audio files, allowing for rapid deployment of subtitles for videos, social media content, or archives. By automating the alignment process, it ensures high precision and professional quality, saving content creators significant editing time.
Installation
To install this skill, run the following command in your terminal: clawhub install openclaw/skills/skills/blackeight4752/doubao-ata-subtitle. Once installed, you must configure your credentials to authenticate with the Volcengine API. You can either export the environment variables VOLC_ATA_APP_ID, VOLC_ATA_TOKEN, and VOLC_ATA_API_BASE, or create a configuration file at ~/.volcengine_ata.conf following the provided template. Ensure you have FFmpeg installed on your system to handle the mandatory audio conversion to 16kHz mono PCM format, as the API is strictly sensitive to these audio parameters.
Use Cases
- Content Localization: Generate subtitles for translated transcripts to quickly localize international video content.
- Accessible Media: Create accurate time-stamped subtitles for educational videos to improve accessibility and engagement.
- Post-Production Automation: Integrate subtitle generation into a larger video editing pipeline where text is prepared in advance and needs to be perfectly aligned with spoken narration.
Example Prompts
- "Use the volcengine-ata-subtitle skill to align my text file 'script.txt' with the audio from 'interview.wav' and save the output as 'subtitles.srt'."
- "I need to convert my video audio to an ASS subtitle file. Please process 'lecture.wav' and 'content.txt' using the ATA skill and format it as ASS."
- "The current subtitles aren't synced. Use the ATA skill to re-process my audio and text files so the timing matches the dialogue exactly."
Tips & Limitations
- Pre-processing is Critical: Always ensure your audio is 16kHz mono PCM using
ffmpeg -i input.mp4 -vn -acodec pcm_s16le -ar 16000 -ac 1 audio.wav. The API will reject files that do not meet these specs. - Input Formatting: Ensure your text file is clean, UTF-8 encoded, and contains no timestamps or punctuation. Each sentence should occupy a single line for best alignment results.
- Troubleshooting: If you encounter an 'Authorization failed' error, confirm that your
VOLC_ATA_TOKENis prefixed correctly with 'Bearer; ' as required by the API schema. Always check your connection if you receive API timeout errors during peak usage times.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-blackeight4752-doubao-ata-subtitle": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api, code-execution