Official Verified developer tools Safety 4/5

speechall-cli

Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list available STT models and their capabilities, (4) use speaker diarization, subtitles, or other transcription features from the terminal. Triggers on mentions of speechall, audio transcription CLI, or speech-to-text from the command line.


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/atacan/speechall-cli


What This Skill Does

The speechall-cli skill brings the Speechall API into your terminal. It acts as an interface for speech-to-text transcription across multiple providers, including OpenAI, Deepgram, and AssemblyAI. It accepts both audio and video files; for video, it extracts the audio track before transcribing. It is aimed at users who want to process large volumes of media from the command line, and it offers speaker diarization, custom vocabulary boosting, and output formats such as SRT and VTT for subtitle creation.

Installation

To add this skill to your OpenClaw environment, run the following command in your terminal:

clawhub install openclaw/skills/skills/atacan/speechall-cli

For the underlying CLI tool itself, make sure Homebrew is installed on macOS or Linux, then run:

brew install Speechall/tap/speechall

If you prefer not to use Homebrew, you can download the latest binary from the official GitHub repository and move it into a directory on your PATH. After installation, verify the setup by running speechall --version.
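The verification step can be wrapped in a small shell helper. This is a minimal sketch, not part of the skill itself: the function name check_speechall is made up for illustration, and the only speechall invocation it relies on is the speechall --version command mentioned above.

```shell
# Hypothetical helper (the name is an assumption, not part of speechall).
# Prints the installed version, or reports that the binary is not on PATH.
check_speechall() {
  if command -v speechall >/dev/null 2>&1; then
    # --version is the verification command named in the installation notes.
    speechall --version
  else
    echo "speechall not found on PATH" >&2
    return 1
  fi
}
```

Calling check_speechall right after installation should print the version; a non-zero exit status suggests the binary is missing or the PATH has not been reloaded in the current shell.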

Use Cases

Professional Transcription: Automatically convert long interview recordings or meeting audio into structured text documents.
Content Creation: Generate subtitle files (SRT/VTT) from video presentations for accessibility or social media captions.
Domain-Specific Accuracy: Improve recognition of technical terminology in medical, legal, or engineering audio using the custom vocabulary feature.
Speaker Analysis: Use diarization to automatically attribute dialogue to specific speakers in multi-participant recordings.

Example Prompts

"Transcribe this interview file named recording.mp3 and save the output as an SRT file with speaker diarization enabled for two people."
"Can you list all the available STT models supported by speechall so I can see which ones support high-accuracy German transcription?"
"Transcribe my meeting video titled sync.mp4, use the Deepgram Nova-2 model, and boost the vocabulary for the term 'OpenClaw'."

Tips & Limitations

API Key: To avoid passing your API key in plain text within commands, set the SPEECHALL_API_KEY environment variable in your shell configuration.
Diarization: Results tend to improve when you provide an estimate of the number of speakers expected in the audio file.
Large Files: The tool handles video files by extracting the audio, but very large files may take a long time to process; for long-form content, consider splitting files beforehand.
Credits: Make sure your Speechall account has sufficient credits for the models you intend to use.
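The API key advice can be enforced with a small guard in a wrapper script. This is a sketch under stated assumptions: the function name require_speechall_key is hypothetical, and the only name taken from the text above is the SPEECHALL_API_KEY variable.

```shell
# In ~/.bashrc or ~/.zshrc, with a placeholder value (not a real key):
#   export SPEECHALL_API_KEY="your-key-here"

# Hypothetical guard (the name is an assumption): stop early when the key
# is missing instead of falling back to passing it on the command line.
require_speechall_key() {
  if [ -z "${SPEECHALL_API_KEY:-}" ]; then
    echo "SPEECHALL_API_KEY is not set; export it in your shell configuration" >&2
    return 1
  fi
}
```

A script could call require_speechall_key before any transcription command and exit if it returns a non-zero status, so the key never has to appear in shell history.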


Metadata

Author: @atacan

Stars: 4473

Updated: 2026-05-01



Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-atacan-speechall-cli": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags (AI)

#transcription #cli #speech-to-text #audio-processing #productivity

Safety Score: 4/5

Flags: file-read, external-api
