smallest-ai
Ultra-fast text-to-speech and speech-to-text via Smallest AI's Lightning v3.1 and Pulse models. Use when the user wants to generate speech, convert text to voice, read text aloud, create voice notes, transcribe audio to text, or clone a voice. Sub-100ms latency TTS. 64ms TTFT STT. Supports 30+ languages including Hindi and Spanish. Voices include sophia, robert, advika, vivaan, camilla, and 80+ more.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/abhishekmishragithub/smallest-ai
Smallest AI: Ultra-Fast Voice Suite
Text-to-speech (sub-100ms) via Lightning v3.1 and speech-to-text (64ms TTFT) via Pulse.
Setup
- Get an API key from https://waves.smallest.ai (click "API Key" in the left panel).
- Set SMALLEST_API_KEY in your environment:
export SMALLEST_API_KEY="your_key_here"
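A caller may want to fail early when the key is missing rather than send an unauthenticated request. The following is a minimal sketch under that assumption; load_api_key is a hypothetical helper, not part of the skill's scripts, and only the SMALLEST_API_KEY variable name comes from this document.

```python
import os


def load_api_key(env=None):
    """Return the API key from the environment or raise a clear error.

    Hypothetical helper for illustration; not shipped with the skill.
    """
    env = os.environ if env is None else env
    key = env.get("SMALLEST_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "SMALLEST_API_KEY is not set; export it before running the scripts"
        )
    return key
```

Passing a plain dict as env makes the check easy to exercise without touching the real environment.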
Defaults
- Default female voice: sophia (American English)
- Default male voice: robert (American English)
- Default language: en
- Default speed: 1.0
- Default sample rate: 24000
Voice Selection Rules
Follow these rules to select the voice:
- If user explicitly names a voice (e.g. "use advika"), use that voice.
- If user asks for a male voice, use the configured defaultVoiceMale.
- If user asks for a female voice, use the configured defaultVoiceFemale.
- If no gender preference, use defaultVoiceFemale (sophia by default).
- For Hindi content: use advika (female) or vivaan (male).
- For Spanish content: use camilla (female) or carlos (male).
- For Tamil content: use anitha (female) or raju (male).
Always pass the configured defaultLanguage, defaultSpeed, and defaultSampleRate as --lang, --speed, and --rate flags unless the user overrides them.
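The selection rules above can be sketched as a small function. This is an illustration, not the skill's own code: select_voice and LANGUAGE_VOICES are hypothetical names, the language codes hi, es, and ta are assumed (the document only states en), and the sketch assumes that a language-specific pair takes precedence over the English defaults when the content is Hindi, Spanish, or Tamil.

```python
# Per-language voice pairs taken from the rules above.
# The codes "hi", "es", "ta" are assumptions; the document only names "en".
LANGUAGE_VOICES = {
    "hi": {"female": "advika", "male": "vivaan"},
    "es": {"female": "camilla", "male": "carlos"},
    "ta": {"female": "anitha", "male": "raju"},
}


def select_voice(explicit=None, gender=None, lang="en",
                 default_female="sophia", default_male="robert"):
    """Pick a voice id following the documented rules (hypothetical helper)."""
    # Rule 1: an explicitly named voice always wins.
    if explicit:
        return explicit
    # No gender preference falls back to the female default.
    gender = "male" if gender == "male" else "female"
    # Assumed precedence: language-specific voices before English defaults.
    if lang in LANGUAGE_VOICES:
        return LANGUAGE_VOICES[lang][gender]
    return default_male if gender == "male" else default_female
```

For example, select_voice(gender="male", lang="hi") picks vivaan, while select_voice() with no arguments falls back to sophia.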
Text-to-Speech
Generate speech audio from text using the Lightning v3.1 model.
Shell (preferred — zero dependencies)
{baseDir}/scripts/tts.sh "Text to speak" --voice sophia --rate 24000 --speed 1.0 --lang en
Python (requires pip install smallestai or just requests)
python3 {baseDir}/scripts/tts.py "Text to speak" --voice sophia --speed 1.0 --lang en --out speech.wav
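When the shell script is driven from another program, building the argument list explicitly avoids quoting problems with the text. A minimal sketch, assuming the flags shown above; build_tts_command and run_tts are hypothetical wrappers, base_dir stands in for {baseDir}, and it is assumed that tts.sh accepts --out as listed under Options.

```python
import subprocess


def build_tts_command(base_dir, text, voice="sophia", rate=24000,
                      speed=1.0, lang="en", out=None):
    """Build the argv list for scripts/tts.sh (hypothetical wrapper)."""
    cmd = [
        f"{base_dir}/scripts/tts.sh", text,
        "--voice", voice,
        "--rate", str(rate),
        "--speed", str(speed),
        "--lang", lang,
    ]
    # Only pass --out when the caller wants a specific output path.
    if out is not None:
        cmd += ["--out", str(out)]
    return cmd


def run_tts(base_dir, text, **options):
    """Run the script; raises CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_tts_command(base_dir, text, **options), check=True)
```

Passing the command as a list rather than a single string means the text is never interpreted by a shell, so quotes and punctuation in it need no escaping.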
Voices
| Voice | Gender | Accent | Best For |
|---|---|---|---|
| sophia | Female | American | General use (default) |
| robert | Male | American | Professional, reports (default) |
| advika | Female | Indian | Hindi content, code-switch |
| vivaan | Male | Indian | Bilingual English/Hindi |
| camilla | Female | Mexican/Latin | Spanish content |
| zara | Female | American | Conversational |
| melody | Female | American | Storytelling, greetings |
| arjun | Male | Indian | English/Hindi bilingual |
| stella | Female | American | Expressive, warm |
80+ more voices available. List all with: {baseDir}/scripts/voices.sh
Options
- --voice <id>: Voice identifier (default: sophia)
- --rate <hz>: Sample rate, one of 8000 | 16000 | 24000 | 44100 (default: 24000)
- --speed <n>: Playback speed 0.5–2.0 (default: 1.0)
- --lang <code>: Language code (default: en). See {baseDir}/references/languages.md
- --out <path>: Output file (default: auto-named media/tts_<timestamp>.wav)
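The documented ranges and defaults can be expressed as an argument parser. This is a sketch of how such options could be validated, not the actual contents of tts.py; build_parser and speed_value are hypothetical names, and the default output name is left to the script.

```python
import argparse


def speed_value(raw):
    """Parse --speed and enforce the documented 0.5 to 2.0 range."""
    value = float(raw)
    if not 0.5 <= value <= 2.0:
        raise argparse.ArgumentTypeError("speed must be between 0.5 and 2.0")
    return value


def build_parser():
    """Hypothetical parser mirroring the options and defaults listed above."""
    parser = argparse.ArgumentParser(prog="tts-sketch")
    parser.add_argument("text", help="Text to speak")
    parser.add_argument("--voice", default="sophia")
    parser.add_argument("--rate", type=int, default=24000,
                        choices=[8000, 16000, 24000, 44100])
    parser.add_argument("--speed", type=speed_value, default=1.0)
    parser.add_argument("--lang", default="en")
    parser.add_argument("--out", default=None)
    return parser
```

With this parser, an unsupported sample rate or an out-of-range speed is rejected before any request is made.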
Output
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-abhishekmishragithub-smallest-ai": {
      "enabled": true,
      "auto_update": true
    }
  }
}
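If clawhub.json already lists other plugins, pasting the snippet over the file would drop them. A minimal sketch of merging the entry instead, assuming the file is plain JSON with a top-level plugins object as shown; enable_plugin and update_config_file are hypothetical helpers.

```python
import json

PLUGIN_ID = "official-abhishekmishragithub-smallest-ai"


def enable_plugin(config):
    """Add or overwrite this plugin's entry, keeping other plugins intact."""
    plugins = config.setdefault("plugins", {})
    plugins[PLUGIN_ID] = {"enabled": True, "auto_update": True}
    return config


def update_config_file(path):
    """Read the JSON file at `path`, enable the plugin, and write it back."""
    with open(path, encoding="utf-8") as handle:
        config = json.load(handle)
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(enable_plugin(config), handle, indent=2)
```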