ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

IMA TTS Generator

Convert text, scripts, and captions into natural voiceovers for videos, explainers, product demos, and social posts.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/allenfancy-gan/ima-tts-ai
Or

What This Skill Does

The IMA Studio TTS skill enables the OpenClaw AI agent to convert text into high-quality, natural-sounding human speech. Utilizing the robust IMA Open API, this skill is specifically engineered to interface with the advanced seed-tts-2.0 engine, providing superior synthesis capabilities for a variety of audio requirements. The agent follows a structured protocol to ensure reliability: it first fetches product configurations, creates the synthesis task, and polls the API until the media resource is ready, providing the user with a direct, downloadable audio file (mp3/wav).

Installation

To install this skill, use the ClawHub command-line interface provided in the OpenClaw environment:

clawhub install openclaw/skills/skills/allenfancy-gan/ima-tts-ai

Ensure that you have your IMA API key configured in your environment variables as IMA_API_KEY. The skill requires this key to authenticate requests against the https://api.imastudio.com endpoint.

Use Cases

  • Content Creation: Quickly turn blog posts, articles, or scripts into podcasts and audiobooks.
  • Accessibility: Generate audio versions of text content to support visually impaired users.
  • Multimedia Production: Create voiceovers for training videos, presentations, or digital advertisements.
  • Personal Productivity: Transform long-form reading material into audio files to listen to while commuting or working.

Example Prompts

  1. "Convert this article on renewable energy into a professional-sounding audio file using the default seed-tts-2.0 model."
  2. "Please create a voiceover for my short script about AI trends; ensure the output is a high-quality mp3 file."
  3. "Synthesize the following text into speech: 'Welcome to the future of automated agent workflows' and give me the download URL once done."

Tips & Limitations

  • Mandatory Initialization: The agent must always query the product list first to retrieve valid attribute_id and credit requirements; skipping this step will cause the task to fail.
  • Version Control: Note that this skill is optimized strictly for seed-tts-2.0. Version seed-tts-1.1 is not supported, and requests attempting to use older models may encounter errors.
  • Polling Efficiency: The agent is designed to poll the task detail endpoint every 2 to 5 seconds. Do not manually interrupt the agent while it is in the polling phase, as the process manages its own state and retries.

Metadata

Stars4473
Views1
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-allenfancy-gan-ima-tts-ai": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#tts#voice-synthesis#audio-generation#text-to-speech
Safety Score: 4/5

Flags: external-api, network-access

Related Skills

IMA Sevio AI Generation

IMA model generation with exactly two Sevio models: Ima Sevio 1.0 and Ima Sevio 1.0-Fast. Supports text-to-video, image-to-video, first-last-frame, and reference-image workflows. Keeps the same API flow, reflection retry mechanism, and interface contract as ima-video-ai. Requires IMA API key.

allenfancy-gan 4473

IMA Nano Banana Image Generator

Nano Banana-only image generation on IMA Open API. Supports text_to_image and image_to_image with gemini-3.1-flash-image (budget) and gemini-3-pro-image (premium). Deterministic size/ratio mapping, 512/1K/2K/4K resolution. Requires IMA_API_KEY.

allenfancy-gan 4473

IMA Image Generator

Use when the user needs image generation or image transformation through the IMA Open API, including text-to-image, image-to-image, style transfer, or reference-image continuity, and the agent should use the setup, doctor, and live-catalog-aware runtime in this repo.

allenfancy-gan 4473

IMA AI Video Generator

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images and multi-shot production guidance. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.

allenfancy-gan 4473

IMA Music Generator

Generate voiceovers, narration, and spoken audio for videos, explainers, ads, and social content.

allenfancy-gan 4473