ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

phaya

Use the Phaya SaaS backend to generate images, videos, audio, music, and run LLM chat completions via simple REST API calls. Use when the user wants to generate media, call AI models, or use the Phaya API for image/video/audio/text generation.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/boombignose/phaya
Or

Phaya Media API

Phaya is a FastAPI backend that brokers AI media generation across KIE.ai (Sora 2, Veo 3.1, Seedance, Kling, Seedream, Suno), Google Gemini TTS, and OpenRouter LLMs.

Auth

All endpoints require a Bearer token or API key:

Authorization: Bearer <your_api_key>

Set these environment variables before using this skill:

export PHAYA_API_KEY="your_api_key_here"   # required — all endpoints
export PHAYA_BASE="https://your-api-host/api/v1"  # required — your Phaya instance URL

Get your profile and credit balance:

  • GET /api/v1/user/profile — full profile
  • GET /api/v1/user/credits{ "credits_balance": 84.90, ... }

Rate limit: 60 requests/minute per API key.

Cost Warning

This skill calls a paid credit system. Every generation deducts real credits from your account. Video generation can cost 8–50 credits per job. Start with cheap endpoints (text-to-image at 1 credit) to verify connectivity before running expensive jobs. Credits are auto-refunded on failure, but not on successful jobs you don't use.

Recommendation: create a scoped API key with a small credit balance for initial testing.

Credit System

Every generation costs credits deducted on job creation; auto-refunded on failure.

CreditsService
0.5image-to-video (FFmpeg local), Sora 2 character creation
1.0text-to-image (Z-Image)
1.5Seedream 5.0
2–4Nano Banana 2 (1K/2K/4K resolution)
3.0Text-to-music (Suno)
2–35Seedance 1.5 Pro (resolution × duration × audio)
8.0Sora 2 video
1.21–1.82/secKling 2.6 motion control (720p/1080p)
15.0Veo 3.1 fast (veo3_fast)
50.0Veo 3.1 quality (veo3)

Job / Polling Pattern

Every generation is async. Create endpoints return job_id immediately; poll the status endpoint.

POST /api/v1/<service>/create   →  { "job_id": "uuid" }
GET  /api/v1/<service>/status/{job_id}  →  { "status": "...", "<media>_url": "..." }

Status values:

  • Image/music endpoints: PENDING, QUEUED, PROCESSING, COMPLETED, FAILED
  • Speech/subtitle endpoints: PENDING, PROCESSING, COMPLETED, FAILED
  • Video/download endpoints: processing, completed, failed, cancelled

Response URL field by media type:

Media typeResponse field
Imagesimage_url
Videosvideo_url
Audio / musicaudio_url (music also returns audio_urls[])
Sora 2 charactercharacter_id (a string ID, not a URL)

Poll every 3–5 seconds until the terminal status is reached.

Quick Start

1. Generate an image (text-to-image)

import httpx, time

BASE = "https://your-api-host/api/v1"
HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}

r = httpx.post(f"{BASE}/text-to-image/generate", headers=HEADERS, json={
    "prompt": "A futuristic city at sunset, ultra-detailed",
    "aspect_ratio": "16:9"
})
job_id = r.json()["job_id"]

Metadata

Stars4190
Views0
Updated2026-04-18
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-boombignose-phaya": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.