Phaya Media API

Phaya is a FastAPI backend that brokers AI media generation across KIE.ai (Sora 2, Veo 3.1, Seedance, Kling, Seedream, Suno), Google Gemini TTS, and OpenRouter LLMs.

Auth

All endpoints require a Bearer token or API key:

Authorization: Bearer <your_api_key>

Set these environment variables before using this skill:

export PHAYA_API_KEY="your_api_key_here"   # required — all endpoints
export PHAYA_BASE="https://your-api-host/api/v1"  # required — your Phaya instance URL

Get your profile and credit balance:

GET /api/v1/user/profile — full profile
GET /api/v1/user/credits → { "credits_balance": 84.90, ... }

Rate limit: 60 requests/minute per API key.

Cost Warning

This skill calls a paid credit system. Every generation deducts real credits from your account. Video generation can cost 8–50 credits per job. Start with cheap endpoints (text-to-image at 1 credit) to verify connectivity before running expensive jobs. Credits are auto-refunded on failure, but not on successful jobs you don't use.

Recommendation: create a scoped API key with a small credit balance for initial testing.

Credit System

Every generation costs credits deducted on job creation; auto-refunded on failure.

Credits	Service
0.5	image-to-video (FFmpeg local), Sora 2 character creation
1.0	text-to-image (Z-Image)
1.5	Seedream 5.0
2–4	Nano Banana 2 (1K/2K/4K resolution)
3.0	Text-to-music (Suno)
2–35	Seedance 1.5 Pro (resolution × duration × audio)
8.0	Sora 2 video
1.21–1.82/sec	Kling 2.6 motion control (720p/1080p)
15.0	Veo 3.1 fast (`veo3_fast`)
50.0	Veo 3.1 quality (`veo3`)

Job / Polling Pattern

Every generation is async. Create endpoints return job_id immediately; poll the status endpoint.

POST /api/v1/<service>/create   →  { "job_id": "uuid" }
GET  /api/v1/<service>/status/{job_id}  →  { "status": "...", "<media>_url": "..." }

Status values:

Image/music endpoints: PENDING, QUEUED, PROCESSING, COMPLETED, FAILED
Speech/subtitle endpoints: PENDING, PROCESSING, COMPLETED, FAILED
Video/download endpoints: processing, completed, failed, cancelled

Response URL field by media type:

Media type	Response field
Images	`image_url`
Videos	`video_url`
Audio / music	`audio_url` (music also returns `audio_urls[]`)
Sora 2 character	`character_id` (a string ID, not a URL)

Poll every 3–5 seconds until the terminal status is reached.

Quick Start

1. Generate an image (text-to-image)

import httpx, time

BASE = "https://your-api-host/api/v1"
HEADERS = {"Authorization": "Bearer YOUR_API_KEY"}

r = httpx.post(f"{BASE}/text-to-image/generate", headers=HEADERS, json={
    "prompt": "A futuristic city at sunset, ultra-detailed",
    "aspect_ratio": "16:9"
})
job_id = r.json()["job_id"]

phaya

Install via CLI (Recommended)