wavespeed-nano-banana-pro
Generate and edit images using Google's Nano Banana Pro model via WaveSpeed AI. Supports text-to-image generation and image editing with natural language prompts. Features native 4K resolution, flexible aspect ratios, multilingual text rendering, and camera-style controls. Use when the user wants to create images from text or edit existing images.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/chengzeyi/wavespeed-nano-banana-proWaveSpeedAI Nano Banana Pro Image Generation/Editing
Generate and edit images using Google's Nano Banana Pro model via the WaveSpeed AI platform. Supports both text-to-image generation and natural-language image editing with up to 14 input images.
Authentication
export WAVESPEED_API_KEY="your-api-key"
Get your API key at wavespeed.ai/accesskey.
Quick Start
Text-to-Image
import wavespeed from 'wavespeed';
const output_url = (await wavespeed.run(
"google/nano-banana-pro/text-to-image",
{ prompt: "A serene Japanese garden with cherry blossoms, watercolor style" }
))["outputs"][0];
Image Editing
The images parameter accepts an array of image URLs. If you have local files, upload them first with wavespeed.upload() to get a URL.
import wavespeed from 'wavespeed';
// Upload a local image to get a URL
const imageUrl = await wavespeed.upload("/path/to/photo.png");
const output_url = (await wavespeed.run(
"google/nano-banana-pro/edit",
{
images: [imageUrl],
prompt: "Replace the sky with a dramatic sunset"
}
))["outputs"][0];
You can also pass existing image URLs directly:
const output_url = (await wavespeed.run(
"google/nano-banana-pro/edit",
{
images: ["https://example.com/photo.jpg"],
prompt: "Replace the sky with a dramatic sunset"
}
))["outputs"][0];
API Endpoints
Text-to-Image
Model ID: google/nano-banana-pro/text-to-image
Generate images from text prompts.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
prompt | string | Yes | -- | Text description of the image to generate |
aspect_ratio | string | No | -- | Output aspect ratio. One of: 1:1, 3:2, 2:3, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 |
resolution | string | No | 1k | Image resolution. One of: 1k, 2k, 4k |
output_format | string | No | png | Output format. One of: png, jpeg |
Example
import wavespeed from 'wavespeed';
const output_url = (await wavespeed.run(
"google/nano-banana-pro/text-to-image",
{
prompt: "A red vintage Porsche 911 on a winding mountain road at golden hour, photorealistic",
aspect_ratio: "16:9",
resolution: "2k",
output_format: "png"
}
))["outputs"][0];
Image Editing
Model ID: google/nano-banana-pro/edit
Edit existing images using natural language prompts. Supports up to 14 input images.
Parameters
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-chengzeyi-wavespeed-nano-banana-pro": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
wavespeed-watermark-remover
Remove watermarks, logos, captions, and text overlays from images and videos using WaveSpeed AI. Intelligently detects and removes watermarks while preserving texture and background. Supports images and videos up to 10 minutes. Use when the user wants to remove watermarks or text overlays from media.
wavespeed-face-swapper
Swap faces in images and videos using WaveSpeed AI. Supports image face swap and video face swap with multi-face targeting. Produces watermark-free results with automatic lighting and skin tone adaptation. Use when the user wants to replace a face in an image or video with another face.
wavespeed-infinitetalk
Generate talking head videos from a portrait image and audio using WaveSpeed AI's InfiniteTalk model. Produces lip-synced video up to 10 minutes long at 480p or 720p. Supports optional mask images to target specific faces and text prompts for additional guidance. Use when the user wants to animate a face with audio or create talking avatar videos.
wavespeed-minimax-speech-26
Convert text to speech using MiniMax Speech 2.6 Turbo via WaveSpeed AI. Features ultra-human voice cloning, sub-250ms latency, 40+ languages, emotion control, and 200+ voice presets. Use when the user wants to generate speech audio from text.
wavespeed-nano-banana-2
Generate and edit images using Google's Nano Banana 2 model via WaveSpeed AI. Supports text-to-image generation and image editing with natural language prompts. Features native 4K resolution, flexible aspect ratios including ultra-narrow (1:8, 8:1), multilingual text rendering, and camera-style controls. Use when the user wants to create images from text or edit existing images.