ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified media Safety 4/5

gemini-image-proxy

Generate and edit images with Gemini API using the OpenAI Python SDK.

Why use this skill?

Easily generate and edit images using Gemini 3 Pro via the OpenAI SDK. Lightweight, portable, and perfect for containerized environments and automation.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/yspcoder/gemini-image-proxy
Or

What This Skill Does

The gemini-image-proxy skill provides a streamlined interface for interacting with high-end image generation models like Gemini 3 Pro Image. By leveraging the industry-standard OpenAI Python SDK, this skill abstracts away the complex configuration usually associated with proprietary Google AI endpoints. It allows users to perform two primary functions: generating brand-new images from text prompts and modifying existing local images using natural language instructions. Because it relies on the OpenAI SDK, the integration is highly portable, lightweight, and works seamlessly in resource-constrained environments like free-tier Fly.io instances or restricted containerized systems.

Installation

To get started, ensure you have Python 3.10 or newer installed on your machine. Install the required dependency via pip: python3 -m pip install openai. Once installed, configure your environment by exporting your API credentials: export GOOGLE_PROXY_API_KEY="your_api_key" and setting your endpoint via export GOOGLE_PROXY_BASE_URL="https://example.com/v1". These steps ensure the script can securely authenticate and route requests to the model provider.

Use Cases

This skill is ideal for rapid prototyping, content creation, and automated image post-processing. Use it to generate photorealistic assets for web design, create custom icons, or iterate on existing visual content. Its ability to perform edits (such as changing the lighting, art style, or background of an image) makes it a powerful tool for developers integrating AI-driven visual workflows into their own applications without needing to manage heavy dependencies like Pillow or specialized Google-specific client libraries.

Example Prompts

  1. "Generate a high-resolution, photorealistic image of a futuristic cyberpunk city skyline at night with neon blue and purple lights."
  2. "Edit this portrait.png file to change the background from a simple office wall to a lush tropical forest while maintaining the subject's lighting."
  3. "Create a minimalist vector-style icon of a cup of coffee with a steam swirl, white background, suitable for a mobile app user interface."

Tips & Limitations

To achieve the best results, use descriptive adjectives in your prompts. While the script supports PNG, JPG, JPEG, GIF, and WEBP, larger file sizes may increase processing time. Ensure your environment variables are correctly set before execution to avoid authentication errors. Note that the model defaults to Gemini 3 Pro Image, but you can modify the source script to leverage other supported models like Imagen 4.0 or Gemini 2.5 Flash if your specific use case requires different latency or quality characteristics.

Metadata

Author@yspcoder
Stars879
Views1
Updated2026-02-11
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-yspcoder-gemini-image-proxy": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#image-generation#gemini#openai-sdk#ai-art#developer-tools
Safety Score: 4/5

Flags: file-write, file-read, external-api