Official Verified media Safety 4/5

nanobanana-skill

Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image", "create image", "edit image", "AI drawing", "图片生成", "AI绘图", "图片编辑", "生成图片".

Why use this skill?

Use the nanobanana skill to generate custom artwork and edit images with the Google Gemini API directly from OpenClaw.

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/feiskyer/nanobanana-skill

What This Skill Does

The nanobanana-skill is an image interface for OpenClaw that uses Google's Gemini API for image generation and targeted image editing. It turns natural-language prompts into images without leaving the terminal or chat interface. In practice it is a command-line wrapper around the Gemini image models, suited to creating artistic assets, generating mockups, and making iterative edits to existing images.

Installation

To add nanobanana-skill to your OpenClaw environment, run the install command, then install the Python dependencies. The skill requires Python 3 and the google-genai, Pillow, and python-dotenv libraries, which are listed in the requirements.txt file in the skill's plugin root directory. You also need an active Google Gemini API key: store it in an environment file at ~/.nanobanana.env, or export it as a variable in your shell session.
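
The key lookup described above can be sketched as follows. This is a minimal illustration using only the standard library, not the skill's own code. The variable name GEMINI_API_KEY and the rule that a shell variable wins over the file are assumptions, since this page names neither; check the skill's documentation for the actual name.

```python
import os
from pathlib import Path

# Assumption: the page does not name the variable; GEMINI_API_KEY is a guess.
KEY_NAME = "GEMINI_API_KEY"


def parse_env_text(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        if line.startswith("export "):
            line = line[len("export "):]
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def resolve_api_key(env_file=Path.home() / ".nanobanana.env", environ=None):
    """Return the key from the shell environment, else from the env file, else None."""
    environ = os.environ if environ is None else environ
    # Assumption: an exported shell variable takes precedence over the file.
    if environ.get(KEY_NAME):
        return environ[KEY_NAME]
    path = Path(env_file)
    if path.is_file():
        return parse_env_text(path.read_text(encoding="utf-8")).get(KEY_NAME)
    return None
```

Calling resolve_api_key() before invoking the skill gives an early, local check that a key is configured, without making any API request.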

Use Cases

This skill is designed for creators, developers, and productivity-focused users who need fast image manipulation. Use it for generating rapid prototypes of UI concepts, creating conceptual art for blog posts, or modifying existing photos to include specific atmospheric elements. It is particularly effective for workflows requiring automated visual asset generation, such as batch-producing icons or applying consistent artistic filters across multiple images.

Example Prompts

  1. "nanobanana, generate an image of a cybernetic cat sitting on a neon rooftop, use 16:9 aspect ratio and save it as cyberpunk_cat.png"
  2. "I need to edit this file, create an AI drawing version of input.png that adds a stormy background but keeps the foreground character"
  3. "nanobanana, please create a high-resolution 2K image of a serene minimalist desert landscape for my desktop wallpaper"

Tips & Limitations

For best results, write descriptive, detailed prompts. The default 9:16 aspect ratio suits most social media content; use the --size options to match other target platforms exactly. The skill calls an external API, so it requires internet connectivity. Check your API key permissions and usage quotas in Google AI Studio. When editing an existing image, state precisely what must stay unchanged so that the model preserves the rest of the original.
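
When matching a target platform, it can help to work out the pixel dimensions that an aspect ratio implies. The helper below is only arithmetic and is not part of the skill. The function name dimensions_for is hypothetical, and the values that --size actually accepts are not listed on this page, so confirm them against the skill's own help output.

```python
def dimensions_for(aspect, long_edge):
    """Return (width, height) for an aspect string like "16:9".

    long_edge is the pixel length of the longer side.
    """
    w_part, h_part = (int(part) for part in aspect.split(":"))
    if w_part <= 0 or h_part <= 0 or long_edge <= 0:
        raise ValueError("aspect parts and long_edge must be positive")
    if w_part >= h_part:
        # Landscape or square: the width is the long edge.
        width = long_edge
        height = round(long_edge * h_part / w_part)
    else:
        # Portrait: the height is the long edge.
        height = long_edge
        width = round(long_edge * w_part / h_part)
    return width, height


print(dimensions_for("9:16", 1920))  # portrait, e.g. vertical social content
print(dimensions_for("16:9", 1920))  # landscape, e.g. a desktop wallpaper
```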

Metadata

Author: @feiskyer
Stars: 2387
Views: 0
Updated: 2026-03-09

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-feiskyer-nanobanana-skill": {
      "enabled": true,
      "auto_update": true
    }
  }
}
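
If clawhub.json already lists other plugins, pasting the snippet over the file would drop them. A minimal sketch of merging the entry instead is shown below. The helper names and the assumption that clawhub.json sits in the current directory are illustrative only; the plugin ID and the two settings are taken from the snippet above.

```python
import json
from pathlib import Path

PLUGIN_ID = "official-feiskyer-nanobanana-skill"


def enable_plugin(config, plugin_id=PLUGIN_ID):
    """Return a copy of config with the plugin entry added or updated."""
    merged = dict(config)
    plugins = dict(merged.get("plugins", {}))
    entry = dict(plugins.get(plugin_id, {}))
    entry.update({"enabled": True, "auto_update": True})
    plugins[plugin_id] = entry
    merged["plugins"] = plugins
    return merged


def update_config_file(path="clawhub.json"):
    """Read the config file if it exists, merge the entry, and write it back.

    Assumption: the file location is passed in by the caller; this page
    does not say where clawhub.json lives.
    """
    config_path = Path(path)
    if config_path.is_file():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    else:
        config = {}
    merged = enable_plugin(config)
    config_path.write_text(json.dumps(merged, indent=2) + "\n", encoding="utf-8")
```

Keeping enable_plugin free of file access makes the merge easy to check on a plain dictionary before touching the real file.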

Tags (AI)

#ai-art #image-generation #gemini #visual-design #productivity
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution

Related Skills

codex-skill

Use when user asks to leverage codex, gpt-5, or gpt-5.1 to implement something (usually implement a plan or feature designed by Claude). Provides non-interactive automation mode for hands-off task execution without approval prompts.

feiskyer 2387

autonomous-skill

Use when user wants to execute long-running tasks that require multiple sessions to complete. This skill manages task decomposition, progress tracking, and autonomous execution using Claude Code headless mode with auto-continuation. Trigger phrases: "autonomous", "long-running task", "multi-session", "自主执行", "长时任务", "autonomous skill".

feiskyer 2387

claude-skill

Use when user asks to leverage claude or claude code to do something (e.g. implement a feature design or review code). Provides non-interactive automation mode for hands-off task execution without approval prompts.

feiskyer 2387

youtube-transcribe-skill

Extract subtitles/transcripts from YouTube videos. Triggers: "youtube transcript", "extract subtitles", "video captions", "视频字幕", "字幕提取", "YouTube转文字", "提取字幕".

feiskyer 2387

deep-research

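A multi-agent orchestration workflow for deep research: split a research goal into sub-goals that can run in parallel, and run them as subprocesses using Claude Code non-interactive mode (`claude -p`); for web access and data collection, prefer installed skills first, then MCP tools; aggregate the sub-results with scripts and refine them chapter by chapter, finally delivering "the finished report file path + a summary of key conclusions/recommendations". Use for: systematic web/document research, competitor/industry analysis, sharded retrieval over batches of links or datasets, long-form writing and evidence integration, or when the user mentions "深度调研/Deep Research/Wide Research/多 Agent 并行调研/多进程调研" and similar phrases.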

feiskyer 2387