Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.kymaapi.com/llms.txt

Use this file to discover all available pages before exploring further.

Overview

nano-banana is Google’s gemini-2.5-flash-image — a Gemini-family image generator with one killer feature most others lack: native edit-mode. Pass a reference image alongside the prompt and the model edits it in context (inpainting, style transfer, object swap) rather than starting from scratch. The preview successor nano-banana-3-flash ships the newer Gemini 3.1 image stack on the same edit-mode pattern. See nano-banana-3-flash.

Specs

FieldValue
Model IDnano-banana
CreatorGoogle
BackendVertex AI (gemini-2.5-flash-image, us-central1)
Best forIn-context image editing, style transfer, low-cost iteration
Edit modeYes — pass image_url to enable
Aspect ratiosSuggested in prompt (model interprets — not contract-enforced today)
Pricing modeFlat per image (1K tier default)
Default latency~5–10s
OutputBlob-hosted URL (Vercel CDN)

Pricing

VariantProvider costKyma list
nano-banana$0.034$0.046
Flat rate. Vertex’s publisher endpoint accepts an imageConfig.imageSize field but ignores it today (probed 2026-05-16) — every call returns ~1K output. When Vertex honors size control we’ll switch to per-quality tiering (low/medium/high). Live source: GET https://kymaapi.com/v1/pricing.

Compared to other image models on Kyma

Strengthnano-banananano-banana-3-flashimagen-4flux-kontext-pro
Edit mode (image-in)★★★★★★★★★★★★★★★
Speed★★★★★★★★★★★★★★★★★★
Cost★★★★★★★★★★★★★★★★★★
Bleeding-edge quality★★★★★★★★★ (preview)★★★★★★★★
Style transfer★★★★★★★★★★★★★

Use this when

  • You need to edit an existing image (inpaint a background, swap an object, change style).
  • You’re iterating on a reference image and want each variant to be cheap.
  • You’re prototyping conversational image workflows (the model accepts multi-turn editing).

Pick something else when

  • You need text-from-scratch without a reference image → any other image model is fine.
  • You need legible text in imagegpt-image-2.
  • You need multi-reference blending (10 sources) → flux-2-pro.
  • You need vector / SVG outputrecraft-v4-vector.

Example — pure text-to-image

curl -X POST https://kymaapi.com/v1/images/generations \
  -H "Authorization: Bearer $KYMA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nano-banana",
    "prompt": "A tiny yellow star on a black background",
    "n": 1
  }'

Example — edit-mode

curl -X POST https://kymaapi.com/v1/images/generations \
  -H "Authorization: Bearer $KYMA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nano-banana",
    "prompt": "Add a small black cat in the foreground, keeping the rest of the scene unchanged",
    "image_url": "https://example.com/scene.jpg",
    "n": 1
  }'
Async endpoint — poll GET /v1/jobs/{id} until succeeded.