Create and edit images in one workflow with Nano Banana 2 (Gemini 3.1 Flash Image) — fast outputs, stable details, and iterative, conversational refinements.
Text to image: generate new visuals from a prompt. Image editing: upload a reference image and change only what you specify. Multi-turn updates: refine step by step without restarting. Flexible sizes: social posts, banners, posters — up to 4K where supported.
Best for creators and teams who need rapid drafts for social visuals, ads, posters, product mockups, and simple infographics.


Using the provided luxury skeleton watch image, generate two side-by-side product shots of the same watch from slightly different angles. Keep the watch design, rose-gold movement, blue jewels, silver case, and black leather strap exactly the same. Use a dark reflective surface with mirror-like reflections underneath. Studio lighting with soft highlights on the metal surfaces. The final image should look like a premium watch brand advertising photo. Ultra high resolution, sharp details, professional product photography.
Nano Banana 2 is Google’s Gemini 3.1 Flash Image model (preview model ID: "gemini-3.1-flash-image-preview"). It’s designed for high-speed image generation and multi-turn editing while maintaining strong visual quality.
Instead of generating a single final image, Nano Banana 2 supports iterative edits: Replace or add objects without changing the whole scene. Adjust background, lighting, and style with minimal drift. Keep key subjects consistent across multiple variations.
In practice, it works as both a text-to-image generator and an AI image editor in the same workspace (app + API availability depends on platform/region).

Optimized for low-latency generation and fast iteration while keeping lighting, texture, and clarity stable across edits.

Uses Gemini’s real-world knowledge and (in supported modes) web search grounding to improve accuracy for specific subjects, diagrams, and infographic-style visuals.

Renders readable text inside images for posters, ads, UI mockups, and invitations. Supports translation/localization so layouts stay legible across languages.

Keeps up to five characters consistent within the same workflow and preserves multiple objects without unwanted distortion — useful for storyboards, comics, and series assets.

Follows detailed composition and style instructions closely, reducing unnecessary changes between generations and edits.

Generate images for different aspect ratios and resolutions — from quick drafts to high-resolution exports up to 4K (where supported).

Edit scenes (day→night, background swaps, angle tweaks) or transfer style from a reference. Continue refining via conversation without losing the core subject.
Generate scroll-stopping visuals quickly and iterate styles fast — from portraits to bold poster designs.
Create ad creatives and marketing mockups with clear text rendering and flexible export sizes for different placements.
Turn notes or structured data into simple diagrams and infographic-style visuals using real-world knowledge and, where available, search grounding.
Place readable text into posters, invitations, and comic panels, then translate/localize while keeping layout stability.
Keep a consistent cast and visual identity across multiple frames for narratives, comics, and campaign series.

