Imagine almost anything – then create it. Generate detailed, playful, realistic, or whimsical images. Or anything in-between.

The Gemini Image model uses deep language understanding to capture the nuance of your prompts — bridging the gap between what you say and what you envision.

Capabilities

An icon of a curved back arrow with an AI sparkle symbol, representing conversational inputs.An icon of a curved back arrow with an AI sparkle symbol inside a dark rounded square, representing conversational inputs.

Multimodal understanding

Upload images and share text instructions with Gemini to create complex and detailed images.

An icon of a chat bubble with lines of text and an AI sparkle symbol, representing conversational inputs.An icon of a chat bubble with lines of text and an AI sparkle symbol inside a dark rounded square, representing conversational inputs.

Conversational inputs

Use everyday language while creating images, and keep the conversation going to refine what the model generates.

An icon of a globe inside a white rounded square, representing real-world knowledge.An icon of a white globe representing real-world knowledge inside a dark rounded square, set against a dark blue gradient background.

Real-world knowledge

Generate images that follow real-world logic, thanks to Gemini’s advanced reasoning capabilities.


Model family

Gemini Image models are natively multimodal, and respond effectively and efficiently to even the most detailed prompts.

A cinematic, wide-angle shot of a vintage orange sedan parked on a dark, asphalt road at night. The scene is enveloped in a thick, teal-tinted fog that creates a moody, ethereal atmosphere. The car's headlights are turned on, casting powerful beams of warm light through the mist. To the left, the silhouettes of towering palm trees and a suburban house loom in the background, while a solitary streetlamp glows on the right, further illuminating the hazy green air.

Try Nano Banana

Gemini Image