The Google Gemini upgrade uses the “nano banana” image model developed by Google DeepMind. The feature is now available globally for both free and paid users. Its biggest strength is its ability to keep faces and objects consistent in photos, something other AI tools often struggle with.
“We’ve really pushed the quality of the images and the ability of the model to follow instructions,” said Nicole Brichtova, product lead at DeepMind. “This update makes the edits more seamless and the results are good enough to be used for any purpose.”
Keep “you” in every photo
One of the things that makes AI photos look fake is that small details get distorted. Google says Gemini solves this problem, allowing you to change the entire scene while keeping the face and expression the same. You can try a new hairstyle, change the color of the wall, or bring a pet into the scene without worrying about image distortion.

Gemini also allows you to upload multiple photos to combine into one, such as combining a portrait with your cat to create a photo of the two of you riding together on the road.
Gemini supports multi-turn editing, allowing users to add every detail to a space: from wallpaper, furniture, to paint color. The plus point is that only the part that needs to be edited changes, the rest remains the same.
Additionally, Gemini can mix styles between photos. For example, turn rain boots into floral print shoes, or create a butterfly-patterned dress.
AI Image Creation Race Between Technology Giants
Google’s upgrade comes as the AI imaging wars heat up. OpenAI previously launched GPT-4o, which can generate images directly, and went viral with a series of Studio Ghibli-style memes. CEO Sam Altman revealed that the number of users increased so much that the company’s GPUs “almost melted.”
To keep up, Meta announced a partnership with Midjourney, while German startup Black Forest Labs with its FLUX model is dominating many charts.

Google hopes Gemini can close the gap with ChatGPT. Gemini currently has 450 million monthly users, according to CEO Sundar Pichai, far behind ChatGPT, which has more than 700 million weekly users.
Brichtova said Gemini is designed for real-world scenarios, from visualizing living rooms and gardens to creating entertaining photos. The model has better “ world knowledge,” and can combine multiple photos and color palettes into a single rendering.
However, Google also imposes strict limits. All generated images have a clear watermark and an identifying mark hidden in the metadata. The company strictly prohibits the creation of sensitive images without permission to prevent deepfake abuse.
Google has previously apologized for Gemini’s inaccurate historical imagery. This time, the company believes it has struck a balance between creativity and safety. “We want users to be creative, but not everything is allowed,” Brichtova stressed.
With Gemini 2.5 Flash Image, Google is betting on elevating the AI photo editing experience, hoping to retain old users and attract new ones in a fierce technology race with OpenAI, Meta, and other competitors.
(According to TechCrunch, Tom's Guide)

Source: https://vietnamnet.vn/google-gemini-nang-tam-ai-tao-anh-doi-nen-kieu-toc-chi-bang-mot-cau-lenh-2436782.html
Comment (0)