(CLO) Google has just launched a new artificial intelligence (AI) tool called "Whisk," which lets users upload photos as prompts and generate AI-composited images without entering any text.
Whisk uses AI to combine the subject, context, and style of the uploaded photos into a new, distinctive image.
Google describes Whisk as a “creative tool” that helps users quickly generate new visual ideas without requiring professional photo editing skills. According to Google, the tool is not a traditional image editor but a fun AI tool that aims to spark creativity and rapid exploration.
When a user uploads a photo, Whisk uses a combination of Gemini, Google's AI model, and Imagen 3, a text-to-image model developed by Google DeepMind.
Gemini analyzes the photo and writes a caption, then Imagen 3 generates a new image from that description, retaining the “essence” of the subject rather than copying it exactly.
Whisk tool interface. Screenshot.
This means the final result may not be 100% identical to the original image. For example, the height, hairstyle, or skin tone of the subjects in the new image may differ from the original. However, users can still adjust the input information, change the background, style, or combine multiple themes to create different images.
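The caption-then-generate flow described above can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the `RemixInputs` class and the `build_prompt` function are hypothetical names, the captions are typed by hand rather than written by Gemini, and no image model is called. It shows why details can drift (only the text description is passed on, not the pixels) and how swapping one input changes the result while the others stay the same.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class RemixInputs:
    """Text captions standing in for the uploaded images (hypothetical)."""
    subject: str
    scene: str
    style: str


def build_prompt(inputs: RemixInputs) -> str:
    """Merge the three captions into one text prompt for an image model."""
    return f"{inputs.subject}, set in {inputs.scene}, rendered as {inputs.style}"


# In the real tool a captioning model would write these descriptions;
# here they are typed by hand so the sketch runs without any service.
original = RemixInputs(
    subject="a small brown dog with floppy ears",
    scene="a snowy mountain village",
    style="a watercolor painting",
)
print(build_prompt(original))

# Changing only the style keeps the subject and scene captions intact.
restyled = replace(original, style="a plush toy")
print(build_prompt(restyled))
```

Anything the caption leaves out, such as exact height or hairstyle, is not carried into the prompt, which is consistent with the differences from the original photo noted above.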
Whisk can generate images not only from text but also from original images, expanding creative possibilities without requiring users to have photo editing experience. Thomas Iljic, product management director at Google Labs, stated: "Whisk is designed to help users creatively remix subjects, backgrounds, and styles, allowing them to explore visually rather than meticulously editing every single pixel."
Although Whisk is still in its early stages of development, the tool has been launched as a website on Google Labs and is now available to users in the US.
Dan Ives, managing director and senior analyst at Wedbush Securities, said Whisk marks another "moment of strength" for Google in the tech race.
Ives also noted that DeepMind, the AI lab that Google acquired in 2014, is a crucial asset that helps Google maintain its position in the AI field. AI products, including Whisk, are a key part of Google's product development strategy for the coming years, with many new products expected to launch in 2025.
Whisk opens up new ways of using AI to create images with minimal user effort. It also shows how far AI has progressed in understanding visual elements and combining them creatively.
Whisk is part of a strong trend among major tech companies, including Google and OpenAI, to develop AI tools for consumers. These tools aim to deliver new creative experiences, from image and text creation to video. Recently, OpenAI also introduced Sora, a text-to-video tool that competes with Google's generative AI products.
Ngoc Anh (according to CNN, The Verge, ZDNET)
Source: https://www.congluan.vn/google-ra-mat-cong-cu-tao-hinh-anh-ai-tu-hinh-anh-that-post326441.html









