(CLO) Google has just launched a new artificial intelligence (AI) tool called "Whist," which allows users to upload photos to retrieve AI-generated composite images, even without entering any text.
Whisk uses AI to combine the subject, background, and style of an uploaded photo, thereby creating a fresh and unique image.
Whisk is described by Google as a "creative tool" that helps users quickly generate new visual ideas without requiring professional photo editing skills. According to Google, this tool is not a traditional image editor, but rather an exciting AI tool designed to spark creativity and rapid discovery .
When a user uploads an image, Whisk uses a combination of Google's AI service, Gemini, and Imagen 3 technology – a text-to-image creation tool that Google acquired from DeepMind.
Gemini will analyze the image and generate a caption, then Imagen 3 will creatively combine elements of that image, preserving the "essence" of the subject instead of an exact copy.
Whisk tool interface. Screenshot.
This means the final result may not be 100% identical to the original image. For example, the height, hairstyle, or skin tone of the subjects in the new image may differ from the original. However, users can still adjust the input information, change the background, style, or combine multiple themes to create different images.
Whisk can generate images not only from text but also from original images, expanding creative possibilities without requiring users to have photo editing experience. Thomas Iljic, product management director at Google Labs, stated: "Whisk is designed to help users creatively remix subjects, backgrounds, and styles, allowing them to explore visually rather than meticulously editing every single pixel."
Although Whisk is still in its early stages of development, the tool has been launched as a website on Google Labs and is now available to users in the US.
Dan Ives, managing director and senior analyst at Wedbush Securities, said Whisk marks another "moment of strength" for Google in the tech race.
Ives also noted that DeepMind, the AI lab that Google acquired in 2014, is a crucial asset that helps Google maintain its position in the AI field. AI products, including Whisk, are a key part of Google's product development strategy for the coming years, with many new products expected to launch in 2025.
Whisk's tools open up new avenues for using AI to create innovative products with minimal user intervention. This demonstrates the progress of AI in understanding and creatively combining visual elements.
Whisk is part of a strong trend among major tech companies, including Google and OpenAI, to develop AI tools for consumers. These tools aim to deliver innovative creative experiences, from image and text creation to video . Recently, OpenAI also introduced a text-to-video creation tool called Sora, a direct competitor to Whisk.
Ngoc Anh (according to CNN, The Verge, ZDNET)
Source: https://www.congluan.vn/google-ra-mat-cong-cu-tao-hinh-anh-ai-tu-hinh-anh-that-post326441.html






Comment (0)