Google's integration of Bard into numerous applications such as Gmail, YouTube, Google Maps, and Flights is a significant advantage over ChatGPT. Therefore, OpenAI recently announced that the free version of ChatGPT will soon allow voice and image input.
This means users can request ChatGPT in a more natural way than typing on iPhones and Androids, or even use images to get better answers. The main point is that users won't have to pay for ChatGPT Plus to receive updates, although paid accounts will be among the first to try it.
Plus and Enterprise account users will receive this update in the next two weeks, followed by other user groups, including developers. Using images for input into ChatGPT is how multimodal AI models work. It's similar to how the search giant uses Google Lens with AI.
Two new features on ChatGPT are expected to attract more users than Google Bard.
Meanwhile, the voice support feature will only be available on the ChatGPT app for iPhone and Android. Users simply need to enable it in the app's settings after the feature is turned on. OpenAI says ChatGPT only needs a few seconds of sample speech to create human-like sounds from text, using a new text-to-speech model for this purpose.
This technology, capable of creating realistic synthesized speech from real speech in seconds, opens the door to many innovative and accessible applications. However, it also poses new risks, such as the potential for impersonating celebrities or engaging in fraudulent activities. OpenAI also stated that it is collaborating with Spotify to test a voice translation feature for podcasts, allowing creators to translate their content into other languages using their own voice.
Source link






Comment (0)