OpenAI has made a radical interface change to the ChatGPT Voice feature. Users can now receive voice responses directly within the chat screen, rather than in a separate mode.
With the update, ChatGPT Voice is now removed from a private mode. In the past, when you wanted to start a voice conversation, the screen would literally change and a different interface would open. However, with the latest edit, it is possible to initiate voice communication by touching the microphone icon in the window where the chat is ongoing. Thanks to the new layout, it has become possible to see the text transcript of the conversation along with the voice responses. This offers a functional innovation, especially for those who miss voice responses.
You can use this feature by tapping the wave icon that appears in the chat window during a conversation. Voice chat, which worked with a special animation in blue tones in the old interface, is now presented in a simpler structure. Seeing the written version of the speech simultaneously provides practicality, especially in queries involving information. Moreover, it is not just the text; Maps, photographs and graphs can also accompany the answer. Thanks to such visual components, the context of the answer can be understood more clearly. In this respect, text-supported voice chat creates a more comprehensive communication experience.
In the promotional video shared by OpenAI, the list of popular bakeries appears on the screen both in text and visual form during a call via ChatGPT Voice. Tartine When samples from the patisserie named are shown, the user can examine the locations on the map along with the visuals. Such details reveal that visually supported information transfer is more accessible, especially for mobile users. However, an option is offered for those who do not want to give up the old voice mode. You can return to the previous structure by activating “Separate Mode” under the “Voice Mode” heading in the Settings menu.
ChatGPT Voice now both speaks and shows
This new configuration is OpenAI’s multimodal artificial intelligence It is a natural expansion within its architecture. Previously, users could only enrich written responses with visual content. Now, the same integration is provided during the conversation. In addition to visual supports, providing easy access to previous messages makes the user experience more fluid. Sometimes it is necessary to look back at the content we talked about, and this is now possible. The answers are given both audibly and as text on the screen. It becomes much easier to re-read information that is important to you after the conversation.
This move by OpenAI, Google’s Gemini Live It’s part of a general trend to similarly enrich voice responses. Google aims to connect with the user by placing interactive emphasis on the live image. This version of ChatGPT is not built directly on interactive video; But a user experience close to this is provided. Answers supported by side elements such as maps, photographs or tables offer content that complements the voice. This preserves the opportunity to communicate verbally without completely separating from written expression. Thus, the user both listens and sees.
The update is available via mobile applications and web interface. If your application is up to date, you can use this feature. However, if your app is an old version, you will need to upgrade to the latest version from the store. With this transition, users’ voice interaction habits may change. Because performing this operation directly on the chat screen without going to a separate mode is a time-saving step. In the new system, there is no need to switch to different screens to both start a conversation and follow that conversation. This simplification may have practical consequences, especially for subjects with intense information flow.
With this change in the application, ChatGPT Voice has become more accessible. In addition to visual supports, one of the notable innovations was that users could easily access previous voice messages. If you prefer, it is also possible to return to the old voice interface from the Settings section. Thus, you can choose the appropriate usage method according to your own habits.