OpenAI launches artificial intelligence-powered chat platform ChatGPT Advanced Sound Mode (AVM) announced an important update on it. These new features allow users to communicate more interactively with ChatGPT using their phone cameras and screen sharing functions. Especially the ability to share information visually and on screen expands the usage area of ChatGPT. Now, users can enable artificial intelligence to understand and respond to this information by pointing their device’s camera at a specific object or sharing the content on their screen.
Newly introduced features, ChatGPT Plus and Pro subscription holders It has been made available for access. The rollout for education and corporate customers is expected to begin in January 2025. This step by OpenAI reveals how artificial intelligence-supported assistants can be made more functional in different areas of daily life. In addition, the effects that innovations such as video and screen sharing will have on business and education are already a matter of curiosity.
ChatGPT interacts more comprehensively with visual modes
OpenAI introduced these features during a livestream. The company’s Chief Product Officer Kevin Weil and his team used ChatGPT’s visual capabilities to create a pour-over coffee making process did it step by step. In this process, the artificial intelligence model was able to successfully explain the steps of preparing coffee by analyzing a coffee machine. In addition, it provided guidance to the user by understanding the message on a phone screen with the screen sharing feature.
In addition to all these features, another detail that attracts users’ attention is Introducing the Santa Claus voice option in AI-powered voice mode happened. This option, which is activated by touching a snowflake icon in the application, offers a fun experience. However, it is stated that this audio option is only suitable for users aged 13 and over. Such small touches by OpenAI are considered an important step in making artificial intelligence technology more accessible and user-friendly.
Equipping ChatGPT with functional innovations such as video and screen sharing is a feature that Google recently announced. With Gemini 2.0 model It is interpreted as a result of the competition it entered. Gemini 2.0 is capable of processing both visual and audio data and can perform multi-step tasks on behalf of the user. It is known that this model was tested for different user scenarios under the names “Project Astra”, “Project Mariner” and “Project Jules”.
In this context, the visual and screen sharing capabilities that OpenAI added to ChatGPT are considered a remarkable move in the competitive artificial intelligence market. Such features, which provide practical benefits especially in daily life, prove that artificial intelligence-supported platforms can not only provide information but also a more comprehensive user experience.
ChatGPT’s visual analysis and screen sharing functions are expected to find wide use in the business and education sectors, as well as individual users. Especially analyzing visual content can stand out as a time-saving factor in business processes. Likewise, in education, it becomes possible for students to instantly interpret visual materials or simplify complex information with the help of artificial intelligence.