OpenAI rolls out Advanced Voice Mode for select ChatGPT Plus users

· 2 min read
Advanced Voice Mode on ChatGPT

OpenAI has announced the rollout of Advanced Voice Mode to a small group of ChatGPT Plus users, offering more natural, real-time conversations, allowing users to interrupt anytime, and sensing and responding to emotions.

The New Feature

Advanced Voice Mode is a significant upgrade to ChatGPT's existing voice feature, enabling more natural and interactive conversations. The new mode uses a multimodal model, GPT-4o, which can process voice inputs without the need for auxiliary models, resulting in lower latency conversations. The feature is designed to sense emotional intonations in the user's voice, including sadness, excitement, or singing.

Availability and Rollout

The Advanced Voice Mode is currently available to a small group of ChatGPT Plus users, with plans to roll out to all Plus users in the fall. Users who have been granted access to the alpha will receive an email with instructions and a message in their mobile app. OpenAI will continue to add more people on a rolling basis.

Safety and Quality

OpenAI has emphasized its commitment to safety and quality in the development of Advanced Voice Mode. The company has tested GPT-4o's voice capabilities with over 100 external red teamers across 45 languages. To protect people's privacy, the model has been trained to only speak in four preset voices, and systems have been built to block outputs that differ from those voices. Additionally, OpenAI has implemented guardrails to block requests for violent or copyrighted content.

Company Behind the Feature

OpenAI is a leading AI research organization that has been at the forefront of developing innovative AI technologies. The company has been expanding its range of consumer-oriented AI tools, including the recent announcement of a search engine utilizing its AI technology.

Comparison to Similar Offerings

Advanced Voice Mode is a significant improvement over existing voice assistants, such as Alexa or Siri, which are often criticized for their mechanical tones and limited conversational capabilities. The new feature has the potential to elevate ChatGPT from a basic AI tool to a more interactive virtual assistant, allowing users to engage in conversations that feel as natural as chatting with a friend.

User Feedback and Initial Reactions

As the feature is still in its alpha stage, user feedback and initial reactions are limited. However, the demo showcased in May impressed audiences with its quick responses and uncanny resemblance to a real human's voice. The rollout of Advanced Voice Mode is expected to generate significant interest and excitement among ChatGPT users.