SAN FRANCISCO (AP) — OpenAI’s latest update to its artificial intelligence model can mimic human cadences in its verbal responses and can even try to detect people’s moods.
The effect conjures up images of the 2013 Spike Jonze movie “Her,” in which the (human) main character falls in love with an artificially intelligent operating system, leading to some complications.
While few will find the new model seductive, OpenAI says it works faster than previous versions and can reason across text, audio and video in real time.
GPT-4o — the “o” stands for “omni” — will power OpenAI’s popular ChatGPT chatbot and will be available to users, including those who use the free version, in the coming weeks, the company announced during a short live-streamed update. CEO Sam Altman, who was not present for the event, simply posted the word “her” on the social media site X.
During a demonstration with Chief Technology Officer Mira Murati and other executives, the AI bot chatted in real time, adding emotion — specifically “more drama” — to its voice as requested. It also helped walk through the steps needed to solve a simple math equation without first spitting out the answer, and assisted with a more complex software coding problem on a computer screen.
It also took a stab at gauging a person’s emotional state by looking at a selfie video of his face (deciding he was happy since he was smiling) and translated between English and Italian to show how it could help people who speak different languages hold a conversation.
Gartner analyst Chirag Dekate said the update, which lasted less than 30 minutes, gave the impression OpenAI is playing catch-up to larger rivals.
“Many of the demos and capabilities showcased by OpenAI seemed familiar because we had seen advanced versions of these demos showcased by Google in their Gemini 1.5 Pro launch,” Dekate said. “While OpenAI had a first-mover advantage last year with ChatGPT and GPT3, when compared to their peers, especially Google, we now are seeing capability gaps emerge.”
Google plans to hold its I/O developer conference on Tuesday and Wednesday, where it is expected to unveil updates to Gemini, its own AI model.