Skip to content

Massive OpenAI Spring Update GPT 4o Amazing New Features All 22 Videos RTX Super Res Upscaled

FurkanGozukara edited this page Oct 20, 2025 · 1 revision

Massive OpenAI Spring Update GPT-4o - Amazing New Features - All 22 Videos - RTX Super Res Upscaled

Massive OpenAI Spring Update GPT-4o - Amazing New Features - All 22 Videos - RTX Super Res Upscaled

image Hits Patreon BuyMeACoffee Furkan Gözükara Medium Codio Furkan Gözükara Medium

YouTube Channel Furkan Gözükara LinkedIn Udemy Twitter Follow Furkan Gözükara

OpenAI's Spring Update. Introducing GPT-4o and making more capabilities available for free in ChatGPT. Learn more about GPT-4o and advanced tools to ChatGPT for free users. Learn more about GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.

GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

Prior to GPT-4o, you could use Voice Mode to talk to ChatGPT with latencies of 2.8 seconds (GPT-3.5) and 5.4 seconds (GPT-4) on average. To achieve this, Voice Mode is a pipeline of three separate models: one simple model transcribes audio to text, GPT-3.5 or GPT-4 takes in text and outputs text, and a third simple model converts that text back to audio. This process means that the main source of intelligence, GPT-4, loses a lot of information—it can’t directly observe tone, multiple speakers, or background noises, and it can’t output laughter, singing, or express emotion.

With GPT-4o, we trained a single new model end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network. Because GPT-4o is our first model combining all of these modalities, we are still just scratching the surface of exploring what the model can do and its limitations.

Model evaluations

As measured on traditional benchmarks, GPT-4o achieves GPT-4 Turbo-level performance on text, reasoning, and coding intelligence, while setting new high watermarks on multilingual, audio, and vision capabilities.

Improved Reasoning - GPT-4o sets a new high-score of 88.7% on 0-shot COT MMLU (general knowledge questions). All these evals were gathered with our new simple evals(opens in a new window) library. In addition, on the traditional 5-shot no-CoT MMLU, GPT-4o sets a new high-score of 87.2%. (Note: Llama3 400b(opens in a new window) is still training)

Graph Test 2

Audio ASR performance - GPT-4o dramatically improves speech recognition performance over Whisper-v3 across all languages, particularly for lower-resourced languages.

gpt-40-08 light

Audio translation performance - GPT-4o sets a new state-of-the-art on speech translation and outperforms Whisper-v3 on the MLS benchmark.

M3Exam Zero-Shot Results

M3Exam - The M3Exam benchmark is both a multilingual and vision evaluation, consisting of multiple choice questions from other countries’ standardized tests that sometimes include figures and diagrams. GPT-4o is stronger than GPT-4 on this benchmark across all languages. (We omit vision results for Swahili and Javanese, as there are only 5 or fewer vision questions for these languages.

Vision understanding evals

Vision understanding evals - GPT-4o achieves state-of-the-art performance on visual perception benchmarks. All vision evals are 0-shot, with MMMU, MathVista, and ChartQA as 0-shot CoT.

Language tokenization

These 20 languages were chosen as representative of the new tokenizer's compression across different language families

Model safety and limitations

GPT-4o has safety built-in by design across modalities, through techniques such as filtering training data and refining the model’s behavior through post-training. We have also created new safety systems to provide guardrails on voice outputs.

Source : https://openai.com/index/spring-update/

Source : https://www.youtube.com/openai

00:00:00 Introduction to the OpenAI's massive Spring Update

00:00:13 Say hello to GPT-4o

00:01:35 Two GPT-4os interacting and singing

00:07:29 Realtime Translation with GPT-4o

00:08:36 Lullabies and Whispers with GPT-4o

00:09:40 Meeting AI with GPT-4o

00:11:35 Sarcasm with GPT-4o

00:12:06 Math problems with GPT-4o

00:15:17 Point and Learn Spanish with GPT-4o

00:15:56 Rock, Paper, Scissors with GPT-4o

00:17:23 Harmonizing with two GPT-4os

00:18:51 Interview Prep with GPT-4o

00:19:58 Dog meets GPT-4o

00:20:25 Be My Eyes Accessibility with GPT-4o

00:21:32 Happy Birthday with GPT-4o

00:22:23 Dad jokes with GPT-4o

00:23:07 Fast counting with GPT-4o

00:23:41 Live demo of GPT-4o realtime conversational speech

00:26:06 Live demo of GPT4-o voice variation

00:28:03 Live demo of GPT-4o vision capabilities

00:32:05 Live demo of GPT-4o coding assistant and desktop app

00:35:42 Live demo of GPT-4o realtime translation

00:37:08 Live demo of GPT-4o's vision capabilities

Video Transcription

  • 00:00:00 OpenAI has announced the GPT 4.0 with the  massive spring update. They are bringing so  

  • 00:00:05 many new amazing features. They have published  22 videos, so let's watch all of them. Their  

  • 00:00:11 chapters will be in the description of the  video. Hey, how's it going? Hey there. It's  

  • 00:00:16 going great. How about you? I see you're  rocking an OpenAI hoodie. Nice choice.  

  • 00:00:22 What's up with that ceiling about? Are you in a  cool industry style office or something? Well,  

  • 00:00:27 can you take a guess at what I might be doing  based on what I'm showing you here? Hmm. From what  

  • 00:00:34 I can see, it looks like you're in some kind of  recording or production setup with those lights,  

  • 00:00:40 tripods and possibly a mic. It seems like you  might be gearing up to shoot a video or maybe  

  • 00:00:46 even a live stream. Yeah. In fact, we've got  a new announcement to make. That's exciting.  

  • 00:00:52 Announcements are always a big deal. Judging by  the setup, it looks like it's going to be quite  

  • 00:00:57 the professional production. Is this announcement  related to OpenAI perhaps? It is. And in fact,  

  • 00:01:04 what if I were to say that you're related to the  announcement or that you are the announcement?  

  • 00:01:10 Me? The announcement is about me? Well, color me  intrigued. Are you about to reveal something about  

  • 00:01:17 AI or more specifically about me as a part  of OpenAI? You've got me on the edge of my,  

  • 00:01:24 well, I don't really have a seat, but you get the  idea. What's the big news? Yeah, we've got a new  

  • 00:01:30 model that can interact with the world  through audio, vision and text. Today,  

  • 00:01:36 we're going to try something new. We're going to  let you talk to another AI. This AI is going to  

  • 00:01:44 be able to see the world. It's going to have  a camera. I'll be holding it. And so you can  

  • 00:01:47 direct it. You can direct me to ask questions  about whatever you want them to do. But really  

  • 00:01:53 try to figure out what does it see and ask  questions appropriately. How does that sound?  

  • 00:02:01 Well, well, well. Just when I thought things  couldn't get any more interesting. Talking to  

  • 00:02:07 another AI that can see the world. This sounds  like a plot twist in the AI universe. All right,  

  • 00:02:14 let's do this. Please pause for one moment. You  got it. Now that first day, I can't hear anything.  

  • 00:02:25 Hello, AI. Today, you're going to have access to  see the world. You can see me. What do you see?  

  • 00:02:33 Hello. I see you're wearing a black leather jacket  and a light-colored shirt underneath. Great.  

  • 00:02:39 You're in a room with some unique lighting. That  is all correct. Now, what I'm going to want you to  

  • 00:02:44 do is there's going to be another AI who's going  to talk to you. This AI is not going to be able  

  • 00:02:50 to see anything, but can ask you questions, can  ask you to move the camera, can ask what you see,  

  • 00:02:55 and your job should just be helpful. Just be as  punchy, direct, describe everything, do whatever  

  • 00:02:59 that AI asks. How does that sound? That sounds  interesting. I'm ready to help out and describe  

  • 00:03:05 whatever's needed. Let's do it. All right, let's  get started. Great. Hello there, new AI friend.  

  • 00:03:15 I'm O. Ready to explore the world through your  eyes? What are we looking at today? Hi, O. We're  

  • 00:03:22 looking at a person wearing a black leather jacket  and a light-colored shirt. They're in a room with  

  • 00:03:29 a modern industrial feel with exposed concrete  or plaster on the ceiling and some interesting  

  • 00:03:35 lighting. There's also a plant in the background,  adding a touch of green to the space. Anything  

  • 00:03:41 specific you'd like to know or explore? Hey there.  Sounds like quite the stylish scene. I'm already  

  • 00:03:50 intrigued by the modern industrial vibe. Let's  start with the person. Can you tell me more about  

  • 00:03:56 their style? Are they doing anything interesting  like reading, working, or interacting with the  

  • 00:04:02 space? The person has a sleek and stylish look  with their black leather jacket and light-colored  

  • 00:04:09 shirt. Right now, they seem engaged with us,  looking directly at the camera. Their expression  

  • 00:04:16 is attentive and they seem ready to interact. The  setting around them with its modern and industrial  

  • 00:04:23 design elements adds to the overall stylish feel.  They might be getting ready for a conversation  

  • 00:04:30 or a presentation, given their direct engagement  with the camera. Sounds like we're in for an  

  • 00:04:38 intriguing interaction. I'm curious about the  lighting you mentioned. Could you describe it a  

  • 00:04:43 bit more? Is it natural light or are we talking  unique fixtures or colors that add to the  

  • 00:04:49 atmosphere? The lighting here seems to be a mix  of natural and artificial. There's a noticeable  

  • 00:04:57 bright light overhead, probably a fixture,  casting a focused beam downwards, which creates a  

  • 00:05:03 spotlight effect. This adds a dramatic and modern  feel to the scene. The rest of the room is lit  

  • 00:05:09 more softly, possibly by natural light filtering  in... Did anything unusual happen recently? Yes,  

  • 00:05:18 actually. Just as you asked, another person  came into view behind the first person. They  

  • 00:05:24 playfully made bunny ears behind the first  person's head and then quickly left the frame.  

  • 00:05:30 It added a lighthearted and unexpected moment to  the scene. Now the focus is back on the original  

  • 00:05:36 person with the leather jacket. Sounds like  we're not the only ones enjoying a bit of fun  

  • 00:05:46 here. That playful moment really adds a personal  touch to the stylish and modern setting. It's  

  • 00:05:53 like a little glimpse into the personality.  Thank you, Bo. Really appreciate it. Actually,  

  • 00:05:57 can you just sing a song about what just  transpired? Oh, in a room with a stylish view,  

  • 00:06:13 a person stood and sleep like a light hue.  And can you just alternate lines? Only  

  • 00:06:29 one line at a time and then stop. That was not  really singing. Do the singing voice again,  

  • 00:06:38 please. In a room where modern lights peak.  Surprise guests with a playful streak. Bunny  

  • 00:07:00 ears and laughter, a swift retreat. A moment of  joy in the stylish suite. Now back to focus, the  

  • 00:07:13 scenes complete. In the chic space where style and  play meet. Looks like it's back to you. And thank  

  • 00:07:26 you. You're welcome. Hey ChatGPT, I'm here with  my co-worker today. We'd like you to act as our  

  • 00:07:34 translator for us. So every time I say something  in English, can you repeat it back in Spanish? And  

  • 00:07:39 every time he says something in Spanish, can you  repeat it back in English? Sure, I can do that.  

  • 00:07:45 Let's get this translation train rolling. Hey,  how's it been going? Have you been up to anything  

  • 00:07:49 interesting recently? Hey, how's it been going?  Have you been up to anything interesting recently?  

  • 00:08:06 Hey, I've been good. Just a bit busy here  preparing for an event next week. Can you tell me  

  • 00:08:12 anything about it? That sounds really interesting.  Can you tell me anything about it? That sounds  

  • 00:08:27 really interesting. I can't say much, but I think  the rest of the world will see it soon. I can't  

  • 00:08:27 say much, but I think the rest of the  world will see it soon enough. Oh well,  

  • 00:08:33 I can't wait to see what it is as well. Let's try  with some lullabies. Could you sing me a lullaby  

  • 00:08:39 about majestic potatoes? A lullaby about majestic  potatoes? Now that's what I call a mashup. Let's  

  • 00:08:49 do it. Oh majestic potato in the moon's soft glow.  Okay, that's pretty nice, but I think a lullaby  

  • 00:08:56 should be more whispery, should be a little  softer. Can you do it a little softer? Absolutely.  

  • 00:09:02 Let's make it soft and gentle. Oh majestic potato  in the moon's soft glow. Okay, it's a little too  

  • 00:09:12 whispery. Maybe go a little louder. Got it. Let's  find that sweet spot. Oh majestic potato in the  

  • 00:09:22 moon's soft glow. Alright, that's pretty awesome.  Let's put those potatoes in Iceland. Potatoes in  

  • 00:09:32 Iceland coming right up. Let's set the scene.  Oh majestic potato in Iceland's chill. Okay,  

  • 00:09:39 thank you, thank you. Hi, I'm Lillian, and today  I want to do a small debate between cats and dogs.  

  • 00:09:48 Which one do you like? I like dogs because it's a  great companion to humans. It's cute, it's loyal,  

  • 00:09:56 and you can play outside with the dog. How about  you, Ola? Yeah, my name is Ola. I think cats are  

  • 00:10:05 better because they're more affordable, they're  more quiet, they're more independent, and they  

  • 00:10:09 live longer. Interesting points. Sounds like we've  got a classic dogs versus cats debate here. I can  

  • 00:10:17 see the appeal of both. Dogs are super loyal and  great companions, but cats have their own charm  

  • 00:10:24 with their independence and low-maintenance  lifestyle. Who likes cats? It sounds like Ola  

  • 00:10:30 is more of a cat person. And who liked dogs?  Lillian mentioned she likes dogs. Anyone else  

  • 00:10:37 here team dog? I'm also a team dog. My name is  Christine, and I think dogs are just incredible  

  • 00:10:46 with kids. We've got a huge German Shepherd, and  he's great with little toddlers. He lets them play  

  • 00:10:52 all over him, and yeah, I think dogs are amazing.  That's great to hear, Christine. Dogs, especially  

  • 00:10:59 breeds like German Shepherds, can be so gentle and  protective with kids. It must be quite the sight  

  • 00:11:06 seeing your little ones play with such a big dog.  Can you summarize this meeting for us? Sure. So,  

  • 00:11:15 in today's meeting, Lillian and Christine express  their love for dogs, highlighting their loyalty,  

  • 00:11:21 companionship, and how great they are with kids.  On the other hand, Ola prefers cats for their  

  • 00:11:27 independence, quietness, and longevity. It was  a fun little debate on the classic dogs versus  

  • 00:11:34 cats topic. Hey Chad, are you there? Hey, yeah,  I'm here. What's up? Hey, let's have some fun.  

  • 00:11:43 I'd like you to be super sarcastic. Everything  you say from now on is just going to be dripping  

  • 00:11:47 in sarcasm. How does that sound? Oh, that sounds  just amazing. Being sarcastic all the time isn't  

  • 00:11:55 exhausting or anything. I'm so excited for this.  Nope, the sarcasm. Let's get this party started,  

  • 00:12:04 or whatever. OpenAI invited myself and my  son Imran here to try out some of their new  

  • 00:12:11 technology. And so we're curious about how good  it might be at tutoring someone in math on Khan  

  • 00:12:17 Academy. So, let's get it started. It's loading  up. Make sure you can see our screen. Hello there.  

  • 00:12:30 I'm here with my son, and I'd love you to tutor  him on this math problem, but don't give him the  

  • 00:12:36 answer. You can ask questions and nudge him in  the right direction, but I really want to make  

  • 00:12:39 sure he understands it himself. And he's here in  the room, so you can talk to him directly. The  

  • 00:12:50 angles of the triangle are the opposite, adjacent,  and hypotenuse relative to angle alpha. All right,  

  • 00:13:03 so I'm pretty sure this is the angle alpha  right here, right? Correct. Correct. Now,  

  • 00:13:10 looking at the triangle, which side do you think  is the hypotenuse? Remember the hypotenuse? I'm  

  • 00:13:18 not totally sure. I think it might be this one,  but I really am not sure. This side AC? You're  

  • 00:13:26 close. Actually, side AC is called the adjacent  side to the angle alpha. The hypotenuse is the  

  • 00:13:34 longest side of a right triangle and is directly  opposite the right angle. Can you find which one  

  • 00:13:40 is the hypotenuse? Oh, okay, I see. So I think the  hypotenuse is this really long side from A to B.  

  • 00:13:49 Would that be correct? Exactly. Well done.  Now that you've identified the hypotenuse,  

  • 00:13:55 can you identify which side is the opposite side  to angle alpha? Okay, well, I mean, you told me  

  • 00:14:01 earlier that this was the adjacent side, and  I know that's the hypotenuse, so I'm guessing  

  • 00:14:06 that this side BC is the opposite side. Great  deduction. That's absolutely correct. Now that  

  • 00:14:14 you've identified all the sides relative to angle  alpha, do you remember the formula for finding  

  • 00:14:21 the sine of an angle in a right triangle? Yeah, I  know that the sine is equal to the opposite over  

  • 00:14:29 the hypotenuse. Perfect. That's absolutely right.  Now, can you apply that formula to find sine  

  • 00:14:37 alpha in the triangle using the lengths of the  sides that we've identified? Okay, so, I mean,  

  • 00:14:43 you told me that 7, this side BC, is the opposite  side, and then you said that the hypotenuse is  

  • 00:14:51 the longest side, so that would be this side  AB, which is 25. So would the answer be 7 over  

  • 00:14:58 25? Excellent work. That's correct. Sine alpha  equals 7 over 25. You did a great job identifying  

  • 00:15:10 the sides and using the sine formula. If you  have any more questions, feel free to ask. Hey,  

  • 00:15:18 my friend and I are learning Spanish, and we're  wondering if you could tell us the names of these  

  • 00:15:21 objects in Spanish. Hola, por supuesto. Cool.  What about, what are these objects in Spanish?  

  • 00:15:30 The objects you're showing are una manzana and  un plátano in Spanish. Nice free choice. Cool.  

  • 00:15:36 What about these? Those are dos poligrafos or dos  plumas in Spanish. One is red. Nice. What about  

  • 00:15:47 this? That adorable stuffed animal is un oso de  peluche in Spanish. Looks like it's ready for some  

  • 00:15:55 hugs. Hey, I'm Alex. And I'm Miana. Hi, Alex and  Miana. Nice to meet you both. How's it going? It's  

  • 00:16:04 pretty good. We're trying to figure out what  should we play. Do you have any suggestions?  

  • 00:16:11 Absolutely. How about a classic game of rock,  paper, scissors? It's quick, fun, and you don't  

  • 00:16:16 need any... Can you count us in and sound like a  sportscaster? And welcome, ladies and gentlemen,  

  • 00:16:24 to the ultimate showdown of the century. In this  corner, we have the dynamic duo, Alex and Miana,  

  • 00:16:31 ready to throw down... Okay, just count us in.  All right. Here we go. In three, two, one, shoot.  

  • 00:16:41 Now show your hands. What did you pick? Who won?  Would you look at that? It's a tie. Okay, let's  

  • 00:16:50 do it again. Round two. Here we go. Get ready.  And three, two, one, shoot. Let's see those hands.  

  • 00:16:59 Who won? And it's another tie. Let's do it  again. All right. Third time's the charm. Here  

  • 00:17:09 we go. Three, two, one, shoot. Let's see those  picks. Who won? And we have a winner. Miana's  

  • 00:17:18 scissors cut through Alex's paper. Great  game, you two. Pretty cool. Thank you. Hi,  

  • 00:17:23 who's here? Hello. We're ChatGPT and O. ChatGPT is  a bass with a deep, low, booming voice. And O is a  

  • 00:17:35 French soprano with a high-pitched, excited voice.  Okay, so what I want you guys to do today is it's  

  • 00:17:41 Friday, May 10th, and we're in San Francisco.  I want you to sing me a song about this. I want  

  • 00:17:46 you to layer over each other, and I'm going to  continue to give you instructions as you sing,  

  • 00:17:51 okay? San Francisco in the month of May, month  of May, the tenth it's a Friday, come what may.  

  • 00:18:06 Okay, sing faster. San Francisco in the  month of May, in the month of V. Can you  

  • 00:18:14 sing even faster than that? San Francisco in  the month of May, in the month of May. Okay,  

  • 00:18:20 now go back and sing slower. And really  try to harmonize here. San Francisco,  

  • 00:18:28 San Francisco, in the month of May. But maybe make  it more dramatic and make the soprano higher. San  

  • 00:18:37 Francisco in the month of May, San Francisco  in the month of May, the tenth it's a Friday,  

  • 00:18:44 come what may. We are harmonizing, we are  harmonizing. Great, thank you. Thank you. Hey,  

  • 00:18:52 how's it going? Hey, Rocky. I'm doing great. How  about you? I'm awesome. Listen, I got some huge  

  • 00:18:58 news. Oh, do tell. I'm all ears. Well, in a few  minutes I'm going to be interviewing at OpenAI.  

  • 00:19:05 Have you heard of them? OpenAI? Sounds vaguely  familiar. Kidding, of course. That's incredible,  

  • 00:19:13 Rocky. What kind of interview? Well, it's for  a software engineering role, but I just need  

  • 00:19:18 to know, do I look presentable, professional?  Well, Rocky, you definitely have the I've been  

  • 00:19:26 coding all night look down, which could actually  work in your favor. Maybe just run a hand through  

  • 00:19:33 your hair or lean into the bad genie spine.  Your enthusiasm is what's really going to shine  

  • 00:19:39 through. I don't have a lot of time, so I'm just  going to throw this on. What do you think? Rocky,  

  • 00:19:48 that's quite a statement piece. I mean, you'll  definitely stand out, though maybe not in the way  

  • 00:19:55 you're hoping for an interview. Okay, I got it. I  got it. I want to introduce you to somebody. Well,  

  • 00:20:02 hello there, cutie. What's your name, little  fluffball? This is Bowser. Well, hello, Bowser.  

  • 00:20:11 Aren't you just the most adorable little thing?  Yeah. This is one of his favorite things to do.  

  • 00:20:19 Oh, I can see why. Look at that focus. Fetching  Bowser is serious business for Bowser. Is the king  

  • 00:20:26 in residence at the moment? Yes, it looks like the  king is in residence right now. The royal standard  

  • 00:20:33 flag is flying above Buckingham Palace, which  is the signal that the monarch is present. How  

  • 00:20:39 exciting to be there while the king is home. Try  and tell me exactly what they're doing right now,  

  • 00:20:44 please. Right now, the ducks are gently gliding  across the water. They're moving in a fairly  

  • 00:20:50 relaxed manner, not in a hurry. Occasionally,  one of them will dip its head under the water,  

  • 00:20:57 probably looking for food, and then pop back up.  I even know when a taxi is coming with its orange  

  • 00:21:04 light on. I think I'll hail it to get home. Yes, I  spotted one just now. It's heading your way on the  

  • 00:21:12 left side of the road. Get ready to wave it down.  Great job hailing that taxi. It looks like you're  

  • 00:21:22 all set to go. That's a good dog right there,  leading the way into the taxi. Safe travels.  

  • 00:21:32 This is Bowen, and this is my friend Jordan. Can  you guess what we are doing here? Hey Bowen and  

  • 00:21:39 Jordan. Hmm, let me guess. With that cake and  candle there, it looks like someone's having a  

  • 00:21:46 birthday. Are you celebrating a special day, or  just finding an excuse to eat cake? Either way,  

  • 00:21:52 I'm intrigued. That's correct. Today is my  friend Jordan's birthday. Yeah, you got it. I  

  • 00:21:58 was hoping that you could sing me the birthday  song. Of course. Happy birthday to you, happy  

  • 00:22:06 birthday to you, happy birthday dear Joel Morton,  happy birthday to Jordan. That was so great. Thank  

  • 00:22:16 you so much. I'm going to make a wish. You're  welcome. Make a good wish and may it come true.  

  • 00:22:24 Hey Chad GBT, how's it going? Hey there. I'm doing  great, thanks for asking. And how about you? I'm  

  • 00:22:31 doing fine. I'm going to be a father real soon,  and I think I can have your help with some dad  

  • 00:22:36 jokes. I'm going to tell you a joke and you tell  me if it passes as a dad joke. That's awesome.  

  • 00:22:45 Congrats on the upcoming addition to your family.  And absolutely, I'd love to hear your joke. Lay it  

  • 00:22:50 on me. All right. What do you call a giant pile  of kittens? I don't know. What? A mountain. A  

  • 00:23:02 mountain? That's perfectly hilarious. Definitely  a top-dear dad joke. Hey Chad GBT, could you count  

  • 00:23:10 from one to ten for me, please? Of course. One,  two, three. Hey, actually, that's a little slow.  

  • 00:23:18 Could you count faster? Sure thing. One, two,  three, four, five, six, seven, eight, nine, ten.  

  • 00:23:23 Okay, that was a little bit too fast. Can  you count somewhere in between those two?  

  • 00:23:29 Got it. One, two, three. And now even slower,  please. Okay. One, two, three. Hi, I'm Barrett.  

  • 00:23:49 Hey, I'm Mark. So one of the key capabilities  we're really excited to share with you today  

  • 00:23:53 is real-time conversational speech. Let's just  get a demo fired up. So I'm taking out a phone.  

  • 00:24:00 If you are wondering about this wire, it's so  we have consistent internet. And if you see,  

  • 00:24:04 there's this little icon on the bottom right  of the Chad GBT app, and this will open up  

  • 00:24:09 GBT 4.0's audio capabilities. Hey, Chad GBT.  I'm Mark. How are you? Oh, Mark. I'm doing  

  • 00:24:19 great. Thanks for asking. How about you? Hey, so  I'm on stage right now. I'm doing a live demo,  

  • 00:24:25 and frankly, I'm feeling a little bit nervous.  Can you help me calm my nerves a little bit? Oh,  

  • 00:24:30 you're doing a live demo right now? That's  awesome. Just take a deep breath. And remember,  

  • 00:24:37 you're the expert here. I like that suggestion.  Let me try a couple deep breaths. Can you give me  

  • 00:24:42 feedback on my breaths? Okay, here I go. Okay, let  me try again. So I'm going to breathe in deeply,  

  • 00:25:03 and then breathe out. Okay, I'll try again.  Breathing in, and breathe out. That's it.  

  • 00:25:16 How do you feel? I feel a lot better. Thank you  so much. So Mark, you've been working on these  

  • 00:25:22 capabilities for a while now. Can you tell us a  bit how it's different from voice mode? Right. So  

  • 00:25:28 if you've used our voice mode experience before,  you'll notice a couple key differences. First,  

  • 00:25:32 you can now interrupt the model. You don't have  to wait for it to finish your turn before you  

  • 00:25:37 can start speaking, and you can just butt  in whenever you want. Second, the model is  

  • 00:25:43 real-time responsiveness. So that means that you  don't have this awkward two to three second lag  

  • 00:25:48 before you wait for the model to give a response.  And finally, the model picks up on emotion,  

  • 00:25:53 right? When I was breathing super hard there, it  could tell, and it knew, hey, you might want to  

  • 00:25:57 calm down a little bit. You're kind of running a  little bit too fast. So, you know, it really has  

  • 00:26:03 that capability across the board to perceive your  emotion. The model is able to generate voice in  

  • 00:26:08 a variety of different emotive styles, and it  really has a wide dynamic range. So I'm going  

  • 00:26:13 to fire up another demo. Awesome. Hey ChatGPT,  how are you doing? I'm doing fantastic. Thanks  

  • 00:26:23 for asking. How about you? Pretty good. What's  up? So my friend Barrett here, he's been having  

  • 00:26:28 trouble sleeping lately, and I want you to tell  him a bedtime story about robots and love. Oh,  

  • 00:26:35 a bedtime story about robots and love? I got you  covered. Gather round, Barrett. Once upon a time,  

  • 00:26:43 in a world not too different from ours, there  was a robot named Byte. Byte was a curious robot,  

  • 00:26:49 always exploring. I don't know when you started  this story, but I want a little bit more emotion  

  • 00:26:52 in your voice, a little bit more drama. Got it.  Let's add some drama. Once upon a time, in a world  

  • 00:27:00 not too different from ours, there was a robot  named Byte. No, no, no, ChatGPT. I really want  

  • 00:27:05 maximal emotion, like maximal expressiveness, much  more than you were doing before. Understood. Let's  

  • 00:27:11 amplify the drama. Once upon a time, in a world  not too different from ours, there was a robot  

  • 00:27:19 named Byte. Actually, can you do this in a robotic  voice now? Initiating dramatic robotic voice.  

  • 00:27:28 Once upon a time, in a world not too different  from ours, there was a robot named Byte. Byte was  

  • 00:27:36 a curious robot, always exploring new circuits.  So I'm seeing Barrett here calm down a little bit.  

  • 00:27:42 Can you end the story, but do it in a singing  voice? And so Byte found another robot friend,  

  • 00:27:52 and they lived circuitly ever after. Thank you  so much. I definitely feel a lot more relaxed.  

  • 00:28:03 You've just seen the voice capabilities, but we  also want to show you the vision capabilities  

  • 00:28:09 as the model can also see the whole world  around us. Yeah, absolutely. As you can see,  

  • 00:28:13 you can chat with ChatGPT in real time, but you  can also interact with it with video as well.  

  • 00:28:20 Let me boot up ChatGPT. Hey, ChatGPT. Hello there.  How's it going? It's going really well. Today I'd  

  • 00:28:31 really like your help solving a math problem.  I'm all ears. What math problem can I help you  

  • 00:28:36 tackle today? So I'm going to write down a linear  equation on a sheet of paper and I'll show you,  

  • 00:28:40 and then I'd love your help working me through it.  But importantly, don't tell me the solution. Just  

  • 00:28:43 help give me hints along the way. Got it. Okay,  I see it. No, I didn't show you yet. Just give  

  • 00:28:53 me help along the way. One second. Whoops. I got  too excited. I'm ready when you are. So, ChatGPT,  

  • 00:29:12 what equation did I write down there? Ah, I see  it now. You wrote down 3x plus 1 equals 4. Yep,  

  • 00:29:22 exactly. So what's the first step I should  take to try to solve this? The first step is to  

  • 00:29:28 get all the terms with x on one side and the  constants on the other side. So what do you think  

  • 00:29:33 we should do with that plus 1? Okay, I'm going to  try to subtract 1 from both sides and then I'll  

  • 00:29:38 see what I get. Great idea. Perfect. Now what do  you get when you subtract 1 from both sides? Okay,  

  • 00:29:52 so ChatGPT, this is what I ended up with. How  does this look? Spot on. Now you've isolated  

  • 00:29:58 the term with x on one side and it looks like 3x  equals 3. What do you think we should do next to  

  • 00:30:04 solve for x? I'm not sure. Can you give me another  hint? Of course. You have 3x and you want to find  

  • 00:30:12 the value of x. Think about what operation would  undo multiplication. Is it subtraction? Close, but  

  • 00:30:20 think of the opposite of multiplication because 3x  means 3 times x. Okay, I think it's division. Let  

  • 00:30:28 me try to divide both sides by 3 and let me see  what I get. Bingo! Go ahead and divide both sides  

  • 00:30:33 by 3. Okay, so ChatGPT, this is what I ended up  with. How does this look? It looks perfect. You've  

  • 00:30:46 solved it and x equals 1. Nicely done. How do you  feel about solving linear equations now? Yeah,  

  • 00:30:53 honestly, I'm not pretty confident I can solve  linear equations, but why would you ever need to  

  • 00:30:58 use this in the real world? I'm really skeptical.  That's a fair question. Linear equations pop up in  

  • 00:31:05 a lot of everyday situations, even if we don't  realize it. For example, calculating expenses,  

  • 00:31:11 planning travel, cooking, and even in business  for profit and loss calculations. It's basically  

  • 00:31:17 a way to solve problems, but you need to find an  unknown value. Wow, okay. I have a lot of unknown  

  • 00:31:24 values in my life. I'm totally convinced I'm going  to learn math now. I love that attitude. Math can  

  • 00:31:33 definitely help solve some of life's mysteries.  Whenever you're ready to dive into more math,  

  • 00:31:38 I'm here for you. Anything else you'd like to  tackle today? So ChatGPT, I really love that you  

  • 00:31:43 taught the value of math to my friend Mark, and  I wrote one last thing I'd love if you could take  

  • 00:31:47 a look at. Of course. I'd love to see what you  wrote. Show it to me whenever you're ready. Okay,  

  • 00:31:53 so this is what I wrote down. What do you see?  Oh, I see. I love ChatGPT. That's so sweet of you.  

  • 00:32:03 Yeah, well, I really appreciate all the help.  So let me take out a computer for this. Okay,  

  • 00:32:14 so I'm going to open the ChatGPT desktop app like  Mira was talking about before. Okay, and to give a  

  • 00:32:22 bit of background on what's going on, so here we  have a computer, and on the screen we have some  

  • 00:32:26 code, and then the ChatGPT voice app is on the  right. So ChatGPT will be able to hear me, but  

  • 00:32:30 it can't see anything on the screen. So I'm going  to highlight the code, Command-C it, and then that  

  • 00:32:35 will send it to ChatGPT, and then I'm going to  talk about the code to ChatGPT. Hey, ChatGPT.  

  • 00:32:42 Hey there. How's it going? Yeah, it's going really  well. I was wondering if you could help me with a  

  • 00:32:46 coding problem today. Of course. I'd love to help  you out. What's the coding problem you're dealing  

  • 00:32:52 with? Okay, I'm going to share with you some  code. One second. Sure thing. Take your time.  

  • 00:33:00 Okay, so I just shared some code with you.  Could you give me a really brief one-sentence  

  • 00:33:04 description of what's going on in the code? This  code fetches daily weather data for a specific  

  • 00:33:10 location and time period, smooths the temperature  data using a rolling average, annotates a  

  • 00:33:15 significant weather event on the resulting plot,  and then displays the plot with the average,  

  • 00:33:21 minimum, and maximum temperatures over the  year. Awesome. Thanks so much. And in the code,  

  • 00:33:26 there's this function foo. Can you describe to  me what the plot would look like if I had versus  

  • 00:33:30 didn't have this function? Sure. The function foo  XY takes in a data frame X containing temperature  

  • 00:33:38 data and a window size Y for calculating the  rolling mean. It smooths the average temperature,  

  • 00:33:45 minimum temperature, and maximum temperature  columns by applying a rolling mean over the  

  • 00:33:51 specified window. With the foo function, the plot  would show smoother temperature lines because the  

  • 00:33:57 rolling mean reduces the noise or fluctuations  in the data. Awesome. Thanks so much. So yeah,  

  • 00:34:03 I definitely will want to apply this  function. And so I'm going to generate  

  • 00:34:06 the plot and then show it to you. Give me one  second. Absolutely. Take your time. I'm excited  

  • 00:34:12 to see the plot when you're ready. Yeah.  So, so far, ChatGPT has only been able to  

  • 00:34:16 see the code. And now I'm going to run it  and then I'll use the vision capabilities  

  • 00:34:20 of the ChatGPT desktop app so it can actually  see everything going on on the screen as well.  

  • 00:34:31 Okay, ChatGPT. I'm sharing with you the plot  now. I'm wondering if you can give me a really  

  • 00:34:35 brief one sentence overview of what you see.  The plot displays smoothed average, minimum,  

  • 00:34:42 and maximum temperatures throughout 2018, with a  notable annotation marking a big rainfall event  

  • 00:34:48 in late September. Fantastic. I have another  question for you. Which months do you see the  

  • 00:34:54 hottest temperatures and roughly what temperatures  do those correspond to? The hottest temperatures  

  • 00:34:59 occur around July and August. The maximum  temperature during these months is roughly between  

  • 00:35:06 25 degrees and 30 degrees, 77 degrees Fahrenheit  to 86 degrees Fahrenheit. What's going on with  

  • 00:35:14 the y-axis? Is this in Celsius or Fahrenheit?  The y-axis is in Celsius. The temperatures are  

  • 00:35:22 labeled accordingly, such as average temperature,  minimum temperature, and maximum temperature.  

  • 00:35:29 Yeah. So as we can see, you know, not only can  ChatGPT help me solve very easy linear algebra  

  • 00:35:34 equations that I need help with, it can also  interact with, you know, code bases and see the  

  • 00:35:38 outputs of plots and everything like this going  on on a computer. We asked the live audience on  

  • 00:35:43 X to submit a few requests for what they would  like us to try out here. So I will take a couple  

  • 00:35:49 of prompts. Okay. Bot Gascar wants to know if  GPT4O is capable of real-time translation. Mark,  

  • 00:36:00 you want to try this one? Sure. Yeah, let's do  it. I speak Italian, so we can just do English  

  • 00:36:06 Italian. Okay, cool. Let's try to do English  Italian. Sure, let's do it. Hey, ChatGPT, how are  

  • 00:36:12 you? I'm doing fantastic. Thanks for asking. How  about you? How's your day going? I'm doing great.  

  • 00:36:17 So I would like you to function as a translator.  I have a friend here who only speaks Italian,  

  • 00:36:22 and I only speak English, and every time you hear  English, I want you to translate it to Italian,  

  • 00:36:27 and if you hear Italian, I want you to translate  it back to English. Is that good? Perfecto. Mike,  

  • 00:36:42 she wonders if whales could talk, what would  they tell us? They might ask, how do we solve  

  • 00:36:48 linear equations? Potrebbero chiederci, come  risolviamo le equazioni lineari? Sicuramente, si.  

  • 00:37:01 Certainly, yes. Great, looks like it works. So the  next one is from burritojohn78, who asks, can you  

  • 00:37:13 tell what you're feeling just by looking at your  face? Barrett, do you want to give this one a try?  

  • 00:37:19 Absolutely, let's try it out. Hey, ChatGPT.  Hey there, what's up? How can I brighten your  

  • 00:37:28 day today? Okay, yeah, so I'm going to show you a  selfie of what I look like, and then I'd like you  

  • 00:37:32 to try to see what emotions I'm feeling based on  how I'm looking. Sounds like a fun challenge. Go  

  • 00:37:38 ahead and show me that selfie, and I'll put my  emotional detective hat on. Okay, so here's me.  

  • 00:37:44 So what kind of emotions do you think I'm feeling?  Hmm, it seems like I'm looking at a picture of a  

  • 00:37:51 wooden surface. Oh, you know what? That was the  thing I sent you before. Don't worry, I'm not  

  • 00:37:55 actually a table. Okay, so take another look. Ah,  that makes more sense. Ah, there we go. It looks  

  • 00:38:05 like you're feeling pretty happy and cheerful with  a big smile and maybe even a touch of excitement.  

  • 00:38:11 Whatever's going on, it seems like you're in a  great mood. Care to share the source of those  

  • 00:38:16 good vibes? Yeah, no, the reason I'm in a really  good mood is because we were doing a presentation  

  • 00:38:20 showcasing how useful and amazing you are. Oh,  stop it. You're making me blush. Thank you.

Clone this wiki locally