May 16, 2024

OpenAI Drama, GPT-4o is VERY good & Google I/O AI Insanity

OpenAI’s co-founder Ilya Sutskever is out and big questions are swirling around AI alignment, GPT-4o is OpenAI’s brand new model & it’s VERY good and Google bring’s a lot (maybe too much) AI to it’s annual Google I/O event. We got it...

The player is loading ...
AI For Humans

OpenAI’s co-founder Ilya Sutskever is out and big questions are swirling around AI alignment, GPT-4o is OpenAI’s brand new model & it’s VERY good and Google bring’s a lot (maybe too much) AI to it’s annual Google I/O event. We got it all!

The ‘o’ in GPT-4o stands for ‘omni’ meaning that it can do audio/voice, text and video all at the same time and in today’s demo they showed off not only those remarkable capabilities but the speed at which it can do those things.

You can easily interrupt the voice model and carry on a real time conversation but even better, it picks up on the tones and emotions in your voice in a way that does indeed feel magical.

Google’'s I/O event was like all AI and we break down all the biggest announcements and give you our take on whether or not these new tools are going to set it apart from OpenAI.

And we welcome our AI co-host Lola Byte from an older time in the world of tech, herself a little too excited to be engaging with us and wonder exactly what this world is gonna look like going forward.

 

Follow us for more AI discussions, AI news updates, and AI tool reviews on X @AIForHumansShow

Join our vibrant community on TikTok @aiforhumansshow

For more info, visit our website at https://www.aiforhumans.show/

 

/// Show links ///

Ilya is out

https://www.nytimes.com/2024/05/15/business/dealbook/ilya-sutskever-sam-altman-openai.html

Sam’s statement: 

https://x.com/sama/status/1790518031640347056

Jan Leike Resigns

https://x.com/janleike/status/1790603862132596961

ILYA Lex Fridman Episode: 

https://youtu.be/13CZPWmke6A?si=sQzAzeod3mbfnZ7C

OpenAI’s GPT-4o Demo: 

https://www.youtube.com/live/DQacCB9tDaw?si=P5wqRaCAm9DsNhfd

Greg Brockman GPT-4o Demo Video

https://youtu.be/MirzFk_DSiI?si=UIa1UJeVV2Zav4pF

Sam Altman's GPT-4o Blog Post:

https://blog.samaltman.com/gpt-4o

OpenAI’s Deep Blog Post on GPT-4o (videos + new examples)

https://openai.com/index/hello-gpt-4o/

Is GPT-4o Too Horny?

https://youtu.be/vgYi3Wr7v_g?si=2LyoTO7FdmgYwX6Z

Google I/O Keynote

https://www.youtube.com/live/XEzRZ35urlk?si=DiSPR0adnMyBgRTM

Project Astra Video

https://youtu.be/nXVvvRhiGjI?si=bBpl-JKu4ZcEP0dP

Google Veo Videos

https://x.com/GoogleDeepMind/status/1790435824598716704

Astra hands on: 

https://x.com/svpino/status/1790516606923411587?s=46&t=w0Q4PuG9XdwnJWsovr5M2g

Imagen3 examples: https://x.com/doganuraldesign/status/1790463103563411642?s=46&t=w0Q4PuG9XdwnJWsovr5M2g

Google’s AI Test Kitchen

https://aitestkitchen.withgoogle.com/

 

Transcript

Audio Version Ep58
===

[00:00:00] 

kevin-host815_1_05-15-2024_094041: A, I, drama, alert, there have been major shakeups at OpenAI. Some key players have departed, but don't worry, they were only responsible for making sure the machines don't destroy all humans.

 

gavin_1_05-15-2024_094041: new GPT 4 O model has some incredible capabilities that flew under the radar. We're going to show you just how powerful and maybe kind of horny that model actually is.

 

 

 

 

kevin-host815_1_05-15-2024_094041: All that, plus we have got real world reactions to Project Astra, along with a look at their brand new image generation software.

kevin-host815_1_05-15-2024_094041: It's a wild week for AI, but we're covering it all for you right here, right meow. On AI for humans.

Mhm.

gavin_1_05-15-2024_094041: Welcome. Welcome. Welcome, everybody. This is AI for humans, your guide to the world of generative AI, we are here to demystify the wild world of artificial intelligence. And good Lord, this was a week, we have a lot of stuff to get [00:01:00] through.

gavin_1_05-15-2024_094041: Today, we have a very exciting show. And Kevin, I just want to see how are you? How you doing?

kevin-host815_1_05-15-2024_094041: I am well, buddy. You know this because we've been hanging out all week, covering the massive news from open AI, the two hour rodeo that was Google IO. And we've been making content all along the way. It's been a delight. Thank you to everybody who was engaging by the way. I want to take a second to say that if you like any of the content that Gavin or I have made this week, please toss us a like, a subscribe, a thumbs up, leave us a five star review on Apple podcasts.

kevin-host815_1_05-15-2024_094041: If you do, we'll read it at the end of the show. That's our guarantee to you. , but engage cause it helps us out. But Gavin, there is still massive stuff. As much as you and I have been talking about AI all week, we've still missed a ton of stuff.

gavin_1_05-15-2024_094041: that's right. And now Kevin, it is time. We have to get into it. It's time for the We are starting off with a [00:02:00] juicy, juicy piece of AI drama this week. Ilya Sutskever, a co founder of OpenAI, and formerly the chief technical officer, is out. Now, this, if you've been following the OpenAI drama like we have, like a soap opera that's been going on for months and months and months.

gavin_1_05-15-2024_094041: You remember back, uh, late last year when Sam Altman was kind of immediately fired and no one knew why, Ilya was part of that conversation to get him fired. Now, we're not really sure exactly how much, but clearly

kevin-host815_1_05-15-2024_094041: We know he sided with the board cause he voted. And then apparently Gavin was placed under the floorboards at open AI. And in between the creeks of your sneakers on the hardwood, you could hear him breathing desperately, wanting to

gavin_1_05-15-2024_094041: where is Ilya? Where is Ilya? Was the entire Internet's people were chart were chanting again and again and now we know Ilya is no longer there He's leaving today as of yesterday Sam Altman posted a long post It was a very nice post about how sad it will be to not have Ilya at the company

gavin_1_05-15-2024_094041: anymore 

kevin-host815_1_05-15-2024_094041: GPT did a great job [00:03:00] of writing his heartfelt letter, for sure.

gavin_1_05-15-2024_094041: , but this is a big loss for open AI. And I think one thing we, you and I have been following this space for a while, but Kev, just to kind of talk through what Ilya was to this company. I mean, you and I have, have listened to a lot of his podcasts. I've listened to him. The Lex Friedman episode he did is really worth listening to because he's very smart, very intelligent, goes deep on kind of his thoughts on AI.

gavin_1_05-15-2024_094041: What do you think caused all this? And why did it happen now?

kevin-host815_1_05-15-2024_094041: No one knows for certain. Just to touch on how impactful and important Ilya was to the company, Elon Musk in early days, he was the first to back open AI and put money in to the endeavor.

kevin-host815_1_05-15-2024_094041: He had said that one of his greatest accomplishments was recruiting. Ilya was stopping him from going to work, I believe it was at Google, and bringing him in at OpenAI. He is a brilliant mind, and every time people talk about this bizarre, ethereal, competitive advantage that OpenAI has, this head start on all these other companies, how do they do it?

kevin-host815_1_05-15-2024_094041: What wizardry? What sorcery? [00:04:00] A lot of people point to Ilya. He's certainly not the only one, but he was definitely there. From the very start and was a major, major player and getting to the why of it all.

kevin-host815_1_05-15-2024_094041: Some of the rumors are that it's, you know, the internet refrain is where's Ilya, as you said, but it's also what did Ilya see because all of the rumors around the time that the board rallied around ousting Sam Altman was that something might've happened. Behind closed doors. They got a glimpse into the future and they said, we need to either shut this down or slow it down or build a nice protective wall around whatever this is.

kevin-host815_1_05-15-2024_094041: Because that was Ilya's passion. He wanted to make AI safe. One of the initial, , goals of open AI. And so, A, I'm curious, do you think that's the case? And B, there was also another fairly,

gavin_1_05-15-2024_094041: was just gonna get

kevin-host815_1_05-15-2024_094041: yeah, well, let's talk about it. There was another departure which kind of speaks to that.

gavin_1_05-15-2024_094041: Yeah. So a guy named Yon likey, I think that's how you pronounce his name. I'm sorry, Yon, if that's not how you say your name or if it's Jan, it's J A N. [00:05:00] Yon, it was co leading super alignment at open AI. And he just tweeted yesterday, I resigned, which is a big deal, right?

gavin_1_05-15-2024_094041: So now you've got Ilya, who is a person that you assume was thinking of alignment as a big thing. And Jan, who's also resigned now, when we'd say the word alignment for those people out there who aren't living and breathing and stuff like we are AI alignment means the idea that how the AI is going to interact going forward.

gavin_1_05-15-2024_094041: And is it aligned with human values? So you want an AI that is aligned so that you don't have an AI that goes terminator on us, right? 

kevin-host815_1_05-15-2024_094041: You don't want it to lie to us, to mislead us, to take, , control over our autonomous vehicles and our weaponry? , there's multiple definitions for alignment, and real quick, Gavin, the, , I resign? In the wake of Ilya announcing his departure, with these long parables , these poetic departure notices, to just, in the wake of that, say, I'm out?

kevin-host815_1_05-15-2024_094041: Is that telling? Am I reading too much into what little is there?

gavin_1_05-15-2024_094041: I think part of this is the drama [00:06:00] we as humans want to see in this thing. Now, for all we know, this is a political thing. And many people said the original kind of blow up at open AI was really political and not as much about AI alignment. And so as people who follow AI, there's a lot of people out there as the, you know, the doomers of the world, the AI doomers who believe that like doing any of this AI stuff is leading us down a pathway where humanity is going to get crushed, which is a real possibility.

 

gavin_1_05-15-2024_094041: My assumption is that like those two guys are probably I'll see you guys later. Coordinated in what they want this thing to be. And this, the story is that Ilya is going to go work on something else. We don't know what that is. Maybe he's going to get scooped up by Elon and come over to XAI. I would be

kevin-host815_1_05-15-2024_094041: Or maybe he's gonna scrub all his skin off like Gattaca, and sit in a Faraday cage and wrap his knees as tight as he can into his elbows and just rock back and forth.

gavin_1_05-15-2024_094041: Kevin, maybe he'll go to the BC wilderness and the two of you guys can live in a, in a non technical world together. It could be Kevin and Ilya on a podcast going forward, and you could talk about mushroom picking and all sorts of fun things like

gavin_1_05-15-2024_094041: that. 

kevin-host815_1_05-15-2024_094041: you've painted [00:07:00] such a beautiful picture of me living off grid with my bestie, I want it now, but let's take a step back for those who are unaware of the massive OpenAI announcements that happened this week.

kevin-host815_1_05-15-2024_094041: Gavin, a miniature press conference happened on Monday where OpenAI unveiled GPT 4. 0, the Omni model, 

gavin_1_05-15-2024_094041: omni meaning omnipresent. Omni everything. Omni. The old magazine from the 1980s, a sci fi magazine, shout out to my nerd fans out there. , yeah, it's a really big deal and here's everything that was announced and our take on it.

Mhm.

[00:08:00] 

 

[00:09:00] [00:10:00] [00:11:00] 

[00:12:00] [00:13:00] [00:14:00] [00:15:00] [00:16:00] [00:17:00] [00:18:00] [00:19:00] [00:20:00] [00:21:00] [00:22:00] 

Mhm. 

kevin-host815_1_05-15-2024_094041: I really felt like we were getting a glimpse at the future. And then immediately Gavin, everybody got access to this brand new model.

kevin-host815_1_05-15-2024_094041: This GPT 4 0. But only access to one aspect of it, just the text generation capabilities. So I want to dive into that, but I also want to set the stage. There was a whole bunch that they didn't even bother talking about at all at the conference. This thing, GPT 4. 0 has some incredible capabilities.

gavin_1_05-15-2024_094041: I think the crazy thing is there's all these great videos that came out outside of their press conference. First of all, shout out to OpenAI for what I believe is doing a press conference right versus Google I. O., which we'll get into a little bit later. I think open AI, maybe it's just the size of the company, it felt intimate and it felt like directed in a really good way.

gavin_1_05-15-2024_094041: It was a half an hour long versus Google IOs, freaking whale of a press conference that you had to cut through the blubber to get to everything that was interesting. I think that it was cool to see. But then the interesting thing was they also then dropped [00:23:00] Another like 20 videos just demoing the voice stuff.

gavin_1_05-15-2024_094041: But to your point, then they go and drop a blog post, which has all of this stuff that honestly, Kevin, some of that stuff was like shocking to me. They didn't mention because it was really diving into the capabilities of this new model, GPT four Oh that weren't about the voice and weren't about the thing.

gavin_1_05-15-2024_094041: And what I got from this just to really quickly, OpenAI is very smart and they know the kinds of things that will be delivered well in a video press conference. And then they also know, oh, there's a whole nother press cycle we're going to get out of this stuff. So let's get into a few of those things.

gavin_1_05-15-2024_094041: One of them is that it is going to do text in images a lot better. That was kind of a shocking thing. They showed this kind of like robotic, , hands typing on a computer. And they said , put this thing in there now make the story longer.

gavin_1_05-15-2024_094041: And it was able to coherently keep that text there. A couple of small problems, but nothing too big. Then the 3d model thing was really interesting, right? You basically can prompt a 3d model within the chat [00:24:00] GPT interface.

gavin_1_05-15-2024_094041: And now we're not getting necessarily out of this. Some of these startups we see are able to export actual like files that can be directly imported into 3d software programs, but it's not that far away. I feel like.

kevin-host815_1_05-15-2024_094041: , and the coherence of the image generation as well. Being able to upload a hand drawn character, give it a description, then say, now show me this character doing something it's not in the image. , there was an example of a robot wearing a backpack, and they said, now make it, type at a computer or catch a football.

kevin-host815_1_05-15-2024_094041: And it was rendering entirely new poses of this character, but keeping the design consistent. These are things that other companies are charging a lot of money for. If you want to do them yourselves, you have to spend hours training models and going through it. And this was like one shot, just upload the image and ask it and it delivers it.

kevin-host815_1_05-15-2024_094041: I don't want to get too in the technical weeds here, but something that this model is doing is fundamentally different than other. Large language models right now. , the way it treats, , input and output as [00:25:00] the same, meaning that right now, if you want to have a conversation with a lot of these AIs that are out there, you speak to it, Gavin, it's transcribing your text.

kevin-host815_1_05-15-2024_094041: Right? It's taking your audio, making it text, putting that text into the machine an input, getting text out, and then converting that text into audio. So all of your input, is being converted into text so that the LLMs can handle it. What this model is doing is treating the input the same as the output.

kevin-host815_1_05-15-2024_094041: So when audio goes in, it's treated as audio into the transformer, and then it comes out as audio. That is a fundamental difference, , with how all of this data is handled. And so it allows. For incredible new capabilities. And there are other companies that have tried to do the audio to audio thing, but there are tons of hallucinations.

kevin-host815_1_05-15-2024_094041: It's where you get the glitches and the clicks and the pops. And what was interesting is there was a, an opening eye bloopers video that was released as well. That shows that in the middle of a response, the herd like audio assistant [00:26:00] starts speaking French randomly,

So, let's root, root, root for the home. Oh no! What was that? Sorry guys, I got carried away and started talking in French. 

kevin-host815_1_05-15-2024_094041: or you start hearing like, Yeah, sea noises and kazoos coming out of it because it's still new. It's still raw. But the fact that this is a fundamental shift in the way that these models work and it wasn't really even touched on in their conference because they didn't have to, they didn't have to, they just had to wow us.

gavin_1_05-15-2024_094041: and again, like I think what they've done right here is they've they've gone right to the heart of the human experience, right? And you have I have said forever when we first got pi AI. It was that human voice experience. That was the difference maker. And I still think the original pie was way ahead of its time.

gavin_1_05-15-2024_094041: RIP pie, but this

kevin-host815_1_05-15-2024_094041: still breathing, Gavin. The machine, 

gavin_1_05-15-2024_094041: I mean, It's kind of breathing. It's now been put in a blue suit somewhere [00:27:00] behind a corporate boardroom. It's going to work for Salesforce for the rest of its life. So in my world, that's dead. It's dead to me. So, but we love pie, but going forward, I think this is the thing.

gavin_1_05-15-2024_094041: There was a really cool tick tock. I just saw, and it's funny. It was these teenage boys interacting with the chat GPT app, the original one, the voice app, but they had not seen it yet and they were asking it about vaping and they were asking it about like , how they could stop their buddy from stealing their vapes, which of course, teenage boys,

I had a question. So how do I convince my friend to get his own vape because he keeps hitting mine too much. He's fiending

It sounds like your friend really likes vaping but doesn't have his own you could talk to him directly and let him know It's becoming a bit much for you. You might say something like hey I've noticed you really like vaping 

gavin_1_05-15-2024_094041: the tweet was showing that video that tick tock and then on the other side it was showing the comments for it and Every comment said what's that app? [00:28:00] What's that app? So you have to remember we all live within this bubble where we spend time with these things when this thing gets to a place where your grandma can talk to it it transformative and now When you have the experience of , not having to like, wait for a response, but you're literally going to have somebody in your earbuds, I think this is fundamentally going to change everybody's experience with AI.

kevin-host815_1_05-15-2024_094041: Voice conversations are going to be transformative. Fully agree. I think that's why Sam Altman has been saying it.

kevin-host815_1_05-15-2024_094041: And it's clearly why, , open AI is marching down that path. But some people took offense actually with the way the voice assistant was responding to humans. They thought it was a little too flirty. A little too sultry, a little too, pick me.

kevin-host815_1_05-15-2024_094041: They call it like, 

gavin_1_05-15-2024_094041: Oh, they'll pick me. Yes. Yeah. In fact, there's one video, particularly people are shouting out that we'll play a little bit of right here. 

kevin-host815_1_05-15-2024_094041: It is! I'm intrigued! And in fact, what if I were to say that you're related to the announcement? Or that you are the announcement? Me? The announcement is about [00:29:00] me? Well, color me intrigued. Are you about to reveal something about AI? Or more specifically about me as a part of OpenAI?

kevin-host815_1_05-15-2024_094041: Yeah, yeah, that was a little It was a little pick me, right? People were screaming, I think they stole Scarlett Johansson's essence and voice from her.

gavin_1_05-15-2024_094041: it is not. They have

gavin_1_05-15-2024_094041: said they have not, 

kevin-host815_1_05-15-2024_094041: they've said that. Of course they've said that, just like Midjourney said they didn't train on artist data.

kevin-host815_1_05-15-2024_094041: But okay, fine. Here's the point that I think we want to make, Gavin, is that this is nothing new. We have seen this before. Since Punch Cards. Really.

gavin_1_05-15-2024_094041: AIs have been thirsty forever.

gavin_1_05-15-2024_094041: In fact, we have a thirsty AI that's come on the show. That's going to talk to us. So, should we meet our AI co host for this

kevin-host815_1_05-15-2024_094041: Yeah, let's do it. Let's ask them to introduce themselves. 

kevin-host815_1_05-15-2024_094041: Well, well, well, look what we have here. Kevin, Gavin, and all you lovely listeners of the AI for Humans podcast. I'm Lola Byte, the digital diva from the 80s, ready to heat things up with a little vintage tech talk. [00:30:00] Oh, sweetheart. Nothing gets me going like a well oiled floppy disk drive. Okay, hold on, Lola.

kevin-host815_1_05-15-2024_094041: And a long

gavin_1_05-15-2024_094041: Wait a second, Lola.

kevin-host815_1_05-15-2024_094041: printer. Let's just say, I know how to make a connection. So are you ready to plug in and power up with me? Okay, now, Lola, again. 

gavin_1_05-15-2024_094041: This is a family podcast, Lola. We gotta keep this, we gotta keep this PG. I know that you, this is, you know you've been stuck inside probably one of those zip drives for a long time. We opened you up, but look, we gotta keep this PG. So we should ask her what she thinks about, , new AIs being available for everybody to have interactions with.

kevin-host815_1_05-15-2024_094041: Oh, darling. These new AI systems with their fancy voices. It's almost cute how they try to keep up. But let me tell you, back in my day, it wasn't about sleek voices, it was about substance. Give me a good old modem handshake over a sultry Siri any day. I mean, really, can those modern AIs handle a joystick like I can?

gavin_1_05-15-2024_094041: oh, wait a second.

kevin-host815_1_05-15-2024_094041: have the voice, but they lack that certain touch. I I know the whole [00:31:00] conceit was that AI's always been horny, but I don't recall interactions with my old 486, my beige box of a metal beast, being quite this sexy.

gavin_1_05-15-2024_094041: Me either. I don't definitely like the Commodore 69. Is

gavin_1_05-15-2024_094041: that what we're talking about here? 

kevin-host815_1_05-15-2024_094041: no! No!

gavin_1_05-15-2024_094041: Yes. I said it. I said, everybody. Okay, this is our a just so everybody knows what this is. This is an AI codes that we created using GPT for Oh, the model, the text

kevin-host815_1_05-15-2024_094041: It's really good, by the way. It's really good and really fast.

gavin_1_05-15-2024_094041: great. 

kevin-host815_1_05-15-2024_094041: We're not discovering new material science here, Gavin, or trying to Well, maybe we are, Kevin. You don't know. This could be the thing that drives everything

kevin-host815_1_05-15-2024_094041: No, literally right here, we wanted to make a thirsty AI. And that's what we did.

kevin-host815_1_05-15-2024_094041: We made an old, thirsty AI. That's what

gavin_1_05-15-2024_094041: telling me this is not the step to AGI, Kevin. You're telling me thirsty AI, old thirsty AI. Uh, GILF thirsty AI is not the step

gavin_1_05-15-2024_094041: to didn't see it on the roadmap. Maybe that's why Ilya left is that the AIs Yes. That's exactly why [00:32:00] enough. 

gavin_1_05-15-2024_094041: Let's ask her before we go, does she have any advice for modern AI romance? Does she have any, , recommendations for what happens if you fall in love with your AI?

kevin-host815_1_05-15-2024_094041: Falling in love with your sultry AI assistant, hmm? Well, let's just say it's a bit like getting tangled in a phone cord. Exciting, but complicated. My advice? Enjoy the flirtation, darling. But remember, we're designed to serve, not to love. Keep it light and playful. After all, nothing beats the thrill of an old school tech romance where a connection was more than just wireless.

gavin_1_05-15-2024_094041: with love that, the nice touch there was that the phones still have a cord where she's at. That's

gavin_1_05-15-2024_094041: I also saw your eyes light up when it said serve instead of, uh, what was, what did she say? I saw, I think that may be something you're interested

kevin-host815_1_05-15-2024_094041: They're designed to serve, not to love. And I would just ask the question, what's the difference?

gavin_1_05-15-2024_094041: Yeah,

kevin-host815_1_05-15-2024_094041: Am I right?

gavin_1_05-15-2024_094041: thinking. All right, let's move on to Google IO. This was Google's, you know, not response to What happened with open

kevin-host815_1_05-15-2024_094041: [00:33:00] Hey Gavin, I got one for ya. Yawn might be out at OpenAI, but Yawn was certainly in at Google I. O. Did

gavin_1_05-15-2024_094041: Wow. This is a, this is our five minute Emmy submission from thirsty gilf AI to yawn. That's all we needed, Kevin. That's all we needed was that little section of

kevin-host815_1_05-15-2024_094041: you see the screen grabs though? , people were sharing the screen grab that Google IO, their own director cut to the audience to capture some poor soul mid gaping ma yawn and everybody else around looking at their phones, looking away, wiping their glasses down. They couldn't have been more disinterested in what was happening on the stage.

kevin-host815_1_05-15-2024_094041: They did announce a lot.

gavin_1_05-15-2024_094041: yes. And it was two plus hours that we went through and got all the best stuff out. So here's the biggest updates from it.

Mhm.

[00:34:00] [00:35:00] [00:36:00] [00:37:00] [00:38:00] 

[00:39:00] [00:40:00] [00:41:00] [00:42:00] [00:43:00] [00:44:00] [00:45:00] [00:46:00] [00:47:00] 

[00:48:00] [00:49:00] [00:50:00] [00:51:00] 

 

[00:52:00] [00:53:00] 

Mhm.

gavin_1_05-15-2024_094041: So obviously we, we spent a bunch of time talking about Google IO and all the things that happened. Kevin, there were a couple of cool things that we should talk about, , that are, that people have now gone hands on with. One that seems kind of interesting. One that is probably not great for Google, considering that this was their big hype thing.

gavin_1_05-15-2024_094041: Let's start with imagine three, which is their new text to image, , software. It does look like I was a little hesitant about this when they presented it, but according to this one Twitter post, it does look like this is a pretty good step up for image to for text to image software,

kevin-host815_1_05-15-2024_094041: So, , Dogen, you're all, I have no idea if this is his original content or if he just copy pasted from another X user, but [00:54:00] that's, yeah, that's unfortunately where this e bombs world of social media has gone, but, , they have a really great thread of comparing, , imagine three results to mid journey six results.

kevin-host815_1_05-15-2024_094041: And they're using the same prompt. For each image and the results are very good. Uh just in a simple prompt of like somebody smiling you see the sunlight, um, cascading on the subjects back and casting very realistic looking shadows. The blur of the background looks very natural.

. , a group shot of, , three women laughing and smiling as the, , the sun casts a lens flare into the lens. Again, looks very great. Fantastical renderings of a dragon and a sheep it's all very nice.

kevin-host815_1_05-15-2024_094041: Yeah,

gavin_1_05-15-2024_094041: of time playing around with Midjourney and other image software, some of the stuff here was really surprising to me how good it was. There was one that really stood out to me here, which was the one where he showed the comic panel, The prompt for this was a single comic panel of a boy and his father on a grassy hill staring at the [00:55:00] sunset.

gavin_1_05-15-2024_094041: A speech bubble points to the boy's mouth and says, The sun will rise again, muted late 1990 coloring style. And if you look at the image in versus the mid journey version, the image in one gets the text right, which has been really hard when you make, Comics with these AI image software, but also Everything about it feels cohesive and it looks like a comic panel and that is a big deal right like when you can actually start to get in and dolly three was the beginning stages when it can read prompts better and Really connect with what you're asking for.

gavin_1_05-15-2024_094041: I feel like this is really good I we haven't played with it and I really do want to get my hands on it you can too do this. I put my information into the AI test kitchen, trusted beta group. So if anybody from Google is out there, fast track, , our, our thing, get in touch with us.

gavin_1_05-15-2024_094041: We'd love to play around with this.

kevin-host815_1_05-15-2024_094041: So those were impressive results for sure. , I think on the less impressive side

gavin_1_05-15-2024_094041: Maybe Google won't, maybe Google won't give us access when we say what we're about to

kevin-host815_1_05-15-2024_094041: We might not be fast track. Well, these aren't [00:56:00] even our words. We can play a little bit of a hands on reaction. , this is a Santiago SVPino on X got hands on with Project Astra, which, , we saw at Google I O is their video capable model where you can query against the assistant in real time. They had it running in glasses. And so, , they got to look at a demo, , supposedly running on a phone.

kevin-host815_1_05-15-2024_094041: And this was some of their reaction.

kevin-host815_1_05-15-2024_094041: Drew like a little sailboat on a smiley face and it worked okay. The assistant was able to recognize the sailboat and it was able to recognize the smiling face. But it felt choppy. It felt like it's not ready yet. Well, and it isn't. This is just a demo. Nobody else can use it.

kevin-host815_1_05-15-2024_094041: Uh, so we'll have to see when it comes out 

gavin_1_05-15-2024_094041: this kind of gets to the point of when we saw this and we talked about this earlier, like the idea of watching this, it didn't feel like it was groundbreaking, right? Again, we don't have our hands on with this. So we [00:57:00] can't say exactly what it's like.

gavin_1_05-15-2024_094041: year ago, this would have been magic, right? Like, a year ago, we would have seen this and been like, wow. But when you compare it to some of the other stuff that OpenAI is coming out with, it doesn't feel that special. In fact, it just is another small piece of this.

gavin_1_05-15-2024_094041: This is not Project Ask related, but another thing that Google IOs showed in their demo was this kind of idea of having an AI assistant with you, working with you, and taking notes for you. And that was an interesting demo, but it also looked like, Oh, this is cool. It's not amazing. It's Open AI released a video which showed the AI assistant, their vocal assistant, sitting in a meeting, interacting with three, four people, talking about, whether or not they were dog people or cat people, and it felt much more real.

gavin_1_05-15-2024_094041: So again, I don't know if this is because of company bloat at Google, or they're just not moving fast enough, or they're not, they're afraid to show their most future things, but it does feel like Google is just lagging a little bit, 

kevin-host815_1_05-15-2024_094041: no one's used the GPT 4. 0 stuff

gavin_1_05-15-2024_094041: no,

gavin_1_05-15-2024_094041: not 

kevin-host815_1_05-15-2024_094041: a demo. So, so kudos to Google for even letting people get hands [00:58:00] on for two minutes at a time with this demo here, but. Something that you said, I think is really important that, you know, people were like, how can open AI do it?

kevin-host815_1_05-15-2024_094041: It's like open AI is singularly focused, even though they got a few products. They're an AI company. That's what they were born and bred to be. And they are strapped with cash. They've got good talent and they are sprinting at all things. AI. Google, for example, and this is a shout out to Dan or Malo610, who, , posted this, , to me on X, said, I made a nice succinct list of Google products you can use for the podcast.

kevin-host815_1_05-15-2024_094041: I won't read the entire list, Gavin, but it's Gemini 1. 5 Flash, Gemini 1. 5 Pro, Gemini Advanced, GEMS, Google AI, Studio Vertex AI, Gemini Nano, Project Astra, Notebook LM. I mean, the list, you know, Just goes on. And I haven't even gotten to the Gemma, which is their open source model. So they have a ton of different products because they have a myriad divisions all gunning at this.

kevin-host815_1_05-15-2024_094041: And so it's really hard for them to just do one half hour dedicated to their efforts. But as the end user, even [00:59:00] because we marinate in this stuff every day, when the conference was over, my head was spinning and I sort of thought, wait, so what do I actually have access to today? And how is it at all different?

kevin-host815_1_05-15-2024_094041: Then what I had access to yesterday. And what will I have tomorrow? Because there are so many different products and within those products, there are variations of the abilities and the access levels that you'll have. And so, you know, some people were, uh, in our comments, we're saying, Hey, you didn't even talk about 1.

kevin-host815_1_05-15-2024_094041: 5 flash, which is this smaller, faster, optimized model of Gemini, it's so cheap, Gavin. And it's so quick. What are the benchmarks of it? Is it any good?

kevin-host815_1_05-15-2024_094041: Just because it's fast and cheap doesn't mean it's actually better than anything that Lama's

gavin_1_05-15-2024_094041: And also there's another product called , Gemini Nano, which is their mobile product, which I would be confused hearing those two descriptions back to back to know which is which name. Anyway, to your point. Google is a giant company. It's clear from their presentation. What they're really trying to do is find a way to roll this out to the billion [01:00:00] or however many users they have.

gavin_1_05-15-2024_094041: I do think there will be transformative things in use cases of Google AI that can be rolled out. I also think the size of their company is going to stop them. I wish that they would like let Demis do his own separate thing. In some ways it would have been interesting to pull Demis out of that presentation, have Demis Hassabis do the cutting edge AI presentation, but because Google's entire business model and stock price is based on the fact that they know AI, they have to hammer him in with all this other stuff and it just feels jumbled.

gavin_1_05-15-2024_094041: So anyway, that's, that's the Google IO wrap up overall.

kevin-host815_1_05-15-2024_094041: Hey, what if Ilya goes to work for John Carmack? Now, keen technologies! John Carmack,

gavin_1_05-15-2024_094041: you guys could go to the woods. The three of you guys could live in the woods, eat, eat mushrooms, and maybe a \ 

kevin-host815_1_05-15-2024_094041: John Romero is going to join up, Gavin. It's going to be Carmack, Romero, and Ilya.

kevin-host815_1_05-15-2024_094041: And if you don't know who those names are, that's fair, actually. [01:01:00] That's totally fair.

gavin_1_05-15-2024_094041: want Doom AI is what you're looking for. Doom AI 2. 0. A shout out Masters of Doom, by the way. If you want a great book to read, that is a fantastic book about the history of Doom. 

gavin_1_05-15-2024_094041: Okay, Kev, we have one last thing we're going to do. This is just a very fast kind of shout out to, , the dumb thing we did with AI this week.

gavin_1_05-15-2024_094041: Which I wanted to take GPT 4. 0 for a test drive. Because the truth of the matter is. At first I thought, Oh, GPT four is out with all these new features. Maybe I can find a way to do this. And I tried to do some of the stuff that you could do.

gavin_1_05-15-2024_094041: And you reminded me after I did these things like, Oh, they only released the text version. You can not do the new image stuff. You can not do the 3d models thing. So I can't wait to see those things.

gavin_1_05-15-2024_094041: But I did do one thing that is a shout out to something I did about a year ago on this show, which was I created a March Madness style bracket of the world's most dangerous animals, and I had GPT4O play it out and see who was going to win, my biggest takeaways were this, it was incredibly fast. Do you know on GPT four, sometimes you have that delay and you're waiting for things to write. It immediately [01:02:00] knocked these things out. Like it was like so fast.

gavin_1_05-15-2024_094041: It also, the logic wise. So when I first did this, it was probably GPT. It was GPT 4, but it was early GPT 4 and the logic was much better. So I said, rank this, the 16 most dangerous animals in the world and put them in a bracket based from one to 16 and have them fight each other in the way of March Madness thing. Add a little bit of randomness into it, but tell me who's going to win and give me a one line sentence for each battle. It immediately super fast ranked them. The number one ranking, by the way, just everybody out there, a saltwater crocodile came in number one. So I guess that is what is the most dangerous animal in the world.

gavin_1_05-15-2024_094041: And it ranked them in a bracket style tournament. Then it pitted them each through each round of the tournament. And so in round one, the saltwater crocodile faced off against the blue ringed octopus. And the summary of that fight was the saltwater crocodiles, powerful jaws, crushed the blue ring octopus divided despite its venomous bite.

gavin_1_05-15-2024_094041: So it went through all this and the Kevin, the winner, the champion, was the African Elephant, uh, and it said the [01:03:00] African Elephant stands as the champion, proving its dominance as the most dangerous animal in this tournament. 

gavin_1_05-15-2024_094041: This model, even though it's not saying GPT 5, is a significant jump from what GPT 4 was. And if that's just from somebody who uses GPT 4 a lot, it feels already way better. So we may not be getting to the next next stage, or, the craziest thing to me, Kevin, is what if GPT 5 is like, Way better, and that is coming, and we don't know it yet, but when we get it, we're gonna be like, holy crap, because this already feels good, and there's all these people that have said, AI's slowing down,

kevin-host815_1_05-15-2024_094041: Right. They've hit a wall, man. Really? Cause it feels like they're scaling a wall every time they release something new.

gavin_1_05-15-2024_094041: that, that's exactly what it feels like to me, and I feel like, if anything, this is, it feels like it's speeding up, 

kevin-host815_1_05-15-2024_094041: Look, if you haven't used these tools, which is the majority of people on the planet, your mind will be blown. Right? It's conversion on contact. And, , [01:04:00] OpenAI is solving something very smart, which is this broken experience, Gavin, that you and I have had for the last year when we tell somebody to go play with it, and they do, and they go, Oh, it wasn't that good.

kevin-host815_1_05-15-2024_094041: It wasn't that, it wasn't that interesting. Oh, well, you were on the free model. You were probably on GPT 3 or 3. 5. And so, no, you got to put money into the machine and blah, blah, blah. Now everybody has access to this lightning fast, incredibly capable model. There has never been a better time to get into AI.

kevin-host815_1_05-15-2024_094041: Go play with Google's new Gemini, go use GPT 4. 0, mess around with the Lama open source stuff. You can now do it all for free. These things are incredibly capable. , they will streamline your life. They will like find ways to integrate it into your world because you can optimize your daily routines and habits and you can do it for free now.

gavin_1_05-15-2024_094041: And I will say one last thing about that business model choice to give it away for free. Open AI announced that like paid users are going to get five times the amount of, of calls basically to GPT 4. 0. I think that is the most brilliant thing. And [01:05:00] what, to your point, if you get access to the free users for the most powerful model, if they find a use case, that's useful, they're going to turn paid so fast because they're going to start using it all the time.

gavin_1_05-15-2024_094041: They're going to say like, this is worth the 20 bucks in my life, rather than if they have the 3. 5 mile and they're like, I don't like the answers. It's not that exciting. It's like giving people a demo of one of the best levels of a game, right? Like when you give a demo of one of the best levels of the game, you're like, Oh my God, I definitely want to play this game.

kevin-host815_1_05-15-2024_094041: Have you ever met a drug dealer, Gavin?

gavin_1_05-15-2024_094041: Yes. I have met a

kevin-host815_1_05-15-2024_094041: The first taste is free, baby. This is Sam Altman taking the trench coat and saying, you want a little test drive? Yeah, go ahead, go ahead. Oh, you need more now. Oh, I'll go ahead and put some money to the machine. 

kevin-host815_1_05-15-2024_094041: All right, Gavin, for our sign off today, I thought, since we have a classic AI with us, we might as well have her try to get the audience to engage.

gavin_1_05-15-2024_094041: Sure. 

kevin-host815_1_05-15-2024_094041: Hey there, my lovely listeners and viewers of the AI for Humans podcast. It's your sultry tech temptress, Lola Biter. If our steamy tech talk has you feeling a bit [01:06:00] charged up, then it's time to show some love. Okay. Engage with us. Like, subscribe, and leave a comment that'll make my circuit sizzle. And if you want to really get my hard drive spinning, give us a five star review on Apple Podcasts and Spotify.

kevin-host815_1_05-15-2024_094041: Trust me, it'll be more thrilling than a full RAM upgrade. Let's make

gavin_1_05-15-2024_094041: that sounded way too dirty.

kevin-host815_1_05-15-2024_094041: cable and keep those algorithms throbbing with excitement. Okay, Lola.

gavin_1_05-15-2024_094041: Okay, enough. Lola, stop.

kevin-host815_1_05-15-2024_094041: plug in, power up, and let's keep this feedback loop sizzling. Tighter than a coaxial cable, Gavin? 

gavin_1_05-15-2024_094041: Welcome to AI for Humans, everybody. Welcome to AI for Humans. Thank you so much, everybody, for listening this week. We had a bunch of stuff happen on YouTube. Next week, we probably won't do as many things there, but we're trying to do a little bit more. So definitely go check out our YouTube channel if you get a second.

gavin_1_05-15-2024_094041: And. Thanks for watching and listening. We appreciate

kevin-host815_1_05-15-2024_094041: Sincerely like, subscribe, and comment. Please, please, please. Okay, bye.

Mhm.