AI for Humans

GPT 5.5 Just Dropped. OpenAI Accelerated The AI Race (Again).

Thanks to @HP & Intel for sponsoring us! More on the Zbook Fury https://bit.ly/4uapNHs OpenAI's flagship AI model GPT-5.5 is here. It's smarter, faster, cheaper, better at long-running tasks and…oh boy, everything just changed again. This week on AI For Humans, OpenAI dropped state-of-the-art GPT-5

Show Notes

Thanks to @HP & Intel for sponsoring us! More on the Zbook Fury https://bit.ly/4uapNHs

OpenAI's flagship AI model GPT-5.5 is here. It's smarter, faster, cheaper, better at long-running tasks and…oh boy, everything just changed again.

This week on AI For Humans, OpenAI dropped state-of-the-art GPT-5.5 and it's not just another model release, it's the start of a much faster iterative rollout. Sam Altman and Chief Scientist Jakub Pachocki both said to expect significantly more releases going forward, admitting the last few years have been surprisingly slow. It may not be Anthropic Mythos numbers but it's actually here.

We walk through the benchmarks, the viral OpenAI dev video about being able to be lazy and letting the model figure things out, and incredible projects people built in the first 24 hours: a full toy railway simulation, Sebastien Bubeck's Unicorn Test, a UFO tank game, and Gavin's Animal Game spun up in 30 minutes.

Plus, Codex got a bunch of new features including browser use and docs. OpenAI shared agents in ChatGPT. More ChatGPT Images 2.0 examples keep dropping. I mean… it's a LOT.

GPT-5.5 IS HERE. THE AI RACE JUST GOT FASTER. VROOM VROOM.

#ai #ainews #openai

Come to our Discord: https://discord.gg/muD2TYgC8f

Join our Patreon: https://www.patreon.com/AIForHumansShow

AI For Humans Newsletter: https://aiforhumans.beehiiv.com/

Join our TikTok @aiforhumansshow

To book us for speaking, please visit our website: https://www.aiforhumans.show/

Thanks again to our sponsors #HP & #Intel

// Show Links //

GPT-5.5 Official Announcement From OpenAI

https://openai.com/index/introducing-gpt-5-5/

Sam Altman's GPT-5.5 Launch Tweet

https://x.com/sama/status/2047378254575685707?s=20

OpenAI Dev Video: Being Lazy With GPT-5.5

https://x.com/OpenAIDevs/status/2047377079352877534?s=20

Codex Gets a Bunch of New Features

https://x.com/thsottiaux/status/2047387017974337611?s=20

Codex Update Video From OpenAI Devs

https://x.com/OpenAIDevs/status/2047381283358355706?s=20

Shared Agents Announced in ChatGPT

https://x.com/OpenAI/status/2047008987665809771?s=20

GPT-5.5 vs 5.4 Toy Railway Comparison

https://x.com/petergostev/status/2047376725106131014?s=20

Sebastien Bubeck's Unicorn Test

https://x.com/SebastienBubeck/status/2047383628922167390?s=20

UFO Tank Game Built With GPT-5.5

https://x.com/intheworldofai/status/2047383340483821798?s=20

Where's Waldo Prompt Test

https://x.com/JeffLadish/status/2046839551403176251?s=20

Gavin's NFL Draft ChatGPT Images 2.0 Test

https://x.com/gavinpurcell/status/2047203603757367426?s=20

Transcript

AIForHumansOpenAIGPT55

===

Kevin Pereira: [00:00:00] GPT 5.5 has arrived Open AI's new flagship model has officially entered the chat.

Gavin Purcell: Smarter, faster, cheaper thinker. GPT five is better on long-term tasks, begins a new stage of iterative learning, which means much faster rollouts and might just make us lazy as hell.

Kevin Pereira: Yeah, the updates are causing that

Gavin Purcell: before previous leave.

A lot of my prompts have to be very detailed. They're very instructionally kind of. Whereas with. GPT 5.5. Sometimes I become lazy and I kind of get hit a very ambiguous task, but then it will figure it out.

Kevin Pereira: We will show you some incredible projects that people have already built with 5.5 and unveil Gavin's latest Animal Death Match arena thing, which we haven't even looked at yet.

Gavin Purcell: That's right. There are new crazy ways that you can integrate GPT Image two, the image model. Into Codex, into the new model, and I did it, and I'm gonna show you all right here. It's a ton of fun. And it's GPT 5.5 day, and this is AI for humans.

That's my, [00:01:00] uh, that's my brain dying at this point. I got a little, uh, uh, uh, Superman curl.

Kevin Pereira: Oh, that looks good. Yeah, I know. It doesn't look like a tapeworm at all.

Gavin Purcell: Welcome everybody to AI for Humans, your twice a week guide to the world of AI News. And Kevin, what a week. What a crazy month we have had. The AI world continues to kind of get nuts again and again, and it's not gonna slow down anytime soon we have a new flagship model. Finally, Spud. The giant potato has landed.

Kevin Pereira: Okay. Uh, Gavin, hold on a second. Where I go to my codex and I click a check for updates and uh, it's okay. Sorry. Hold on. Let me, it's not there. Lemme go to my chat. Hold on. Lemme go to my chat. GPT app. Let me go to my chat. GPT app, A new file, uh, check for updates. And it's, uh, it's not, it's not there. It's not, I have an

Gavin Purcell: enterprise account and a pro account.

Gavin comes. Some of us can't. Some of us can't be as early as others. Kevin, I did get access. I do have access. Kevin, for some reason you

Kevin Pereira: found a magazine on your brother's. On your brother's floor, under his [00:02:00] bed.

Gavin Purcell: Yes, I did. I did. Uh, it's rolling out to everybody today. I got it. Kevin does not have it yet.

We are recording this on Thursday. I'm dying. But it is a very cool new model. We need to dive in and, and really talk through some of this stuff. Kevin, I think the big thing that I was expecting from the get go with this was like, there's a lot of hype around this. There's a lot of vague posting, as they say in the AI world.

We saw the mythos, like kind of mythical benchmarks. The, the model that philanthropic says was too dangerous for everybody to use. And then we saw Opus 4.7 come out last week. So. This is kind of, uh, open AI's answer to that. Now, just the basics right now, some very important things to know, and we're gonna dive into the specifics.

The number one thing that they are touting here is that this is much more reliable on long running tasks. And later on in the show, I'm gonna talk about a thing that I just did an hour ago that I literally only had to kind of input a couple things and I got something useful out of it, and it ran for about an hour.

The other thing that I have seen a lot of people talk about is that it thinks. More for cheaper and better. And I know that's kind of a [00:03:00] lot to unpack there, but one of the things that, uh, was going on with both the move from 4.6 to 4.7 on Opus was this idea that they were gonna try to find a way to control the token costs and that it was gonna get better thinking.

And I'm curious to know what you think about that kind of idea now that we're at this place where things are getting better, but also these companies are a little bit trying to maybe control their costs on the other side.

Kevin Pereira: Well, there, I mean. They need to control cost for the users. Obviously they need to do it for themselves, but you know, at the, the, the meme was the GPT 5.5 nail in anthropics coffin.

Yeah. And people were, you know, posting and reposting that and sharing that because. With the latest 4.7 opus, there was a whole bunch of, uh, users felt regressions, right? Yes. It was, it was costing more, uh, their limits were being eaten up, and 4.7 was supposed to help with that. So when you're running these long-term agents that need to spawn subagents and go out and read documentation and explore the web and write things and test things, all of that takes compute.

It takes tokens. Yes. And so it behooves the, [00:04:00] the companies that serve this in some ways, behooves to make that more, it behooves, it does. It behooves them. If they are, if they are mears, if they are horse, horse, like this is, yeah, this is,

Gavin Purcell: but they behooved, they behoove our,

Kevin Pereira: it's, it's in the best interest.

It's in some ways in their best interest to make these models more efficient. Right. 'cause they can serve more and they can serve it faster. Yeah. On the other hand. In some ways, they're not incentivized because the more tokens these models take, the more the end user has to pay. So it's this delicate balance of trying to extract as much from the end users.

Yes. And their corporate bank accounts as they can, while not extracting too much that people say, Hey, anthropic. We're done with you, we're now leaping to open ai. Yeah, so, um, this has been a huge issue. In fact, by the way, like not to get too in the weeds, but about 30 minutes after 5.5 was announced, Andro posted a big, Hey are bad.

Y'all said, oh, they

Gavin Purcell: did?

Kevin Pereira: I didn't even see this shit. Yeah, that's, ING said Cloud code was getting rough. A bunch of engineers that I chat with were like, Hey, look at this. They basically found three major issues. So instead of gaslighting users, [00:05:00] they said, actually, yeah, you're right. We had some issues on our end.

Wow. We're gonna reset all of your limits, even though some of you might have already paid out the nose. Because of these errors. I digress. Let's talk. It's five point fives today. Let's give OpenAI some flowers. Yeah. Because yes, this model should be more optimized.

Gavin Purcell: Yeah. I mean, I think this is all part of the big conversation right now that we have to talk about as we talk about these new models is you would really have these two companies kind of neck and neck and getting into this.

And I think Kevin, it might as well, we might as well jump into it now. It's time for some Benchmark Boy conversations. Benchmark Boys, benchmark Boys.

There was a lot of people last time who were confused. It is Benchmark Boys and not bros, and we said Benchmark Bros. So just to clear that up from everybody's perspective, I

Kevin Pereira: think they're two separate, uh, war You. Do

Gavin Purcell: you

Kevin Pereira: think? But let's talk, let's get the boys off the bleachers. Let's get 'em in. You're a good game down.

Let's talk benchmarks.

Gavin Purcell: So I want everybody to know first and foremost, benchmarks are a weird thing in that these are, as many people in this audience probably already know, [00:06:00] in case you don't. Benchmarks are these numbers that are released that are, uh, testing these AI models on various specific tests and what they're good at.

And every time they came out, they release a series of these numbers. Kevin, the GPT 5.5 benchmarks are good. They are not as good as mythos. Right. And I think just to talk about this as a whole, mythos had some higher numbers. The one thing I think that I was expecting a little bit from this kind of how people were talking about this were, was a larger jump because Mythos, when you saw the mythos benchmark numbers, and again, all of this is about like what it really feels like to use the model, which we'll talk about in a bit.

These numbers are still very strong. Like we are talking about the GPT 5.5 thinking number is at an 82.7 and Opus 4.7 is at a 69.4. So that is a significant jump over that particular Egen terminal use number. But in some of the other benchmarks that have come out, like even on the one that opening OpenAI released, like the CS world verified, the number is almost the same, right?

So anyway, this is a long way of saying. It's [00:07:00] another step. It is not the kind of thing where you're like, it's gonna do everything for me. But I do think it's important, and again, I'll get to this later on, to kind of talk about what I did with the model already today. That the idea here is that you can give the model more stuff to do that's harder and it can go away and do its on its own.

That is the life change that we're all looking at now.

Kevin Pereira: So a couple things like on the benchmark front, we've talked about this before. There's be maxing, which is where companies overfit their model to crush the benchmarks, and it typically takes. A couple days or a few weeks for the vibes to come through and people say, oh, this is what it excels at and here's where it falls short.

Looking at the benchmark numbers, as you said, there's a couple places where even the, uh, you know, mythos is whatever. It's not out. Yeah, it's not out, so who knows. So comparing against Opus 4.7, which I have open right now in a terminal window, Opus bests this new model in some benchmarks. Yes. The early vibes coming out from like, uh, Dan Shipper and Evry and whatnot, like the early vibes are that this thing is, [00:08:00] is is the best model in certain use cases.

Yes. Yes. That for creative writing, it got a lot better for longer term horizon tasks, which are more specific. It got better. But that for, uh, uh, for being a generalist, some people still prefer opus. And so what I think is, is, is happening here is that. You know, companies have their own philosophies with how engineering should be done in general.

Forget the way these models should work. Right? And they, they tune the models to their preferences, to their taste. And so we're getting like a Pepsi Cola or an Android iPhone. Yeah. Sort of existence where it's like, look, iPhones are amazing. Android phones are amazing. Some people absolutely hate Android.

Ooh, people. I hate

Gavin Purcell: Android. Some people I hate Android. See, I don't wanna ever see Android in my face, ever. So there you go, Kevin.

Kevin Pereira: Oh wow. That's right. Gavin will kick alinker if you're getting tacos delivered. No. In a little rolly bot. I didn't say. Gonna kick kick Aer. He doesn't want an Android in his face.

He said it. He's a clank kicker. Hashtag younker kicker in the check.

Gavin Purcell: [00:09:00] No. Hashtag clinker kicker. I love clankers. Yeah, put put it in the comments. Open. ICTO. Yakob, I think his name is Jakob. Uh mm-hmm. Let me make sure I understand. Jakob Pache. Yako Pache had something really interesting, interesting to say about this and Sam kind of, uh, reiterated this in a couple tweets.

Basically they are saying we see pretty significant improvements in the short term, but extremely significant improvements in the medium term. Yes. I would say the last few years have been surprisingly slow, so everybody at OpenAI is kind of saying this is a new way that they're developing. They're gonna be much more iterative with rolling this stuff out, which we've also seen from Opus and Kevin.

There's been a little, there's a little piece of this in the blog post, but like this is another model that did a lot of work on itself, and I think this is just the speeding up of stuff. And as we've seen Opus ship all those features for cloud code and other stuff, I suspect we're about to see the same thing with OpenAI as well.

Kevin Pereira: Please, let's go let. Let's, let's take off friends. Let's do it. I mean, look, we, we even see it with, [00:10:00] um, like in the open source model community, right? A new Quinn model will drop. Yeah. And then you wait 30 minutes and then there's a distilled or fine tuned, and then a couple minutes later, there's another one that's optimized for a different operating system or a different, uh, you know, uh, processor entirely like the, the, the pace of the evolution here is getting faster and faster, and it would make sense that, that if as their foundational models get better, they're better at improving themselves as well.

Gavin Purcell: Yeah, and I wanna call out a couple other tweets that are really interesting. Um, prince has had it for a little bit and said that the GPT 5.5 thinking heavy, that's, there's different versions of this delivers better answers in two minutes than GPT 5.4 heavy delivered in 10. So like, that's a little bit of what's going on here.

The other thing I do wanna shout out is Sam wrote a longer tweet. Which was this idea of that iterative development. But then he also then said, we believe in democratization. We want people to be able to use lots of ai. We aim to have the most efficient models, the most efficient inference stack, and the most compute, blah, blah, blah.

So this is definitely a shot that feels like it's being taken anthropic. [00:11:00] And then I love at the end of this, he says, we love you and we want to win. We want to be a platform for every company scientist or entrepreneur in person. My whole career has been largely about magic of startups, and I think we're about to see that magic in hyperscale, but we love you.

And we want to win. So we have a combination of things going on here. This is a little bit of interesting stuff that's happening overall. Uh, the other thing we should talk about is Codex, right? So not only is this new model out, but Codex actually dropped a bunch of new features, which is really cool. And I just used Codex with these features.

I'm sorry Kevin, I know you don't have it yet. Keep updating and see if it arrives. Um, better browser, use better docs. One of the experiences I had with this Kev was. In Codex in the past, I don't know if you've had this experience, but I'm trying to build something. The browser's kind of funky and the in browser, which just came out like a week ago or a week and a half ago, sometimes pops up, sometimes it doesn't.

This time it was really solid, like it popped up. It showed me as it was working. I saw the little arrow moving around within the Codex [00:12:00] window, all very clean. So to me that's a pretty big deal. And this also follows up on the announcement that kind of didn't get enough hype earlier this week, which was about the shared agents in chat, GPT.

Did you see this?

Kevin Pereira: Yes. Yeah, yeah,

Gavin Purcell: yeah. So that's another way that like, you know, you can open the door to specific agents that have use cases within either Codex or Jet GPT, this whole world of like having things that can be spun up. It feels like to me there's a little bit of like a setting of the table.

For things like an open claw like world where you can go out and get all these agents that can do stuff for you, but maybe living within the open AI world itself.

Kevin Pereira: Well, that's exactly what it was. That's the, the, the open C qualification of the Codex app was adding these agents. Yeah. So if you want. Uh, an agent that just does email triage for you.

Now, you can easily set that up if you're running a small business and you need a dedicated agent to look at your CRM and, and check, uh, the, the, the status of your, your ab testing, of your ads and your marketplace. Like now, you can have all of these dedicated agents that can talk to each [00:13:00] other and be shared in the ecosystem.

The, the. Browser and computer use specifically the computer use on the Mac version of Codex is incredible. I think it bests the, uh, anthropic Cloud plugin. It definitely does.

Gavin Purcell: I think it a hundred

Kevin Pereira: percent does.

Gavin Purcell: Yeah,

Kevin Pereira: it seems way faster, seems way more capable. Odd to me that Sam Altman got on a live stream this week, and it wasn't for GPT 5.5, it was for image generation.

So that just goes to show you how powerful Tuesday's announcement was, how, how powerful the new image two model is. I, every day I'm seeing people generating wild stuff with image two, like generate a birthday cake that has code on it that when rendered actually makes an image of a birthday cake. Yes.

Was one of the ones that I saw that kind of blew my mind or, or complex mathematical functions integrated into like children's rugs. Like that would, they would play on like weird, weird stuff. And when you start pairing that. With a model like 5.5, now you start unlocking some really incredible capabilities.

Gavin Purcell: I'm very excited to talk about that [00:14:00] and show off some really cool examples of what's been made with 5.5 with the image model. But first, a message from a new sponsor. I'm about to do something I never thought I'd be able to do with a laptop, and that's because I have this HP Z ZBook Fury workstation to work with.

There are powerful computers and then there is this. We are very thankful to HP and Intel for sponsoring AI for humans this week. Sending us this absolute beast of a pc. This thing is powered by an Intel core Ultra V nine Pro processor, and it came ready to go right out of the box. I've been using it for everything, local ai, AI, video, running, cloud code, and even spinning up local LLMs for my own private research.

It's. That powerful. I'm gonna spin up comfy UI for local AI image gen right now. So I've installed a bunch of local models like Quin and Flex, which are free to download and free to generate, and I'm gonna start making something really important images for my new AI series, the Raccoon Bachelor. Here's why this matters, because I'm doing this locally and the models are open source.

I'm not paying per gener. I'm not waiting in a cloud queue and I'm not [00:15:00] sending anyone to anyone else's server that's at least a subscription or two. I'm saving per month and I can just make a lot more. And because this Bat Boy has an NVIDIA RTX Pro 5,000 Blackwell GPU, you can see just the size of it.

It's crazy. It can handle the bigger models, and it has 256 gigabytes of ram. A crazy powerful Intel CPUI am running stuff that used to require a dedicated. Desktop computer on my laptop, which is pretty incredible. And now thanks to this computer, I've got all the images I need to make that little raccoon bachelor.

Break the raccoon. Ladies' hearts, check out the link in our description if you wanna spec out the ZBook Fury. And thanks again to HP and Intel for sponsoring AI for humans.

Kevin Pereira: Well, as much as I love words from sponsors, Gavin, I love words from our dedicated followers. And, uh, you can leave them as a comment below.

And if you don't wanna say anything, I guess that's chill too. Just like and subscribe, leave a five star review. And if you wanna back us on page, your Honor, or buy us a coffee, you can do all that too. AI for Humans Show. That's our site. But sincerely, uh, thank you to our sponsor and thank you to everybody who helps grow this operation each and every week.

We appreciate your [00:16:00] time.

Gavin Purcell: That's right. And last week, thank you to everybody who said Kevin is beautiful. At the end of the show I see you YouTube commenters. There were a lot of them. Kevin, you're very happy. Okay, let's talk more about 5.5 because there are some really cool examples I've seen already. And I'm gonna show off my 64, uh, animal tournament game.

First and foremost, Kevin, there was a really interesting demo from Peter Sev. Which he made, he asked 5.5 to make a toy train set in, in GPT 5.5 heavy. Kind of crushed it. What was really interesting here is seeing he compare it to what 5.4 did, and you can really get a sense of like, okay. These are the kind of different quality sets of the model.

Like if you're not watching it, it's just very, very detailed. It's all being done like in a browser. He can kind of spin around it and it's just a much less detailed version in the 5.5, in the 5.4 version. And I don't know, it's, it's, it's one of those cool things that lets you see. See what the differences are a little bit.

Kevin Pereira: Yeah. I love, I love these same prompt tests and for those that are just getting the audio version, the 5.4 is cool, right? It's like a, yeah. A table with a model train set, literally chugging [00:17:00] along and then you can jump into like the conductor seat and look first person through it, but it, you know, it, it looks a little primitive.

It looks like an old Roblox type game when you jump to the new 5.5 high. The town that the toy track is going around is fully flushed out. Um, you know, there's buildings, there's trees, there's a little river with a boat going through it or whatever. And when you jump to the first person mode, you have controls that make sense and they're labeled appropriately.

And it's like just staring at it and going like, oh, that's a cool prompt. I like that comparison. It, it makes my head spin about what this test is going to look like in a year from now, Gavin, where? Or six months from now. Now, right. Whole. Sure. Yeah, sure. But like the whole room is gonna be modeled and you'll be able to go in and take.

Control. Control. And it will be multiplayer and it will run in browser. And it's just, I, I, I like, I, I'm so excited for this near future.

Gavin Purcell: I know I had a moment of that this morning thinking about like a year, year and a half ago when you and I would be excited about what these new models would look like and the fact that we can just spin up these things so much faster is crazy to me right now.

Uh, um, [00:18:00] another cool thing from Sebastian Beck, who actually works at OpenAI, uh, put a, uh, unicorn together with an SVG. And he said, basically he says. GPT 5.5, not fully saturating the tick Z unicorn test yet, but getting awfully close. He says, this is actual tick Z code. I find it so unbelievable that I'm putting the code below for anyone to verify for themselves.

So what you're seeing here is a code generated unicorn that kind of looks like a My Little Pony, but it's definitely a few far steps from what we used to see with code generated graphics. Like even the unicorn looks a little demure, like it's kind of like. Sadly winking at us. Or maybe not winking. Maybe it's closing his eyes.

It could be sleeping. I dunno what you think, Kevin. Is it winking? We don't see the other eyes, so who knows?

Kevin Pereira: Kevin, I actually don't wanna explore this with, this is a weird

unicorn.

Gavin Purcell: Move on. VORs SH test for you and you're move on to your thing.

Kevin Pereira: I actually love the way the unicorn is playing Coi and it's subtly kind.

Gavin Purcell: Just it's a little wink towards

Kevin Pereira: me letting me know. Gavin, everything you are doing is working these days in the gym. Really looking. [00:19:00] I came across a UFO tank game by, uh, in the world of ai. Um, and this was a like, supposedly like a one shot and it, there is a 3D tank that you can drive around a map as little UFOs whiz about and shoot at you and you can shoot at them.

And when you make a collision with a bullet, pew pew, UFO. Bye-bye. This is just like, again, like the, the, the new grounds of gaming. Yeah. I'm sure there's a thousand startups that are going after it, but the games are gonna start being good enough. They're gonna actually wanna participate in them and create them and remix them.

Gavin Purcell: Yeah, so let's talk about that. Uh, the project I gave 5.5 this morning was a classic project that I have given lots of times to AI models. Kevin will remember this well, uh, you and the audience may be new, you may not. I had an idea forever ago, I think it was two and a half years ago, which was I wanted to make a march madness tournament of the world's most dangerous animals.

You take 64 of the world's most dangerous animals, and you fight them one by one until there's a champion. The goal here is you as the player play one animal. [00:20:00] And then you go through this and I, this morning I literally, this is 45 minutes ago, I gave it two additional prompts for this. I said, go make this as a card battler.

I gave it a pretty complicated prompt to start just so it had the information on it. But Kevin, the big difference here is I gave it the image gen tool in Codex. So what I said to it was like, Hey, don't just give me, 'cause often what happens with this, when you try to get to make a game, it'll give you like some sort of, almost looks like a website.

I said, don't do that. Pull up images, so you're gonna pull it up for the first time. Right now I've pulled it up earlier. It's not great, but it's also like amazing that I made this in 45 minutes.

Kevin Pereira: Okay. So I'm at the Dangerous Animal Madness site. Um, I love that there's some particle effects going on in the background or whatever, right off the Rip Gav.

Nice. Okay. Um, win six ridiculous fights with the animal. The wheel gives you, I'm gonna spin for my animal here. And, uh, oh, I got Chaos Intern, which is, uh,

Gavin Purcell: oh, I got

Kevin Pereira: parking a lot, chip minutes.

Gavin Purcell: Parking lot. Menace is a goose. So again, play with yours. [00:21:00] Then we'll keep your

Kevin Pereira: recording. I'm gonna enter the bracket here.

So I see the Dangerous Animal Madness bracket. I'm using Chaos Intern versus the Buzzkill Committee, which is a TSI fly Swarm. So let's see if, uh, I can win. I'm gonna zoom to the match here. Chaos Intern versus Buzzkill committee. I'm entering the match. Opponent intent. Clamp down. Attack eight, block nine.

Let's go. Come on. Oh, I have to choose. I have to choose my hand, right?

Gavin Purcell: You have to choose your hand. Yes. You have to choose your hand. It kind of plays out like, um, slay the spire or another game like that.

Kevin Pereira: Well, I guess I'll brace for weirdness, which is a defense move, and then I'm gonna do a wild swing.

Don,

Gavin Purcell: go for it.

Kevin Pereira: Okay. Yeah, yeah. Take that tt fly swarm. Okay. I guess I gotta end my turn now. All right. This is, um, this is actually too complex for me to just shoot from the hip and start and start clicking ga. Yeah. Like, dude, I don't wanna actually lose here.

Gavin Purcell: No. Well, so here's an interesting thing about this.

So, basically, again, this is the first time I'm testing it or seeing it. What's very cool about this. Is, [00:22:00] it's the speed to demo, right? Like that's what we've been talking about here before. The idea that you can get from zero to like this is probably, I'd say maybe 25 to 50% through a game, but the idea that you can play it right away makes a huge difference.

And, oh, dude, I might be. Yeah,

Kevin Pereira: I'm op, sorry. Yeah, yeah. No, no, you go ahead. You go ahead. I'm just op. I'm crushing thistle,

Gavin Purcell: but you get the sense of what it means to like be able to demo something quickly in your brain and just drop it out. This was about a one paragraph prompt and I sent it away. It worked for about a half an hour for the first time it came back and I said, do it a little bit better.

Make sure you're using the imagen tool. It worked then for 45 minutes and came back with this now. It's not perfect yet clearly, but to the speed to demo idea is pretty phenomenal.

Kevin Pereira: Chaos. Intern survives. Choose one card. I can choose an evasive flop panic geometry, or double tap dance. Woo Gavin.

Gavin Purcell: So all of that was stuff that was just kind of prompted in.

Now again, there's gonna be a lot of balancing into game like this. I'm playing a lot of sleigh aspire [00:23:00] too right now. That's part of where this inspiration came from. But like. You get the sense that like you, the person at home, I am dummy. I do not have coding abilities, but the fact that you could spin up a demo like this very quickly and actually get it playable and get it, so, I mean, it's not pretty yet, but like, it's not ugly, right?

Like this idea that like, it's not just like a prototype that looks like, you know, boxes knocking to each other, that sort of thing.

Kevin Pereira: The fact that, uh, there's any graphics on screen. This early in, in what would be a development cycle is wild. The fact that that's deployed and playable and you can share it, is also wild.

And I'm assuming you just told it, Hey, go put this website up on Versal or whatever, and it deployed it for you.

Gavin Purcell: Yep, that's exactly right. So I, even while it was working, I said, Hey, I steered it, you know, I said like, Hey, throw this up on Versal so I can share it with Kevin in the middle of this, uh, conversation.

So again, yeah, speed the demo capability, long form agents, like all of this stuff is finally coming together. Let's,

Kevin Pereira: let's focus in on, on GPT [00:24:00] image too as well. 'cause it's has only been out for a few days. I am, yeah. Um, amazed. At how good it is at certain tasks to the point where, yes, like it has disrupted my usual workflow, which, uh, you know, I'm, I'm working on a feature, uh, for tele right now.

I typically make a PRD. I talk with our designer. I make some mockups, whatever. But now the speed with which I am iterating is, it's almost quicker and easier for me to make the full thing, have the designer anoint it, like make their adjustments because they're, they're better at design than me, but then I go and implement it as well.

Like that. And that just changed this week.

Gavin Purcell: I had a crazy moment. I'm consulting with a, a friend of mine on some stuff for him, and he had an idea, so I spun up. Me, not coder, I spun up the demo. I spun up the design and one of the things you can do with image two is so fascinating is like you get, Hey, give me a, a website what this might look like, right?

Mm-hmm. So you get a file back. But the thing that I did, Kevin, which I was kind of blew me away. 'cause when it tries to implement that file, sometimes it's better or worse at it knowing all the different [00:25:00] elements on the screen. You can ask GPT to image, to, to send you just the elements on the screen so that, like in my thing, it had a really good logo and it had a couple other things that were cool.

I said, give me all that stuff as individual elements. And then you put that in your file and you let it build. You can do it all. It's like a one person shop. It really is shocking. Yeah.

Kevin Pereira: Oh, so I had it do the mockup of this like, uh, uh, product that I'm building basically. Uh, and then I said, oh, uh, go ahead and install hyper frames or use remotion.

In fact. Use both and then make the mockup move like this. Yeah, I want the icons to come in. I want things to highlight, animated, blah, blah, blah. And then gimme like a 15 second video. It went off, this was 5.4, but it went off, uh, and, and did all of that using the GPT, uh, the image two, uh, image. And it looks great.

It like just, it looks like a fantastic little mockup. And I mean, that's like, okay, or whatever. That's me being actually productive. Let's get to the, where's Waldo Games?

Gavin Purcell: Yeah, well that's, there's a bunch of people [00:26:00] making where's Waldo versions with this because one of the things it can do is very detailed, very specific, larger prompts.

There's a good example from Jeff Latish who, uh, made a University of Berkeley anti ai, where's Waldo sort of thing, where there's a bunch of jokes. And then I stole his prompt and used it to make a thing about the NFL draft. Today if you're a football fan, you know, the NFL draft happened. So I had it make one of these things.

And what was interesting for me was. The going, like what we said last time on the show, like the little jokes and little things that ads are so interesting and this image, this NFL draft image I made is so complicated. There's so much stuff going on in there. And now not all of it's perfect. There's a few things that are wrong, but like it's making jokes like a mad magazine sort of thing, right?

Like it almost feels like it's like this giant thing that somebody drove, drew and wrote a bunch of stuff on. It is a shocking moment when it comes to what's possible with that. And then when you compare it and, and contrast it with what you can do with the code, those two things together, just like overpower a person, I feel like,

Kevin Pereira: yeah, I saw [00:27:00] your draft image and I zoomed around and was like looking at things I don't understand.

Like this is me. I looking at like actual code. I don't understand half the references, but I can tell that every little frame is packed. Yeah. Like every little pixel is playing some sort of joke or being part of, uh, like referential humor. I don't what is. I don't even know what some of these things are.

Gavin.

Gavin Purcell: Well, the funny thing about it is like, there's a couple things that gets wrong. Like it it like. One of the teams that gets the wrong team, but stuff like that. But it goes through, there's 10 draft picks in the middle, and it's the actual people. I, I asked it when I, when I created the images, I said like, go find who these draft picks are and put little jokes about each of them in, and some of them have very specific jokes, but then all around the edges.

There are other jokes about what happens during the draft or things like that. So anyway, this is a very fun prompt to try for yourself, for whatever world that you live in. Like, it's probably a good thing if you're a corporate person, like you could do a thing where it's like, make it about my company.

Like it probably knows a fair amount of stuff, you know, and you can make these little jokes a very cool thing to show off. I do wanna say one more thing, Kevin. I, [00:28:00] I sent this image to my daughter last night 'cause my daughter was like, oh, the open eyes new image model is interesting. And she had made a picture of herself and did some stuff with it.

When my daughter was a kid, hopefully they don't kill me for telling this story. There was a character that she created called Mr. Brewster where she wore this kind of white wig and she went around. It was like an old man character that she made. She was very embarrassed of that character. We loved it. My wife and I thought it was one of the funniest things in the world at the time.

She was probably eight or nine. She's always had this thing of like, oh, you guys thought Mr. Brewster was so funny? It was stupid, but I think it's funny. Anyway. I sent her back this image and I said, Hey, you wouldn't believe what I saw Whole Foods. And I made an image that was Mr. Brewster's wonderful concoction.

Like it was kombucha. Yeah. Yeah. And she's like, wait, what is that? And I was like, did somebody

Kevin Pereira: take our name? It looks like a real end cap with all the different, you know, different kombuchas available in a Whole Foods branded appropriately. Yeah. Uh, that's amazing. She, she actually thought that someone made Mr.

Brewsters for a moment.

Gavin Purcell: Yeah, she thought so. Yeah. So my ma my daughter said, I thought you saw this in the store, and my other daughter said, is this ai? Like, it's just an interesting thing at large. So [00:29:00] this is where we're at right now, folks. Alright, everybody, that is it for now. We will see you all next week.

Thank you for joining us. And play around with 5.500.

Kevin Pereira: I still, I still don't have five.

Gavin Purcell: Kevin still doesn't have it. He'll have it soon. All right. Bye y'all. We'll see you next week.