← All Episodes
AI for Humans

OpenAI's GPT-5.4 Is a Beast. But Good Luck Staying King.

OpenAI's GPT-5.4 is the newest state-of-the-art AI model. But for how long? Sam Altman and team have to fend off Dario Amodei's Anthropic in more ways than one.  GPT-5.4 beats humans at computer use, hallucinates 33% less, and costs half what Opus 4.6 does for coding. Meanwhile Anthropic just got bl

OpenAI's GPT-5.4 Is a Beast. But Good Luck Staying King.

OpenAI's GPT-5.4 is the newest state-of-the-art AI model. But for how long? Sam Altman and team have to fend off Dario Amodei's Anthropic in more ways than one. 

GPT-5.4 beats humans at computer use, hallucinates 33% less, and costs half what Opus 4.6 does for coding. Meanwhile Anthropic just got blacklisted by the Pentagon, Claude shot to number one in the App Store, and Dario went on CBS looking like he aged five years. Plus Kling Motion Control 3.0, Grok Imagine Extend, Notebook LM cinematic overviews, Ben Affleck's secret AI startup sold to Netflix, and a device that jams AI from recording you.

THE KING WEARS THE CROWN UNTIL NEXT TUESDAY. THAT'S HOW THIS WORKS NOW.

#ai #ainews #openai

Come to our Discord: https://discord.gg/muD2TYgC8f

Join our Patreon: https://www.patreon.com/AIForHumansShow

AI For Humans Newsletter: https://aiforhumans.beehiiv.com/

Follow us for more on X @AIForHumansShow

Join our TikTok @aiforhumansshow

To book us for speaking, please visit our website: https://www.aiforhumans.show/

// Show Links //

 

OpenAI's GPT-5.4 is here…

https://openai.com/index/introducing-gpt-5-4/

CUA (Computer Use Agent) OpenAI Demo

https://x.com/OpenAIDevs/status/2029620984853188738?s=20)

ChatGPT 5.3 Instant (the emojis are back)

https://x.com/OpenAI/status/2028893701427302559?s=20

Dario Amodei Rips Sam, Calls OpenAI "Mendacious" And Other Things About Trump in Memo

https://www.theinformation.com/articles/read-anthropic-ceos-memo-attacking-openais-mendacious-pentagon-announcement?rc=c3oojq&shared=d3dcdfe79bb39c22

Sam Altman Response To Backlash 

https://x.com/sama/status/2028640354912923739?s=20

Voice Mode For Claude Code

https://x.com/trq212/status/2028628570692890800?s=20

AI for Humans Link Blog

https://www.aiforhumans.show/linkblog

Kling Motion Control 3.0

https://x.com/Kling_ai/status/2029391461880574136?s=20

Solid Examples

https://x.com/maxescu/status/2029223608854434000?s=20

Grok Imagine Extend Video

https://x.com/grok/status/2028571874188259337?s=20

Gavin's Grok Extended video:

https://x.com/gavinpurcell/status/2029423165571776587?s=20

Cinematic Overviews for NotebookLM

https://x.com/NotebookLM/status/2029240601334436080?s=20

Justine example

https://x.com/venturetwins/status/2029592813277683750?s=20

Netflix Buys Ben Affleck's Secretive AI Company

https://variety.com/2026/film/news/netflix-acquires-ben-affleck-ai-filmmaking-startup-interpositive-1236679498/

Spectre 1 Stops AI Audio Recordings IRL

https://x.com/aidaxbaradari/status/2028864606568067491?s=20

The Shape Store Viral Tiktok/IG

https://www.tiktok.com/@a.i.solation/video/7612337372667071774?_r=1&_t=ZT-94RCk4vtqYg

Bilawal Sidhu's Operation Fury Visualization With Actual Geolocation Data

https://youtu.be/rXvU7bPJ8n4?si=n4aoCsgHLQ6bq432

Infinite Favicons

https://www.favicons.art/

Open Source Vibe Coded Music Video Maker

https://x.com/blizaine/status/2029330454122242106?s=20

New AI For Humans Website!
https://www.aiforhumans.show/

AIForHumansOpenAI54AnthropicClaudeCode
===
Kevin Pereria: [00:00:00] Open AI's newest model, GPT 5.4 has landed and it is a big deal for everyone.
Gavin Purcell: We get yet another state-of-the-art AI model. It is a huge boost in coding. And guess what? It costs about half as much as Opus 4.6,
Kevin Pereria: and it turns out 5.4 beats the average human at the little things like using a computer.
Gavin Purcell: Oh no.
OpenAI Employee: When we ask the model to use QA compared to 5.3 codex, it doesn't have to spin up like a new environment to do it. It's more like how you or I would interact with a computer
Gavin Purcell: and maybe even crazier. Kevin Open. I sees no wall and expects intelligence capabilities to continue to increase this year.
Kevin Pereria: Yeah, dramatically.
Gavin, are you okay
Gavin Purcell: Kevin? I, I just need sleep.
Kevin Pereria: That's adorable. It's not gonna happen not this week, because meanwhile, anthropic, CEO was on the phone with the frigging Pentagon, tried to save a $200 million military contract while simultaneously being called quote a [00:01:00] liar with a God complex. That's fun, right?
Gavin Purcell: Plus big unlocks for everyone. New AI video updates from Cling, grok, and even Notebook, LM weirdly.
Kevin Pereria: Yeah, the Notebook LM thing is honestly the most exciting part, which is crazy. Plus, big friend of the show, Ben Affleck, has been secretly running an AI company, which just sold to Netflix. What a week,
Gavin Purcell: all of that and more on this week's AI for humans.
Everybody. Let's get started. I
Kevin Pereria: think Gavin, we have to, you gotta refresh. We have to redo the teases here. Um, it looks like
Gavin Purcell: No, you're, no. Why
Kevin Pereria: open AI just launched a new model. Yeah.
Gavin Purcell: No, come on. No, no, no.
Kevin Pereria: Start the show.
Gavin Purcell: Welcome everybody to AI for Humans. This is your weekly guide to the wonderful world of ai. And Kevin, we got a new one. A new state-of-the-art model has dropped again. We are at maybe at. I'd probably at like seven for this year so far. Is that where we are right now? If you like. I don't,
Kevin Pereria: are we about to summon the boys?
Is it too early? Oh,
Gavin Purcell: not yet. We got yet. Wait a second. Wait a [00:02:00] second. The benchmark boys will come. They will be here. Okay. We gotta, we, we will summon 'em a second, but do break. Get ready to do the
Kevin Pereria: magical whistle. I mean, because we do have a new way to summon them.
Gavin Purcell: We will. Okay. Let's get that a second. So, yes.
OpenAI has a new state-of-the-art model. This is GPT 5.4, a brand new model. And Kevin, it is very good. The biggest thing I think we have to dive into, and we're gonna talk about a bunch of stuff, is that it is very good at computer use. And as you pointed out at the top of the show, it is very good at doing the sorts of things that humans do.
And we are inching ever closer to that ability for AI to do most of what we do. So first and foremost, yeah. Let's just dive into some of the basics in this, and then we can definitely talk about benchmark boys.
Kevin Pereria: First of all, it's 5.4, which is too louder than the last model that most people used. Yes.
Naming conventions aside. For those that actually care about this detail, it's because they want 5.3 to remain for the Codex model. Yes. Which is their, uh, coding model. Uh, it's dedicated for that. This is now 5.4. They acknowledge that their naming [00:03:00] situation is a mess, but if anything, that just makes me feel so much better about Skynet not happening tomorrow.
If they can't get the names right still, we probably don't have to worry. Um, computer use is the reason for the season. Um, humans. Why don't
Gavin Purcell: we, why don't we explain what is that? Tell people what computer use means because I think some people in our audience may understand what that is, but a lot of people don't.
Kevin Pereria: It's, it's, it's what it sounds like. It's not like a fancy ai AI term. There are tests which are, uh, designed to test a model's ability to use a computer. And we're talking like see what's on the screen, maybe hear what's coming outta the speakers, click drag, poke around, type text into a box, close windows, manage them, et cetera.
Um, one of the benchmarks is called os world Humans typically score. That's you and I gathering.
Gavin Purcell: You're doing it already. You're benchmark bullying, but that's fine. Go ahead. What Benchmark Boys do you want to call? Do you wanna sum it up? Benchmarks? Yeah, let's sum it. 'em up. Boys
Benchmark Boys: Bench. Boys Bench[00:04:00]
Benchmark. Benchmark Boys.
Gavin Purcell: Benchmark Bench Benchmark Boys.
Kevin Pereria: Bench
Gavin Purcell: Benchmark
Kevin Pereria: Boys.
Gavin Purcell: We'll have another new graphic that just rolled because Will, I'm making, will make a second graphic here. Yeah. Let's talk about the benchmarks. So this, again, this is just for if you're out there. These are the numbers that help you immediately understand how it is different than the others.
But Kevin, talk about the specific number you were referencing.
Kevin Pereria: And again, like these, these models can be tuned to perform better on benchmarks than they do when you and I try to use them. Yes, the vibe test, if you will, but. Let, let's just hear this out at least as it is on paper. The OS world benchmark is designed for things like I just said, like clicking and dragging.
Um, in this case, uh, looking at a browser, right? Getting a screenshot of a website. I'm very good
Gavin Purcell: at
Kevin Pereria: those
Gavin Purcell: scheduling things. Kevin. I can click and drag with the best of them.
Kevin Pereria: What about scheduling calendar events, Gavin, or sending, sending documents?
Gavin Purcell: I'm in the 50% range on that. Probably 50% range.
Kevin Pereria: What about sending me your [00:05:00] business?
EI Ns last year, Gavin.
Gavin Purcell: Pretty bad at that. I'm pretty bad at that. I'd say
Kevin Pereria: that was a 12 month horizon task, but you eventually got it done. The point is humans normally score 72% ish on this test. That's the average human. This model just hit 75% accuracy. So on the the broad, using a computer thing, it's better than the average human.
It's way better than the average human on things like spreadsheet manipulation, generating presentations. Um, and then if you really want to get me excited, Gavin, the tassels will pop out. We can talk about tool usage.
Gavin Purcell: I don't know. I don't wanna keep those tassels back for a second. Let's actually, let's hear you.
My
Kevin Pereria: tool tassels. You want me to keep those
Gavin Purcell: tool tass? Your to your toe tassels are fine. It's the other ones I'm worried about. They're back in the satchel. So Kevin, all that benchmark boy stuff is good, but actually opening, I took some time to show how this might matter to an. Actual person. They had a website they were creating for a coffee shop, and they kind of showed how computer use can [00:06:00] make a difference here.
OpenAI Employee: So right now it's calling the image gen tool, and it has a smart use of image gen as well, because images take a while to generate. So it's actually doing all four of these images concurrently, which is pretty neat. Now the model is able to check its own work using ku. What KU did here is like open up the image, inspect it, open up the website, also look at it, compare them side by side, and make sure that the website created is as close as possible to the image that could with this update, it makes the work a lot cheaper, a lot more efficient, and also ultimately helps you do better work.
Gavin Purcell: So a, there is a short word for computer use, right? That's what they're talking about. Ua. So computer use
Kevin Pereria: agent? Yeah.
Gavin Purcell: Yeah. So in this instance, you have a computer use agent that is actually. Using the computer to do these sorts of things and kind of manipulating in this instance, photos and different things like that and trying to get a sense of how it will all lay out.
So this is a really interesting way of a real world use case of this, uh, agentic idea. Yeah.
Kevin Pereria: And that the workflow being [00:07:00] demonstrated is, is like one that's been shockingly missing from so many tools. Gavin? Yes. Like you, you know, you've been vibing very hard. Yeah. I can see it in your eyes and we will get to that later in the show.
But if any of the folks out here have ever tried to prompt a website into existence or make a game or anything else, you know, oftentimes, like the scaffolding is there now. Yeah. But the visuals just don't match. It's missing all that stuff. And this is a concrete example of, Hey, I have this site, here's the layout.
Go make it pretty and it yes knows how to do it. A lot of these tools can generate images, but they fail to generate and place them in there. And this is just just a very simple workflow that's so impactful.
Gavin Purcell: And I think one thing that's really important to point out here is we've talked on the show before about how like coding is the kind of opening door to all of this, kind of like a GI stuff where we need this stuff to do real world things.
And the reason why is 'cause the better it gets at understanding how to write code or read code or do stuff with the computer, the, the smarter it gets at making things that we actually might want to use. Right? So. [00:08:00] Each of these models has been a pretty big step change in terms of actual coding from, you know, the early stage, like kind of Opus 4.5 was when it started to feel like it was all really clicking, and I think that's the moment, kind of end of year last year where it was like, oh, you can just make something and it works.
Finally, now we're getting these steps where. We're improving with each of these steps. And Kevin, I'd just be, it's March 5th, right? It's March 5th, 2026, which is crazy. Um, and that feels like we've got a long ways to go until it's really perfect, but we are actually seeing stuff that really makes a difference now.
Kevin Pereria: The, it's, the line is still very much going up. I, we have not hit a wall. We might be another, uh, algorithmic breakthrough or three away from unlocking this a GI massive super intelligence future that we're all, you know, patiently waiting for or impatiently filled with anxiety over. We're not slowing down.
And, uh, other criticisms, Gavin used to be, well, these things you can't trust, right? [00:09:00] Because they're just gonna make things up hallucinations with this model, according to open AI's benchmarks, again, they're down 33%. That's a huge
Gavin Purcell: deal. That's a massive deal, right?
Kevin Pereria: It's a big draw deal. And even as something as critical as like legal document review, there's a, an a, a AI company out there called Harvey, which a lot of lawyers use to review their legal documents.
Um, they say it scored a 91% on there. Big law benchmark. Oh,
Gavin Purcell: big law.
Kevin Pereria: Big law. Big law's. Got a big cowboy hat. Big six shooters, and a massive gavel baby.
Do
Gavin Purcell: you know who has to be in the big law courtroom? Is wonky walrus? I bet he's got a lot of court. He's probably got a lot of things going on there. Often.
The
Kevin Pereria: actually wonky walrus has P priors, uh, Gavin.
Gavin Purcell: Oh no, what did he do? Oh, believe that Will, that's gotta be ble. We're not putting that
Kevin Pereria: wonky. Walrus was caught in a seven 11 parking lot.
Gavin Purcell: No he wasn't. Three
Kevin Pereria: times. Three times
Gavin Purcell: all. So a couple other quick things about this. Uh, yeah, it is [00:10:00] faster and uses fewer tokens.
So that is a big thing when you know you're vibe coding something or doing anything. Tokens are how you, especially if you're working with an API, but even if you're working with a max plan or one of the larger plans, it means you'll be able to get more done with less, which is a big deal. There's a 1 million um, token context window, which equals I think what Opus 4.6 now has.
So this allows you to have a much longer code base. As Kevin mentioned, I've been doing a bunch of vibe coding. We will talk later on about like what I've been doing with Fourex, and also what is annoying sometimes is when it has to reset that context window. At this point, I'm like, give me a 5 million context window.
Like, I just want the longest possible context window I can have.
Kevin Pereria: Right? Can I take the tassels out now for tool search?
Gavin Purcell: Take, take the tassels out. Take the tassels out.
Kevin Pereria: One of the nerdiest things, but probably the most exciting to me whenever an AI agent has to go and use a tool, um, any sort of integration, yes, it's check an email, uh, book a thing, uh, make an image, et cetera.
Usually all of the tools available are dumped into that context window and it [00:11:00] slows things down. It makes the prompts bloated. It, it ends up costing you more money. Um, and again, the the time thing, like when I say it slows things down, it can really bring things to a grinding halt as it uses tools. This model reduced token usage by 47% with no accuracy loss.
This is again, what they're reporting. It just came out. You and I are probably gonna get. Real hands-on with this thing over the next, yeah, 72 hours or so and see, but, but that's a big deal and it doesn't come cheap. I should mention yes, 'cause this thing is more expensive. Input tokens went from a dollar 75 to $2 and 50 cents per million output from 14 to 15.
So it is a bit more expensive. However, OpenAI is claiming, Hey, we made all these advancements, we use way less tokens. So overall it should be cheaper
Gavin Purcell: and
Kevin Pereria: again, on how that's gonna play out.
Gavin Purcell: Yeah, and Dan Shipper from every who had a little bit of early access to this is comparing it to Opus 4.6 and he says it's half as much.
So when you think of the edge case coding stuff, [00:12:00] that's a big difference right now. We will see how this all plays out. Like one of the interesting things I've been tracking over the last bit is like, you remember like last November, Kevin, when it felt like Gemini was gonna run away with the AI world models and it became a big thing.
Yeah. A lot of people have said since then that Gemini's new models feel a little bit more too, too benchmarky adjusted that the benchmarks may not be feeling as much like the real world use cases. And I think a lot of this, uh, going forward is gonna feel like, is it making a difference for you? Right.
The person who's using it. Because I don't think these benchmarks other than like a sense, a sense of tracking about what it looks like over time, like. One of the things we've talked about with benchmarks in here is the A GI ARC two benchmark, which this is, again, has a good number on, but it might also be what they call saturated, which means that the questions have been seen by a bunch of these models.
It's easier to answer them in a bigger way, right? There is an Arc, a GI three benchmark coming out soon. In fact, by the way, this is the funniest thing. There's gonna be like a party for the release of it in San Francisco. So if you're living it up in San Francisco in this world, [00:13:00] it's a whole new thing. I wanna mention one other thing about that's really important, and I mentioned this, was mentioned this in the tease, is that Nome Brown, who is one of the best researchers at OpenAI, he's a really smart guy to follow at Polynomial.
Um, he tweeted GPT 5.4 is a big step up in computer use and economically valuable tasks. We see no wall and expect AI capabilities to continue to increase dramatically this year. And I think just to sit on that for a second. We are following this every week, and maybe you're a listener reviewer who comes in every week.
Maybe you're somebody that just pops in when a new model comes up. I think the thing that's important to take away from here is we have now unlocked that thing where things are going to be moving a lot faster. And you can imagine, Kevin, the model that we're gonna see in September from these companies being way better, right?
Like, like significantly better on a lot of this stuff. And again, we don't, I'm not doom scrolling or doom posting or doom. Fucking sorry, will, you can beep that. But like you have to prepare for a [00:14:00] world where you are living in a space where these models can do stuff that you didn't think they could do a year ago.
That's just an important thing to be aware of.
Kevin Pereria: I, I fully agree, but I sleep. Soundly at night, Gavin, knowing that the best in class, most capable, most token efficient tool is the one that's going to make the decision whether or not to drone Strike Me. Oh,
Gavin Purcell: do
Kevin Pereria: you? Or ing,
Gavin Purcell: is that right?
Kevin Pereria: Yeah. I mean that's a, because like, listen, you know, the Department of War has picked a winner.
Here it is OpenAI. Are we ready to transition to that off this discussion or you wanna talk
Gavin Purcell: about a few more things? Yeah. Let's transition to that. Let's transition to that, um, the very fast before we move on to that, because it is a very important conversation. We have to talk about it. There was another model that dropped early this week from OpenAI, a 5.3 instance.
So they decided to go with one other model, 5.3 instant. This is their essentially gonna be driving their free model. Um, the emojis are back in this model, which is interesting and supposedly it's better at writing. It was not that good at my Pacman test. I keep trying to get them to write better, uh, Pac-Man pen.
So Pacman isn't
Kevin Pereria: good.
Gavin Purcell: I [00:15:00] mean, I don't know how we do this, but at some point, one of these models, even the new one, 5.4 is not very good at it. In fact, it came up with my favorite. Pacman 5.4 came up with my favorite Pacman fun of all time, which I texted you right before this. Did you see this? It's, it, the, the pun that it came up with is.
Pacman's autobiography is called Eat Pray Waka. That was the one it came up with. So anyway, there's still not that great at coming up with really funny things. Okay. Yes. Let's transition to, to more, let's transition to this scary conversation. Actually, before we transition to that, please support your local podcast.
We are an independent production and we make stuff for you if you like and subscribe to this YouTube channel that's doing a lot. But we've also provided you many different ways to open the door to give us money. That's the most important thing you can give us money through. Tell me about
Kevin Pereria: the tote bag.
Gavin Purcell: Well, there's no tote bag yet, but maybe soon at some point we promise merch and we will get merch to everybody. Maybe we can set our agents agents to that. There is a Patreon, which thank you very much. Many new people have joined, which is exciting. [00:16:00] And Kevin just opened a Buy me a coffee, uh, on our website at AI for Humans Show underneath the uh slash pod command, which you can see a lot of fun songs that we are going to update this week.
We will talk a little bit more about that website later, but you can support our podcast there. We appreciate every single one of you. And now Kevin. It is time to move on to probably one of the scariest AI conversations, um, we've ever had. This was kind of brewing last week and it kind of, no pun intended, blew up over the, the week and weekend.
So this, uh, maybe you wanna do the kind of set the stage thing and then we can kind of get a little bit into what's going on right now. Well,
Kevin Pereria: yeah, there, listen, the. Anthropic is one of the few companies that before, I guess last week, was doing a lot of work with the government. They had their models running on a secure cloud where government agencies could access them and do all of the things that government agencies want to do.
Right? Um, analyze mass communications, look at satellite imagery. Uh, drop up war plans, [00:17:00] simulate war games, et cetera. I dunno, probably play five d, chess, whatever they're into. Maybe they were just vibe coding, snake games. I don't know what the government does. I don't pretend. I guess maybe Document redaction was high on the list.
Sure. Point is Andro was the go-to, but they had two red lines, two red lines, which existed under the Biden administration, which the Trump administration also agreed to. One was that their models could never autonomously pull the trigger. Basically there had to be a human in the loop. Before one of their models was, um, involved with deciding to take down a target, right.
Human or otherwise. So that was one and two was mass surveillance on US citizens.
Gavin Purcell: Yes,
Kevin Pereria: those were the two red lines,
Gavin Purcell: both pretty good lines. I kind of agree with those lines mostly like right. Not bad lines. I
Kevin Pereria: have. I have done talent agreements to like appear on stage, which have more red lines than using AI with weaponry.
So yes, I'd say those are two good red lines. Maybe there should be 200 of them, but that's yes. You know, outta my depth. Um, the [00:18:00] government, our present government did an abrupt about face and basically said, remove these two red lines. Oh else. Uh, yes. And there was some fa there was some fo and that's the summary.
Are we good?
Gavin Purcell: Well, that's, yeah, that's that. So there's a little bit more than that. The big part is it was there's a lot more that that, that there's a lot more than that. That, yeah. So, but, and I don't wanna rehash this for a lot of people who are probably familiar with it, the very quick TLDR of it is that a tropic said, we are not gonna change that.
Uh, then they got into kind of a fight with the Secretary of War, Pete. He, Pete. He then went out and said that they are a supply chain risk, which just today has been updated to, they're gonna hold that up. So this is a big problem for Anthropic because what that means is they cannot have government contracts, which also might make them unsecure for a lot of other company use cases.
Kevin Pereria: To point out real quick, the gravity of that thing, this has never been done before for a company like this, like for
Gavin Purcell: in American companies.
Kevin Pereria: You've been
Gavin Purcell: done for an American company before.
Kevin Pereria: Yeah. And, and it's not only that they can't have government contracts, it's pretty much anybody that does work with the [00:19:00] government can't use Anthropic products either.
Yes. So this is a massive hobbling.
Gavin Purcell: That's right. So then, uh, Dario went on CBS and did a interview that kind of blew up and had him, in fact, maybe we can play just a tiny little bit of that interview, because first of all, there's two things to know about this one, if you're not watching it. Dario looks strangely skinny.
Like literally this has been holding and pulling weight out of him, but two. He is pretty much holding a line here.
Journalist: If you had a moment with the president right now tonight, what would you say to him?
Dario Amodei: You know, I, again, I would say we are patriotic Americans. We have done, everything we have done has been for the sake of this country, for the sake of supporting US national security.
Our leaning forward in deploying our models with the military was done because we believe in this country, we believe in, uh, defeating our autocratic adversaries. We believe in defending America. The red lines we have drawn, we drew because we, we, we, we believe that crossing those red [00:20:00] lines is, is contrary to American values.
And we wanted to stand up for American values.
Gavin Purcell: So you get a sense there, like he's taking a pretty significant stand. Now, Kevin, the crazy part of this story, which you know we didn't cover last week, but everybody has seen now, is that Sam Altman and OpenAI then came in and made a deal with the Department of War.
That is a very mixed conversation around like where did they cave and where did they not cave based on where philanthropic stand. And Sam Altman has dealt with a lot of internal and external strife about this. Agreement. It sounds like at this point, uh, here we are on Thursday, that they are kind of pushing back.
Even Sam is pushing back on this. More importantly, I think the bigger thing is that, you know, the US government has reiterated this supply chain risk situation right now. Even though there's also a story that Dario is renegotiating with the Department of War, so, right. It is a very loose situation right now and we're not exactly sure where it's going to end up.
Kevin Pereria: And honestly, like we didn't really get into it last week. Um. And some of that is my fault 'cause I was too busy vibe [00:21:00] coding dumb songs into an iPod. But you know, the, the tendrils in this run very deep with like super packed donations being made by one side. Yeah. And, you know, bending the knee or ring kissing as some people would observe from one company Yes.
Versus another. Right. And, and maybe that all factors in here. Maybe this is all again, a big five d chess move. Maybe it's not. I like, we can really, really get in the weeds about this, but where does this. End up though because. Like to the end user, right? The, these tools might be used or, or the end user, I say as our, our listeners, our watchers, right?
But to everyone out there, myself and, and you included, yeah. These tools may be used to surveil you. These tools may, uh, someday be used to suppress you. Yeah. One of these companies is going to be providing that. On the other side of that, does this send a chilling effect to any company? Doing anything AI related in the us you know, or anything tech related where it's like, well, hey, you really better be able to play nice.
And is this anything new or [00:22:00] is this just being so, um, um, so, so blatantly communicated and, uh, you know, and, and argued in the public. Is this new? I mean, uh, is this really unprecedented?
Gavin Purcell: I think this is unprecedented in the fact that like the biggest issue is that it's the public and the government all waking up to how important AI is going to be going forward.
And I do wanna call out the fact that Dario, Dario like released an internal memo where he kind of ripped open ai, a new one called them Mendacious, which is like kind of like not exactly truth telling, and also kind of said that the reason why, to your point before that they're kind of in with the Trump administration is that.
Perhaps there are people internally at OpenAI and there are Greg Brockman. The president donated money to the Trump campaign, so this is also like a lot of like internal politics between these two companies. Now, granted, I think the most important thing for our viewers to understand is like. We are also in a one level up war, let's call it, not, maybe war is too much.
'cause I, I know obviously there's a terrible situation [00:23:00] happening in the Middle East right now that a lot of people are suffering. But we are in a one level up kind of cold war around AI with other nations. Right. Specifically with China. And there's an argument to be made here, and I'm not saying about this administration, but in the American government at large that perhaps.
The government should be able to control exactly what of these things are and what they do. Now, I'm not gonna make that argument myself, but you could say that if a private company has the ability to say yes or no based on a technology, then that does become a very big lever for nationalizing that private company, which means that the government takes control.
In fact, just this week. Uh, Palantir, CEO, Alex Karp had a, a pretty, uh, uh, divisive comment, let's say, and used divisive language in it, mentioning this specific idea that they would get nationalized if they continue to ref, uh, to refuse this. So, Kev, this is also very sci-fi weirdness, right? Like the idea that you have a company that has a piece of technology.
That is so [00:24:00] significant that the government, they're afraid to give it to the government. This is like, it has echoes of the nuclear race and I know that Dario, I found it this week. You show this that Dario gives everybody who starts at Anthropic the book how, uh, the atomic bomb was made because he believes that like this thing he's working on has that same level of importance.
So anyway, this conversation's gonna be ongoing. The bummer is I really do think Claude is. About to maybe win. And that's partly why like OpenAI is racing these updates out. Because if you haven't noticed, first of all, when all this went down, Claude is rocketed to the number one app in the app store, which it has never been before.
Katie Perry is now a Claude Max user, which is also, she's part of the cloud resistance.
Kevin Pereria: Listen. Anecdotal, but I, uh, three people that I know canceled their OpenAI subscriptions Yeah. And immediately signed up for Claude. I don't think that that is, um, a single tier in the drop of a bucket in the sea of money that OpenAI will make from the government contracts.
But still people are waking up and [00:25:00] canceling GPT.
Gavin Purcell: Well, and you say that, but there's like actually some really interesting charts coming out about like where the actual money that is getting made by these companies and anthropic is catching up significantly. Yeah. Anthropic is making about. $19 billion a year right now, which is a lot of money.
And OpenAI just raced out and said, well, we're making 25 billion a year. Now. People have said the fact that OpenAI has all these consumer users and they haven't turned ads on that, that might take that stick and go even further up. But it is a big deal and I think they actually, this is a good, I mean, let, we should just land this, this plane here a little bit and just make sure we're on the same page.
This is the sorts of stuff that we've been talking about kind of existentially that we are worried about with AI from the very beginning of the time we've done the show, right? Like this idea that. You're gonna have to ask big questions, and if I am a part of a government that is fighting another government, even if I don't agree with that fighting.
What, how do I respond when I, when my government wants to use the best possible tool in order to save the people that are defending our government, [00:26:00] right? This is a weird, big questions that people have to ask themselves.
Kevin Pereria: And here's the other fun one, could you. Frigging imagine the Manhattan Project taking place in public with like three or four, let's say 12 different companies going after it.
And TikTok exists at the same time. Yes. Yes. Could you imagine what social media would be at the same
Gavin Purcell: time or, or under this administration, right, which is another Sure. The thing is like pure chaos, the idea that like you get into these public fights and that the administration people. Air, their dirty laundry in, in public, in your point.
Like, no, it's horrible. It's a terrible situation for this to happen in
Kevin Pereria: that, and that's a channel that we need to make, by the way, is TikTok influencers reacting to things as if the Manhattan Project is being built in the real time. And, oh,
Gavin Purcell: that's a
Kevin Pereria: good
Gavin Purcell: depression. I don't, that'll be amazing. SEO it'll be like, it'll be 10 for 20 years, but
Kevin Pereria: hey Kevin, there's so many more exciting things happen.
Gavin, I'm glad you're getting what you voted for. Let's move on. Yes, thank you to talk. Thank you, Claude. [00:27:00] Code advancements because uh, let's pour one out for the startups. We need a graphic for this. Yes. We need to pour one out every week for time. Oh,
Gavin Purcell: that's a good idea. Idea. You pour one out for everybody of the startups.
Yeah, voice mode. I would actually,
Kevin Pereria: and even,
Gavin Purcell: even before that. Yeah. I wanna talk about a new term that I've coined that I believe is gonna take the world by storm, which is CLA rotting. And this week Kevin, I CLA rotted, which CLO rotting is basically, do you know what bed rotting is? You're familiar with that term, I assume?
Kevin Pereria: I do. I do know and brain rott as well, but I don't like. Claude Rod
Gavin Purcell: Claude rotting is I, I took a TikTok on myself just to point it out. It's when you are, you
Kevin Pereria: caught Claude Mid is I think what happened.
Gavin Purcell: No, I did not. Caught Claude Mid Claude mid is not what this is. Basically, this is spending too much time doing Claude code and not taking care of yourself.
So I spent a bunch of time this week. But yes, there's some big updates. Uh, voice mode is coming to cloud code. There are a bunch of new cloud code skills that are coming. So Kev, this week in my cloud code update, I added a link block to our new website, AI for Humans Show. Mostly because I wanted to do something that was 20 years old and just en enjoy a process of it.
It does have an RSS feed for you and for [00:28:00] your agent, though, if you want to go to it. But I think this is an important thing to understand, is that. Claude, there's something about Claude for me, and I dunno about what you think about this for you. I don't know what it is, but like maybe it's just literally how it's talking to me or what I'm used to, but like I do find it better than even using the Codex app.
Are you having that experience at all? Like, do you feel like Claude has like something. I don't know. I'd never had this before and it might just be that I'm starting to use the terminal window stuff, but there is a weird connectivity that I've just started to use. Use to talking to. I feel you
Kevin Pereria: pair bonding in a way that you haven't since GR launched there.
I haven't. Anime babe assistance. Gavin, this is really nice. You have to twinkle back in your eyes. Yes. This is
Gavin Purcell: me and
Kevin Pereria: Claude. Me and Claude is my daily driver. Full stop. I've got multiple terminal windows open that are running instances of Claude Code. My Claude Bots are all powered by Claude Code. Yes, they have access to open AI models as well.
'cause I bounce them off of each other when Claude gets stuck. But it is my daily driver. I use Claude Cowork to plan some trips and travel that I have coming up. I use it to make, uh, PowerPoints [00:29:00] and slideshows like it is still my number one. So whenever they release a new something, I'm, I'm all for it. Um, and I like the link blog by the way.
People should don't sleep on it AF News Show slash link blog. It's really
Gavin Purcell: cool. And by the way, like yeah, it's very cool to have and it's a, what, what's interesting about it is I used it as a way so that I created a iPhone shortcut so that I can easily like send a, a link from anything to the link blog.
It'll post automatically. Same with I have a bookmarklet on Chrome or Safari where I can add stuff to it. Those are the little cool things that you can do. And one of the things, Kevin, that's really interesting about, now that I'm doing this on a regular basis, and again, you and I talked about this earlier, that you were maybe like six months prior to this kind of starting this process for yourself or even earlier than that.
But now that like it's easy to do this, like I'm just got a terminal window up all the time, and if a, if a new feature comes into my brain for the website, I'm like, oh, let's. Explore that, right? Yeah. Let's go try this thing, and it just goes, and then you're just having this back and forth. So again, all of this talk about Claude and hopefully Claude will not go [00:30:00] away.
We are hoping that the government and Claude can come to some sort of agreement that we can move forward with Claude. You should try all of these models, but specifically, there is something special about what's going on with Claude right now. And now that they're adding voice mode, I feel like they're gonna continue to grow in use.
So it's a good one to show your friends and family as a trial,
Kevin Pereria: um, not just Claude, several companies releasing updates this week that people can get their hands on. Um, some incredible new features. Stuff that, uh, you know, here, here's the wash, rinse, repeat. Um, stuff that took us hours or days or weeks to do months ago, which are now instant and within a single app.
Um, cling Motion Control 3.0 is something that really caught my attention.
Gavin Purcell: Yeah, so I mean, this is their next version of motion control. You and I have talked about this sort of thing since runways Act one and Act two. This can basically, you can pop it a video or add a look to a video if you want to based on what it is.
It's very good in general, like it does a very good job of it. Um, they have some examples on CL site.
Kevin Pereria: Yeah, I mean, listen, the ability to [00:31:00] record yourself and transform your performance into or onto any character in a scene is massive. And yes, there's been some other tools that allow for it. This one looks.
Incredibly accurate, very nuanced, captures the detail of the performer's face, um, and it transfers it really well to the scene, including lighting and shadows and everything else. So here's, uh, uh, Alex Petrosky or Max Escu on, uh, x posted some examples of using audition tapes as an input. And then, uh, so basically if you're on the video version, you'll get it.
If you're on the audio, someone is just delivering a dynamic performance and it comes out amazing.
AI Video: Helena will never love you as I could. But you'll love her nonetheless,
and you will tell her every day that she is beautiful and she'll believe it because she is.
Kevin Pereria: Okay. Audio only. I know that that was a weird as SMR moment video Friends, you saw, uh, a performer, you [00:32:00] saw the face transfer to a completely different character. Um, physics in the dangling earrings. Yep. On the really good, the ears of the character, right.
The, this off shadow of candlelight. Um, creating shadows across their face, which work in the scene. And then if you watch the rest of the clip, you see them kind of just change the character over and over again, and it just works.
Gavin Purcell: So, Kevin, I did it. It mostly just works. I decided why not try to take, why not try to take one of the most viral videos this week if you missed it?
The McDonald's CEO tried a big arch burger Yes. And kind of failed miserably at eating it. Um, it, it's a very fun video to watch. He's very kind of stiff. So, Kevin, I tried first. To take this, to take this video. And I tried to, I tried to make the, make him into a woman eating the burger. And what I didn't realize when I first did this is I, I gave it, it, I, it somehow used him.
So it made this woman that kind of looks like him, like eating the burger. So play this one first.
Big Mac CEO: Some crispy onions on here as well. See those kind of coming out. Alright, the moment of truth.
Kevin Pereria: Now, okay, before [00:33:00] I'm stopping at five seconds in. Uh, yeah, because there's, it looks like it's like a dog with an Instagram with the tongue hanging outta the mouth.
So far de despite it being an alternate version of the McDonald's CEO, with the red hair and the, uh, sports bra. Yes. It did a really good job. Like I thought it, it captured the look and everything else. I'm about to get to a big bite moment though.
Gavin Purcell: Just wait. Just wait. But the biting part, you'll see.
You'll see.
Kevin Pereria: All right.
Okay, so the model doesn't understand eating. That is
Big Mac CEO: so good. That's a big bite for a big arch.
Kevin Pereria: Okay,
Gavin Purcell: well, so what did you see? Did you see there's so, so, okay, so what's going on here is in the actual video, that's when the, the guy shows the burger to the camera and it did not know what to do with that at all.
So that is invented the hand. That is
Kevin Pereria: horrific. Jump scare.
Gavin Purcell: Okay, wait.
Kevin Pereria: Horrific. That was a ring style jump scare. Yes. Because a hand just appeared outta nowhere and the model started. Flailing like mad.
Gavin Purcell: So wait, now go to the next one. So I wanted to kind of try to create an image of a [00:34:00] woman who will we're gonna use in a later video that would be doing this instead.
Right? So I created a woman with red hair and a fancy restaurant. So I did this. So play this one and it's a similar sort of problem, but you'll see at the end as well
Big Mac CEO: some crispy onions on here as well. You see those kind of coming out. Alright, the moment of truth.
That is so good. That's a big bite. A big
Gavin Purcell: no. You see that one goes like, kind of like a, like a crazy thing at the end. So I don't like, so what's so funny about this don't, like, that's like
Kevin Pereria: a left for dead enemy when it
Gavin Purcell: gets agitated. That's, so if you, if you're not seeing this, what basically happened in the video was like, again.
The CEO holds the burger up to camera. The video model doesn't know what to do with it, so it's kind freaking out. But both of these instances, I thought the talking actually looked really good. The thing that didn't work very well was like the biting of the burger, right? Like each time when they, the CEO took a bite.
Now the CEO E took a weird bite, which we all know that's why it got famous, but it took a bite and it kind of mistakenly did something. So with interactive stuff is [00:35:00] not that great. Talking wise though, really pretty good, I think in general.
Kevin Pereria: Yeah. I think for, you know, obviously dynamic scenes and dynamic motion, maybe not there, but for the intimate, very important to capture the nuance of someone's face.
I think it's, it's just getting better. Again, we did a launch video for our startup months ago. It was a little painstaking to bolt on multiple services to get this out, like these types of performance outs. And now it's like, okay, yeah, here it is. So if you want to go and. Um, I don't know. Launch your OF, uh, as your alt If those words mean anything to you, here's your chance.
Or if you want a new spokesperson for your business or you wanna make UGC content. Here you go.
Gavin Purcell: Clean control. You're talking about open fries. Open fries is my favorite new startup, right? Mm-hmm. Like that's a robotic, uh, fry making startup. Anyway, let's move into gr Speaking of OF, let's talk about grok.
Imagine, extend. So grok, imagine, extend, is a way to extend your grok videos. Now, grok, we've talked about a lot on this show. It's got some interesting ins and outs and, uh, I definitely experienced [00:36:00] that when trying to use it yesterday a little bit. The, if you're, if you're all interested in spicy stuff, rock still got it there.
I didn't even mean sometimes for stuff to come out and it came out and I was like, wow. Well there it right is anyway. Roc Xtend allows you to take a video and basically extend on the end of it and add stuff on. Now, Kevin, I will be honest with you, in general, this feels like a kind of thing that we have seen before in other video things, but mm-hmm it does a pretty good job of understanding where it was at last.
I took a video of that same woman that I used in the McDonald's one and I, I kind of made her go into a restaurant on a date, play this and you'll kind of get a sense of like, there's a little bit of a degradation over the course of it. Yeah. It's about 30 seconds. You can go to 30 seconds long. What,
Kevin Pereria: just so I know, what did you start with?
Um,
Gavin Purcell: exactly. So I started with the image of the woman at the dining room table. So the very first it just a still image. Image, it was just a still image and I uploaded it, image to video and I said, animate this. And I, I did give it direction as it went along, but not a lot.
Kevin Pereria: Great. Alright, here we go.[00:37:00]
She is sipping a glass of wine, smiling at the camera in walks, a waiter, a delicious tray of tins of cat
Grok Extend Video: food. I love this brand. It's so delicious and nutritious.
Would you like to try some fancy feast tuna Taco time? John here. Take a bite.
Seedance Video: Take a, I don't know who you're talking to, Carol. There's no one there.
Grok Extend Video: Leave the fancy feast tuna taco time and ski at all.
Gavin Purcell: So that's 30 seconds. It was my, I did invent, uh, fancy feast tuna taco time, but mm-hmm. What's interesting about that is the voices in gr are still not amazing.
And you, you know, the audio at the very top, the first animation where you just hear the piano playing was actually quite nice, I thought.
OpenAI Employee: Yeah.
Gavin Purcell: But in general. It's a little tricky because like, I don't know exactly what I'm supposed to do with this thing. Right? Like it's like I get that it's very cool that you can extend this out and there's ways you could extend it, but you'll also notice that the faces shift a little bit over each generation.
Yeah. And again, it's like. [00:38:00] Timing wise was okay. It's fine. Like I think there's a lot to be done here. And by the way, I, no shade against Grock. Imagine. 'cause it has gotten way better, like actually as an image model is very fast and really good. I just am not sure that Xtend video does a lot where you, I think it's gonna be more of a shot for shot world going forward.
Kevin Pereria: I, I think it's interesting, like the, the physics on the wine within the glass are really good. Yeah. Um, her reaching around the waiter's arm with the fork to, to kind of fork, it's, it's a bizarre movement, but it's one that actually kind of makes sense within the framing of the scene. So there's some stuff here that's actually like, very impressive and then some other stuff that would make it again.
Yeah. To your point, like. I dunno if this is production ready because the faces shift a bit.
Gavin Purcell: So just an example of like kind of how much better sea dance is in some ways is I, I took a 15 second clip of the same image and I gave it a very specific prompt. But again, this is not about extend, but this is 15 seconds so you can watch this, uh, and played out.[00:39:00]
Kevin Pereria: Big burger bite.
Grok Extend Video: Hey, wait, what is this?
Gavin Purcell: Geez, lady. So there's a little bear in the burger. It what's amazing to me though, Kev is like, you hear the burger bite you, her voice sounds like a natural voice. Like these are the kinds of things that like state-of-the-art video is doing right now. And I think that grok is maybe not there yet for everything out.
Side of just generation of image or video.
Kevin Pereria: So it's not just grok, it's not just ance. Uh, Google released a new video, something this week, um, but it's not the same type of experience. This is different enhancement. Yeah. To notebook lm, this is their cinematic overviews. In the same way, you could go and drop a bunch of links and create an interactive chatbot, a flash flashcard game, an audio podcast to go over whatever your content is.
Now Google will stitch together these video overviews that have. Imagery and they have animation and they're very kinetic, and they have soundtracks and good voiceover, and it's kind of a one [00:40:00] shot explainer that if I were a YouTube channel, especially a faceless one, I might be a little concerned.
Google LM Video: The idea of a limit is one of the most powerful to tools in all of methods.
Gavin Purcell: Okay, stop it. That's enough that. Here's the thing, Kevin, I would not be concerned because I think this is not very good, and I don't mean this in a bad way. I'm not saying I'm not. I'm not, I guess I do mean it some. Here we go. Yeah. How do you mean it? Here's the thing, when you watch these, it feels like an AI made these things right?
Like the graphics feel like AI made them. They're not that interesting or dynamic as to what I'm seeing. And yes, is it video to words? Are they connecting a vo VO three videos to words? Yes, but like. I, I am a believer that faceless YouTube channels can work if you're having interesting conversations and interesting things that are happening behind the scenes.
But this iteration of it feels very much like the kind of thing like your seventh grade teacher would put together. And I just don't know how compelling that is to me. Do you know what I mean? When you look at, sorry, the seventh, look at the [00:41:00] graphics.
Kevin Pereria: The teachers out there graphics, please don't, uh, don't stop
Gavin Purcell: supporting us.
I said my seventh grade teacher, Uhhuh, not generals, a seventh grade teacher.
Kevin Pereria: Okay. Uh, Mrs. Mrs. Basile, please don't withdraw your Patreon. Gavin didn't mean it. I What was her name? It doesn't matter.
Gavin Purcell: Mrs. Buse, is that what you called her?
Kevin Pereria: Basile? It's a name. Um. Look at the, look at the Justine Moore example of the Disneyland one that she put together.
I thought that it was making smart choices about forced perspective. It was making interesting choices about, uh, when it was cutting the shots or whatever. Is it early days? Of course, but
Well,
Gavin Purcell: that's what I'm saying. I
Kevin Pereria: could,
Gavin Purcell: yeah. Yeah.
Kevin Pereria: Yeah, I can absolutely see this being like a great starting point. And I, the number of times I have gone to Notebook LM to digest big topics lately and generated like the, a podcast Yeah.
Is, um, non-zero and it's getting higher each and every week. I would opt for this if it could get to just as quick, might as well have a little video overview and have that playing. Um, as well. So I don't know. Yeah, I think this is the, the beginning of a, a [00:42:00] very interesting product.
Gavin Purcell: I think that's an important thing to be aware of is that like, I think it's the beginning stages of it and, you know, maybe there is a world where this, and as everything we've talked about in the show, like in six months, it just gets way, way better.
Um, and it will get compelling. But right now, I think when you watch it, it still feels like it's not there yet for me. In the same way what happened with Notebook LMS podcast? I get kind of bored by that podcast. I don't, I don't love the way that it kind of always feels the same. I, after a while, I just was like, I don't want to hear.
These two same people digest it all the time. Like it just felt to me like I
Kevin Pereria: was, well, you do understand our audience.
Gavin Purcell: They at least we are making weird choices and we might feel differently about each other. Week to week. We might have different passions about one thing that starts to feel like two robots who are having this same exact feelings towards every piece of content that comes in.
So that's my worry with this sort of stuff is like, I think I would rather see a less interesting graphic. With a human choice behind it, maybe. I don't know. This will be interesting to see, to see how that works.
Kevin Pereria: Well, if Ben [00:43:00] Affleck has its way, Gavin, there won't be any creatives making any decisions because Ben Affleck sold out the industry, which he supposedly loves so much, and he's been working on an AI company because he hates water and he hates.
He hates. We got a good, the untouched land.
Gavin Purcell: We got a good clip out now that's going right up on TikTok, just that alone. It'll be fascinating. Yeah. So, yes. Uh, a variety and a bunch of other places are reporting that Netflix of all places has bought a company called Inter Positive. And this is Ben Affleck's, secret AI startup.
That is not making AI models, but specifically Ben and and did this company so that he could find ways so that on productions doing dailies and things like that, there would be ways to use AI tools to help productions shoot better and more efficiently. So. You know, this talks about the idea of AI jobs.
We dis, we discussed on the show where Ben Affleck actually discussed, I think it was, it was at, uh, one of those conferences where he talked about AI and he was like, had a pretty interesting take on it. He [00:44:00] didn't think it was gonna replace the creative process, but it would improve the, the actual process of it.
So this leads into that. I did see Kev, somebody mention in the background that like. This deal maybe wouldn't have gotten done if the Netflix Warner Brothers deal had gone through because they had spent a bunch of money. Sure. So maybe Netflix is out there gonna kind of snatch up a couple things like this.
But overall, I think it just shows you that like Hollywood has been doing stuff with AI and just not being very vocal about it,
Kevin Pereria: point out a few things. Um, they built a custom data set like it's a 16 person company that actually built a full production set and shot. A ton of footage. So they could probably tag, here's what the lens is, here's the light temperature that we chose, here's the distance and the framing, and the blah, blah, blah.
And they made their own model. And the idea is that if you are working, and Netflix is gonna give this to partners for free. So if you're working on a Netflix show, um, you'll have access to this tool. Maybe the lighting was a little bit off in one of your shots, or maybe you're trying to get VFX into a shot, but it the, it's not playing well with the type of lens that [00:45:00] you chose.
That's what this tool promises to make better, is that you could go and it's mostly a post-production tool. Yes. There's a lot of going out of the way. Gavin to reinforce, Hey, guild, relax. We're not replacing humans here. We're enhancing humans. Which of course is like the right messaging and, and like a great approach, but if this does replace, uh, some sort of post supervisor or a colorist or whatever else, sure.
Are they gonna make that announcement as well? Are they gonna be like, oh, our bad, our tool accidentally did this?
Gavin Purcell: No they're not. And like this is the struggle that they're gonna run and that all guilds and people who work inside guilds. And like I said, I am a member for an hour of the Writer's Guild. A bunch of places like that that are gonna have a hard time with, because this is gonna keep pushing forward as we see more and more people adopted.
So yeah, this is where. I think we said on the show a couple weeks ago this idea that like creatives will have roles in making things for a very long time, but production might change very fast and there will be a lot less roles in production than there were before. Oh, Kevin,
Kevin Pereria: you should have activated your Specter [00:46:00] device.
Yeah, when you said that Gavin, yeah. You should have activated your brand new cyberpunk audio jamming device, which is apparently going to be a real thing. Before you said anything bad about AI taking human jobs.
Gavin Purcell: That's right. So this is the Specter one device. It is a, from a news. Startup, and this is a prototype so far, so we're not sure how this is gonna work.
But this basic idea is that in order to stop AI devices from recording you, we have seen things like friend or other devices that are out there that are trying to record IRL conversations. We are now gonna be interested entering into a world where you are going to be able to block those conversations.
And basically this is kind of like if you've seen those videos of the people wearing cloaked. Clothes where like, you know, you take a picture and like it breaks the camera. This is the same idea with, um, audio, so that when you try to record it, it actually kind of covers it up over time.
Kevin Pereria: Yeah. It's basically the size from what I can see of like a Google Nest or maybe the mini, it's like the little puff sit it
Gavin Purcell: on your table.
Kevin Pereria: Yeah, and it supposedly uses AI and whimsy. I guess it, it says [00:47:00] Specter uses AI and novel physics, uh, to reinvent jamming, so it's gonna be more targeted and smart and portable. They say conventional audio jammers work by overpowering microphones through a lot of power, which is inefficient. So I look, I. I could, I don't know if, I don't know if this is, this device is going to be the one, if it's gonna work as advertised, but 100% I see a future for devices like this, especially as everybody's got the meta ray bands coming around or you know, whatever apple's going to release.
And if you don't wanna be recorded all the time, whether it's your, your face or your voice. You're probably gonna have to have some sort of anti rerecording device on you.
Gavin Purcell: Did you, this might be Conspiracy Corner, and I don't really know, because sometimes you'll do it, end up on a lake, do it, you'll end up on a lake.
But did you see that thing where. Yeah. Temp fo hat. Temp fo hat. So never been, this isn't real. But you see that thing where they think that wifi might be able to give you a video signal of people in their homes. Have you heard this at all? Oh,
Kevin Pereria: real, real,
Gavin Purcell: real. Yeah. Is that real? Real? That's pretty [00:48:00] crazy.
Kevin Pereria: Yes.
Yeah. Yeah.
Gavin Purcell: So tell types. Maybe you have a better sense of what that is, but I just read that the other day and I was like, wow, that's insane. But that feels crazy to.
Kevin Pereria: Well, so wifi is bouncing around everywhere, right? Yeah. Like all the wifi signals, like all the, the, the whole spectrum from your 2.4 gigahertz network all the way up to whatever the hell the, the new standard is.
They're pushing outta Costco. Um, all of that is generating all these like waves. Well, as things move. Through those waves. It creates disturbances. So if you can, uh, build a piece of hardware or software or sometimes both that can read those waves, you can see the distortions and the disturbance. And so they basically train models on that.
So as you move around crazy in your house, if they can get, you know, access to reading those, those waves, they can determine who's moving about. So that is just like. Really
Gavin Purcell: welcome to our future. Welcome to our future. So we're gonna need blockers for that too. You know else we don't need a block for.
Kevin is seeing what you did with AI this week. That's right. It's ai. See what you did there
AI See What You Did THere: So times. Yes. Rolling without a care. [00:49:00] Then suddenly you stop. Fun shout.
Gavin Purcell: Okay, Kevin, we have some really fun stuff this week. Um, one of the first things I wanna point out is there's a video that's gone viral. On TikTok and then on Twitter, and it is called The Shape Store. And this is from the I love it. Yeah, so
Kevin Pereria: much.
Gavin Purcell: So this is from the team that made the Bird Game videos. If you remember those videos where it was Bird Game was like this fake game that got created.
So describe what you are seeing so that the people at home maybe will play it while, while you're talking about it.
Kevin Pereria: Yeah, I mean, it feels like a late nineties early aughts, um, fisheye, uh, camera lens, a glimpse into this underground movement. It seems of this like shape store where a bunch of people are there, they're hanging out, they love big blocks, they're playing mini games with big blocks.
They are, uh, you know, these are like the big, the big primary color. Wooden blocks Yes. [00:50:00] That you probably played with at like a dentist's office when you were in the waiting room when you were younger. Um, I know I'm speaking to a certain demo here, but it's like the old trains with the magnets, the little cars you built, mini castles or whatever.
Well, this is an underground world that I want to visit so badly at the Shape Store where people are playing with those blocks. They're going down slides, they're uh, riding like bumper cars, they're playing midway games. And it's all this like, uh, it's kind of like a hip hop centric feel. To this underground movement.
It's really, it's just like a world I wanna live in.
Gavin Purcell: There's a big story this week that came out where like the Supreme Court decided they weren't gonna like, uh, um, argue that the story that like AI can't be copyrighted. And there was this kind of big fight that went online around the idea that like, well, that's only because if you have a purely AI output that can't be copyrighted.
But once human input touches it, this is just a good example of what it means to be a really interesting voice. Like this is a creative voice. With AI tools put together like this, they took that music, they, they generate all these images, but they made it [00:51:00] look like this thing. Like this is a really cool thing and just go.
If you wanna make sure you go to at a dot I ation on TikTok and on Instagram, that is the person that generated this. You see this get shared everywhere on other socials, but make sure to go follow them and they're very fun. That's
Kevin Pereria: great.
Gavin Purcell: Yeah. Um, okay. Another big video that came out online that I saw this week, which has to do with the fact that we are in a war now, was Billow Wallace, who, who is a awesome YouTuber and does a lot of stuff.
He actually worked in Google's geolocation geospatial world. Does a lot of covers, uh, geospatial tech and ai. He made this incredible visualization for Operation Fury. It is a long, um, YouTube video you can watch, but what he has done is basically taken a UX and ui, laid it over real time data from the actual, uh, rollout of the war, and allows you to kind of jump from place to place throughout it.
It is fascinating. He's productizing this, he's gonna make a product, but for right now it's just this YouTube video. But if you get a second, go look at [00:52:00] this. This is one person doing this. Now he does have an expertise in geolocation and uh, spatial data stuff, but it shows you how far we have come with the ability to kind of vibe code something like this into, into being.
So I just thought this was an amazing thing.
Kevin Pereria: Yeah. Super fascinating to see how quickly something like that goes together. I'm sure there's a million poly market bot bros that are trying to feverishly put their own dashboards together, uh, to look at the conflict in real time. But I would rather scroll infinite fava cons, Gavin?
Gavin Purcell: Yeah. This is a really cool little project that was created by Joseph jojo, which is a great, uh, ex handle name, but he basically took a bunch of favicons. Are they favicons or favicons? I thought they were favicons.
Kevin Pereria: Well, do you put it in your f your fa John Favres or your favorites?
Gavin Purcell: I don't know, man. This is, we gotta figure out, this might be a good one for the comments.
I think it's a Fon. I mean it's a fon, I'm pretty sure. I'm pretty sure.
Kevin Pereria: Yeah. 'cause it's your favorites. So you would say
Gavin Purcell: fabric I
Kevin Pereria: put in my,
Gavin Purcell: yeah,
Kevin Pereria: I put it in my fa my [00:53:00] fabric folder. My where my fabrics go.
Gavin Purcell: Yeah, right where my fabric, anyway, what this is is finitely. What are
your
Kevin Pereria: f What's your favorite fla of ice cream?
Gavin, do you have a favorite fla?
Gavin Purcell: We're moving on. We're moving on. I've got many fabrics. You have many fabrics that I love in my life Anyway. Infinite favicon is a thing where you can scroll through all the favicon that exist in the world essentially, and a favicon. If you're not familiar or favicon, according to Kevin, yeah.
Is the thing that goes in the upper left corner at the far left of your URL. It's the little kind of image that makes it look like that you're on that particular website. I'm gonna ask
Kevin Pereria: FLA or Flav what he thinks. Hip hop artists. Flab a flap.
Gavin Purcell: Flab a flap. Uh, all right, moving on. We're moving on. Let's on lastly, lastly, we have from our friend Za, who we've talked to about a bunch.
He open sourced a music video maker, and he hasn't released this yet, but is very cool. He does a lot of work with local, uh, open source video models, and he basically created a front end to create a music video for [00:54:00] any songs you would upload a music vi, you'd upload a music track, and it would create a music video.
He just goes to show that like these people are out there right now making between, uh, between Blase and Bill Wall. These are two really different people who have specific expertise, vibe, coding something right now. So again, this is your call to go out there and think about what you want to make this week.
Kevin Pereria: I can think of some fun songs that they should plug into their music video making
Gavin Purcell: machine. Oh, okay. I thought you were gonna say like My flab, uh, Valentine or some other song names with flag. But you wanna talk about the fact that we've updated our website. That's fine. I like that too.
Kevin Pereria: Um, yeah. Um, so you my thank you
Gavin Purcell: Flab things.
These are a few of my flab things.
Kevin Pereria: Well now we're
Gavin Purcell: not f fabric
Kevin Pereria: you.
Gavin Purcell: What about Jar Favre? Kevin? John Favre is fab.
Kevin Pereria: That's how I started this was if you're, oh, I
Gavin Purcell: missed that.
Kevin Pereria: Oh yeah. I was too
Gavin Purcell: busy. It's really embarrassed. That was too late now.
Kevin Pereria: So, um, returning fans know that last week I launched a cursed iPod on our [00:55:00] website, AI for Humans Show slash Pod.
Um, using an audio model that I'm still shocked, has not been shut down. Yes. To create, yeah. To create artist covers. From other songs. So we had Adele covering President's, United States lump, uh, the Eagles covering, covering Africa. People really liked a rage against the machine doing Britney Spears Toxic.
Um, I continued to iterate the prompt on my end 'cause I'm not just going to the website and saying, make this. A cover of that, I created a skill for my Claude Bot, Mr. Tibs, that does research on the artist, like pulls like lyrics and what they usually talk about analyzes their cadence, how many words they typically use in a sentence, et cetera.
Um, and then generates a prompt and will modify the lyrics of the song they're covering to try to fit the artist better. And. I would say with like, again there, there's no actual benchmark here, but I would get a nominal improvement, a 10% improvement of the output or whatever. I kept grinding on it after hours, just generating song after song and sort of AB testing be like, that one's better.
This [00:56:00] one's better. It's still a little hit miss, but I've now gotten to the point, Gavin, where. All day in a special telegram group, I have a bot that is ripping out songs for me. Oh wow. It's just choosing which artist goes where and how, and I have like a personalized playlist of weirdness. What are you paying of for
Gavin Purcell: that?
Kevin Pereria: Um, I've given the Sonato site maybe 30 bucks for now.
Gavin Purcell: Wow. Okay. All
Kevin Pereria: right.
Gavin Purcell: Great.
Kevin Pereria: That's not
Gavin Purcell: crazy,
Kevin Pereria: which is, it's it's not crazy. I mean, it's a lot. And, and let me be clear, I wish that money were going to the artists. Sure. I, I like, they're just, it's just not right now. And I try to talk about it last week and a lot of AI haters came after me.
Uh, especially the music AI haters, which are a specific breed. It's kinda like the gaming ones. Um, yeah, and I get it. I totally get it. I wish that money were gonna, the creatives. I think this website is probably going to be shut down.
Gavin Purcell: Yes.
Kevin Pereria: Um, I, I wish it wasn't shut down. I wish the recording industry would make a deal with them and figure it out.
But in the meantime, I'm having fun, so don't yuck my yum. Stay [00:57:00] outta my shit store because, so anyway,
Gavin Purcell: go, go, go to the website and you'll see the new song.
Kevin Pereria: Right? Here's, here's what you'll see. Yes. The prompts that, and Will can cut this all down. I get it. The prompts that I have fallen in love with are one, um, rock and metal bands doing nursery rhyme covers.
Yeah, fine. So you'll get system of a down, doing the wheels on the bus. You'll get rage against the machine doing Humpty Dumpty, where it has lyrics like who put that egg on the wall like. It, it did a good job of, you know, modifying the lyrics to be the performer and then telling the model Mr. Tibs to go out and generate songs based off of marketing, marketing slogans and advertising swap in the style of the Mars Volta, arcade Fire, Emma Rosa, these bands that I love.
Ugh, tell me it's not you. You can tell me it's not good. I'm not gonna listen, but you can tell me it's not good or, or you can throw a couple bucks in to buy me a coffee so I can afford to
Gavin Purcell: make more. Buy me coffee, buy me a coffee. But also, this is what's so great about what we're talking about.
Kevin Pereria: Yes,
Gavin Purcell: this [00:58:00] tickles Kevin, right?
Like and tickles Kevin in a way that like is very particular to Kevin and that is awesome. That is a very awesome thing. Like, and you know, there's all sorts of things like that that exist out there. Go listen to it. AI for Humans Show slash Pod. You can see it right now. We'll have the new tracks there and we will see you all next week like you sign off.
I didn't know we were
Kevin Pereria: gonna be back
Gavin Purcell: next week.
Kevin Pereria: Bye
Gavin Purcell: friends. Bye everybody.