Aug. 22, 2024

Ideogram & Flux Make AI Imaging Too Realistic, OpenAI Updates & More AI News

Join our Patreon: AI news coming in HOT: Ideogram 2.0 dropped and, along with Flux community updates & Google’s IMAGEN-3, continues to show how AI imaging tools keep improving… but also opens the door to whole new messes. Plus, OpenAI’s...

The player is loading ...
AI For Humans

Join our Patreon: https://www.patreon.com/AIForHumansShow

AI news coming in HOT: Ideogram 2.0 dropped and, along with Flux community updates & Google’s IMAGEN-3, continues to show how AI imaging tools keep improving… but also opens the door to whole new messes. Plus, OpenAI’s drops new blog posts (wee!), Unitree sends their cheap humanoid robot into production and we use Hedra and a bunch of other AI tools to interview Baby Joe Brogan. It’s quite a moment. 

 

Follow us for more on X @AIForHumansShow

Join our TikTok @aiforhumansshow

And to contact or book us for speaking/consultation, please visit our website: https://www.aiforhumans.show/

 

// SHOW LINKS // 

Ideogram

https://ideogram.ai/t/explore

Ideogram 2.0 Launch Trailer

https://x.com/i/status/1826277550798278804

Fluxstanza

https://civitai.com/models/657252/fluxstanza?modelVersionId=735368

https://x.com/itspoidaman/status/1824957283803308536

FAL.ai

https://fal.ai/models

Political AI Image Controversy

https://www.nytimes.com/2024/08/19/us/politics/trump-taylor-swift-ai-images.html

Imagen-3

https://deepmind.google/technologies/imagen-3/

Procreate Boss Says They’ll Never Use AI 

https://www.theverge.com/2024/8/19/24223473/procreate-anti-generative-ai-pledge-digital-illustration-creatives

OpenAI Fine Tuning Launched For GPT-4o (explain what this is…)

https://openai.com/index/gpt-4o-fine-tuning/

Partnering with Conde Nast

https://openai.com/index/conde-nast/

Matthew Berman SearchGPT Video

https://youtu.be/DV9I_fu0ba8?si=Kcd4wb54wFgwaQuJ

Unitree’s 16k Humanoid Robot Goes Into Production 

https://x.com/UnitreeRobotics/status/1789931753974517820

GoT AI Rave

https://x.com/andr3_ai/status/1825600625754911091

Space Vets Children’s Series: 

https://www.storybookstudios.ai/space-vets

Demis Hassabis on the Google Deep Mind Podcast

https://x.com/GoogleDeepMind/status/1824447036847993292

Runway GEN-3 Turbo

https://venturebeat.com/ai/runways-gen-3-alpha-turbo-is-here-and-can-make-ai-videos-faster-than-you-can-type/

Hedra 1.5 Character Generation

https://x.com/hedra_labs/status/1824113944757457157

 

Transcript

AI4H EP072

[00:00:00] Ideogram 2.0 is crushing text and the new flux updates are getting super personalized. Which is great if you're George Costanza, and it's also super problematic if you're Taylor Swift. We will dig into how this is literally changing media as we know it. Plus OpenAI just released new blog posts! What? That's, that's cool.

It's not Strawberry, or the Advanced Voice stuff, or Sora, they're Video model, but we got a lot of words, Kevin! We got words, baby! That's so exciting! All of that plus we'll be showing you some new AI tools that allow you to make sketches, comedy sketches, that are pretty fun in under an hour. What were we talking about again?

Oh yeah, boats. Wait a minute, they're easy to make and they're watchable? And it's almost good, Kevin. Speaking of almost good, this is AI for Humans.

 

Gavin Purcell: Welcome. Welcome. Welcome [00:01:00] everybody. This is AI for Humans, your weekly guide into the wonderful and wild world of generative AI. We are here and I am here and Kevin is here. Kevin, what is going on? How are you today?

Kevin Pereira: Hey buddy. It's good to be back. Thank you for holding down the old fort and thank you for all of the not kind messages that everybody posted about me being gone last week.

Gavin Purcell: We, we had a great time with Ben Rellis, but we are very happy you are back. But Kev is another giant week in the AI world as it is every week. And so it is time to get into the news. ,

 

 

Gavin Purcell: that's right. Big, huge AI imaging updates this week. The biggest and most latest one is that ideogram 2. 0 just launched. , this is a brand new AI, , imaging update to ideogram, which we have talked about on the show before.

Gavin Purcell: Ideogram was always very well known [00:02:00] for the early days of being pretty good at text. And when we say pretty good back in the day, it was like, Oh, it got the word shaped right. And most of the words on screen. But Kevin, this update just to me showed me that we are entering a new world where all of this stuff is going to continue to get better and better.

Kevin Pereira: I feel like it's a secret weapon amongst some AI creators, the new update has a bunch of features that make the text experience way better, but there's additional updates outside of that, like color palette support, which at first might not seem like a huge deal, but if you're dealing with text, A brand, and there are very specific colors that you want to assign to things.

Kevin Pereira: They have an example of multiple living rooms, Gavin, where there's like a blue palette, a red palette, or a yellow palette. And they're just applying those colors to the scene and it accurately and reliably does that. If you're a developer, they now have an API, which lets you use the service via their backend. So you can do tons of requests. You're not restricted to just their website. They have an iOS [00:03:00] app out now, so you can go and browse and create right from your phone, which is really cool. But you like the realism stuff here, Gavin.

Gavin Purcell: One of the things about this is all of these new AI imaging models. And we talked about flux in the show. Obviously, mid journey 6. 1 has come out. They are all getting so good at realism and what this is going to end up being is kind of like, you know, choose your, choose your poison is maybe the wrong way to say this, but choose your tool and figure out what you can do.

Gavin Purcell: The realism here is great. There's images they released of , pictures of people that look very solid. I don't know if it holds up to say the top of the line flux or mid journey exports. We played around with a little bit. I used it to generate. An image of monster milk. If you're familiar with our show, you know, monster milk is the drink that sometimes we feed to a eyes.

Gavin Purcell: I wanted to see, could it create a, essentially like a magazine or a website ad for monster milk? So what I asked it was, I said an ad for an energy drink with the name monster milk, it features a can of monster milk and the subtitle, your AI will [00:04:00] never be the same. So Kev, you've seen this. It very much looks like, to me, something you could see in like, say, Game Informer, R. I. P., you know, circa 2012, let's say, right?

Gavin Purcell: Like, it's a can of, of Monster Milk with a very clear, you know, graphic that says at the bottom, Your A. I. will never be the same. The words all look very solid. There is one interesting thing about this, if you are not watching this, that I think Kevin will point out.

Kevin Pereira: What are you talking about? If you're a lawyer and you love IP infringement, is that what you're talking about? The fact that it has. The monster energy logo directly on the can but you didn't ask for that specifically. You said monster and energy drink so that would be an interesting connection to make but clearly the logo is in there and not being censored in the output

Gavin Purcell: Yeah, and what's interesting to me is, it is very directly the Monster Energy Drink can. It is very directly the Monster Energy logo. And yet, it just had no

Kevin Pereira: even their famous tentacle because uh as the preferred beverage of manga and hentai enthusiasts it even got the [00:05:00] weird

Gavin Purcell: that isn't part of really monster, monster energy, is it?

Kevin Pereira: no, no, no, it's not.

Gavin Purcell: don't know if that monster energy, you know, if that's actual monster energy, uh, lore or not.

Kevin Pereira: I will say from a font perspective, like not only is it getting the text right that you asked for, right? The monster is in a very monster energy like font. It shows a different font beneath that. Your AI will never be the same. It went with this purple gradient bold font down below the only gibberish seems to be like the vitamin labeling across the top where it would normally be like asbestos, taurine,

Gavin Purcell: gibberish anyway, most of the time, you

Kevin Pereira: Yeah, basically I'm like, it's all lead paint. That's all it is in these energy drinks, but , it's really, really solid. It does a great job with, with lots of text in a single image, but I do wonder about, the IP infringement here.

Gavin Purcell: Absolutely. And I think that's something that's important to think about, especially as you get into the idea of these things being used commercially. Right.

Kevin Pereira: So one of the things you mentioned, , was that, these different image models have [00:06:00] their different strengths. And when we talk about the realism here, it might not be as good as the best in class mid journey or flux, or even stable diffusion stuff.

Kevin Pereira: But what's impressive about the ideogram approach is that you don't have to bolt on a bunch of different products and pieces and custom train things. It does a really good job. Out of the box. And you generated some realistic looking people that were stylized

Gavin Purcell: so I, we're gonna talk about this later, but over the weekend I made up a weird series of videos called AI dates for you, which are kind of a parody of the late eighties, early nineties dating videos that we all love on the internet. But I wanted to see, could it make a screen grab of a ad for this show?

Gavin Purcell: And so all I prompted it , was like screenshot from the television ad for an in quotes, AI dates for you circa 1989. Okay. And it provided a pretty good image of two people sitting in front of what looks like a TV set on a studio set where behind their heads it says AI Dates For You. The guy has a nice mustache [00:07:00] and a blazer with a t shirt underneath it.

Gavin Purcell: And the woman, it's a little funky age wise in terms of like you're not exactly sure what year these people are from. The woman could be from like 1950 or from 2022. In Bushwick, for all we know, but like this is, you know, a really cool look at that. And then the funniest thing is, of course, and we're going to get into this more later.

Gavin Purcell: I then took it and ran it through runway gen three turbo. And I had these people walk onto the set. And other than the fact that thought the name of the show was Al dates for you, which is a different show, probably. It was really compelling in a literal five minute workflow to get something that was like an idea that was in my brain straight into the real world.

Kevin Pereira: We're going to deep dive into that workflow later on in the show But it's just really exciting to finally be able to create at the speed that you can imagine things

. And before we move on to our next big story, there's a couple of things we have to tell everybody here. If you're watching this video on YouTube, please like, and subscribe. We do these every week. There's a full episode that comes out [00:08:00] every Thursday and fun stuff that comes out throughout the rest of the week.

, and also join our Patreon. Our Patreon will be our first link below this video or on our audio. We are actually opening our discord next week. It is for real. We have a discord. I'm excited to share these workflows with people that are in our audience.

Gavin Purcell: And also Kev, I'm sure I would love to hear their workflows as well, because that's one of the biggest parts of this whole space.

Kevin Pereira: So let's switch gears, but stay on the topic of AI imaging because Flux came out of nowhere and it seems like every day there's new updates. There's people creating new, amazing with it. That's the power of open sourcing a model .

Kevin Pereira: And we're seeing personal training happen. I remember like Flux came out, what, two weeks ago and people said, oh, but you'll never be able to do this. And then after the ellipses, a whole string of things that everybody was going to be wrong about because communities find a way and now we've got personal training, creating Loras for whatever you want, for products, for people, for brands, for yourself.

Kevin Pereira: Yes.

Gavin Purcell: yeah, it's pretty. What's, what's crazy to me about this is [00:09:00] you and I have been doing this show now for a year and a couple months and stable diffusion 1. 5, which was like my favorite early model of the open source models allowed you to do this, but it was so much worse. Flux is such an incredible model that it allows you to make extremely realistic looking images.

Gavin Purcell: And Kev, one of the best things that came out was Flux Stanza. A ex username, its Boy Aman or Ghost put out a Flora, which is a train model of George Costanza, and he called it Flux Stanza. And this basically allows you to take George Costanza, the character from, , Seinfeld and put him into almost ev anything you'd want him to put into. So , the examples that are on the Civic AI page, which we'll link to in the show notes, is like him on a motorcycle.

Gavin Purcell: It's him as on a tarot card. There's just all sorts of interesting things you can do with this.

Kevin Pereira: There were entire startups, , dedicated to training these custom models where you could upload all these photos and you could give them 20 bucks a month or sometimes hundreds of [00:10:00] dollars. Now, not only can you do it for free locally, if you have a decent enough machine and you want to spend the time to do it, but a website that you turned me on to, FAL, F A L, lets you for free.

Kevin Pereira: Five whole dollars. Train a custom Flux Laura without any real technical knowledge. You just upload the images, as many as you want, of the person or the object that you want to train, hit the button, and now you can apply it to so many different styles. Obviously the internet is going insane with it and style transferring themselves to be Spider Man with big bushy mustaches and beards or going full anime with it. And unfortunately, this leads us to our next topic, which is the present hellscape that is mis and disinformation, Gavin, because a certain presidential candidate tweeted out, , AI imagery of Taylor Swift fans and made it seem as if the contingent of Swifties out there is supporting his candidacy.

Gavin Purcell: This is [00:11:00] something we talked about a little bit last week and in terms of grok, especially, right? So grok images are powered by flux. And this all goes back to that idea that , you now have a cutting edge image model, AI image model in the hands of basically tens or hundreds of millions of users who are going to make interesting edge case stuff with it.

Gavin Purcell: And in this case, what's fascinating is There were pictures of fans of Taylor Swift ostensibly, these are all AI images that were wearing shirts that said Swifties for Trump on them. And then the orange one actually, , posted it on his own, , true social account. And obviously this is the kind of thing that we expected.

Gavin Purcell: Now, Kev, I will say one thing about this that has kind of been interesting to me is the stories , that came out of this and we, there's a great New York times story on it, but a lot of the ones, the stories weren't like, Oh my God, are these Taylor Swift fans actually Trump fans? It was much more right away. Donald Trump , is retweeting AI images, right? So like, it wasn't like it fooled the mainstream press,

Kevin Pereira: yeah, , it doesn't [00:12:00] surprise me in the slightest that the New York Times, their bend on the story was, Oh, we're sharing AI imagery. I'm sure on my father's Facebook page, it's a very different share from a very different something going like, finally the Swifties have seen the light.

Kevin Pereira: Right. Like that's,

Gavin Purcell: it's shrimp Jesus with a shirt that says shrimp Jesus for Trump. And he's like, that's, I love that. I love shrimp Jesus. Is

Kevin Pereira: when we started this journey together, you know, it feels like eons ago we talked about where

Gavin Purcell: that what you would describe this as? We're on a journey. Are we holding hands? Am I carrying you? Are you carrying me? Is

Kevin Pereira: There's one set of footprints on this podcast, Gavin,

Gavin Purcell: started this

Kevin Pereira: our patrons.

Gavin Purcell: That's

Kevin Pereira: we started this, we knew we were staring down the barrel of an upcoming election year and we said, Oh man, the potential for misinformation and disinformation. I, at the time was imagining these little cabals releasing modified video clips to catch a candidate doing an oopsie that they never did or [00:13:00] whatever.

Kevin Pereira: And we got a little bit of that, but to me, this is actually the more insidious stuff because whether you or I, or someone watching or listening to this knows immediately, Oh, that's AI imagery. That's not the point. The point is if you can flood the lane with little nudges like this all the time, even if 1 percent of the audience receiving the messages thinks, Oh, , these people are supporting this person.

Kevin Pereira: Oh, these people are against this person. It doesn't matter what it is. It's just confusion and noise and chaos that with these tools, you can deliver those messages at lightning speeds.

Gavin Purcell: I bet there are a lot of things that have been waiting to drop until post election. And I've said this on the podcast a couple times, lots of people shouting out the fact that, , AI is dying, that the advances are not that strong.

Gavin Purcell: I think if you look at the stuff we just talked about and the stuff we're going to talk about later, you may not just be looking at the right side of things per se, but like there are new advances coming for sure.

Kevin Pereira: Yeah. Insane new tools dropping left and right. And we consider it a slow news week. That is [00:14:00] how wild this scene is. I will say though. Hey, kudos and shout outs to Google, Gavin. You're not going to find any AI generated Swifty nonsense over there because you might not be able to generate anything with the power of Imogen 3.

Gavin Purcell: So I do want to say Imogen 3 is a new AI generator from Google and Imogen has been their kind of like cutting edge image generator. And there's a lot of interesting, cool updates that they've made with it. But I tried to use it to create a image that I use to make one of these videos I'm going to talk about later, AI dates for you specifically, I was trying to create a woman from the 1990s.

Gavin Purcell: Who was a beautiful actress who was staring at the camera in a cheetah leotard with a, in a vintage gym. And I said, realistic 35 millimeter film. And it gave me zero results. And you look at that prompt and you're like, well, I don't know if that's like, is that, is there anything in there that seems problematic?

Kevin Pereira: looking for something problematic, which is what I do the moment you send me anything. Gavin, because I have to save you from [00:15:00] yourself. That's how this

Gavin Purcell: Oh, thank you. I appreciate

Kevin Pereira: I'm the one set of footprints. I was like, is cheetah leotard too presumptive or something? But not only did it recognize it in the prompt, it highlighted it and made it a dropdown, which is one of the things that you can do with the image effects app.

Kevin Pereira: It'll highlight certain descriptors and then give you dropdowns to select alternates. It highlighted it in this case. I would think that means it understands what it is because it's going to suggest additional things, which would tell me that it should know this is a totally benign prompt.

Gavin Purcell: Yeah. And what's interesting to me is it goes back to that thing where if you have a couple bad interactions with something, there are so many other tools that work that why would you go back to it? And again, this is Google's problem. I've now had this with Google a couple times because this is the issue that I ran to with Gemini as well is that Gemini a couple times didn't work for what I wanted.

Gavin Purcell: And like, Well, why would I go back to Gemini? Why didn't I just use chat GPT or why didn't I just use Claude? There are all these things out there.

Kevin Pereira: I Had similar results. I went to image in three. And if you look at my prompt, [00:16:00] I tried to make a baby and a cream white hoodie wearing black headphones, sitting at a podcast desk, maybe in a few minutes, if you stay tuned, you'll see exactly why that is, but Imogen, Similarly, shut down that request.

Kevin Pereira: And I have no idea why.

Gavin Purcell: Maybe yours because it's a baby, but both of them, it didn't give you any information about why it rejected it.

Gavin Purcell: It just doesn't give you results, which is again, it feels like, you know, Google had this problem where they would make these kind of hyper, , using the word woke, like images of the presidents. Right. And they got in huge trouble because it was like, why isn't it making an image of George Washington?

Gavin Purcell: It's making a bunch of different races of George Washington. I wonder if they've just gotten so gun shy that they're really being careful about what this makes, and then it becomes, to be honest, almost useless, right? Like, if you're that gun shy and you can't see any images, it just doesn't feel like it makes sense to me.

Kevin Pereira: Yeah. If I'm in my Google slides and I'm making a presentation about our quarterly [00:17:00] earnings, maybe Imogen 3 will be able to generate something. Line go up, hooray. But , for our use cases right now, it was a little underwhelming. And I think what you said is really, really important when you have a bad experience, which could be an inappropriate image generation, or in our case, a refusal to generate any images, you're moving on.

Kevin Pereira: You're going to another product. And I think we are all going to move on to Procreate. Because when I think, when I think AI and cutting edge implementations of the latest technologies, I want to go to one of the most popular iPad design apps, Gavin, and I want my AI and they're going to give it to me.

Gavin Purcell: Kevin, I'm sorry, but it's impossible. You're not going to get it. Actually, this is a very interesting story. The CEO of procreate came out with a video a couple of days ago where they have vehemently said they are never going to , interject AI tools into procreate.

Gavin Purcell: Take a listen to this.

[00:18:00]

Gavin Purcell: Procreate is a huge app. I've used it a bunch. My kids have used it. It's a great iPad app. It has become like a very big creator tool for art and making art. It kind of launched as a kind of weird, like kind of alternative to illustrator when the iPads first came out, but it's become a very big company.

Gavin Purcell: So my initial take on this, and I want to hear yours is I get it. You know, there are so many creators that absolutely despise AI tools. But I do think in some ways this could be shooting them in the foot a little bit because in a world where You have all these possibilities come up and [00:19:00] you want to use things like the stamp tool, which guess what everybody?

Gavin Purcell: There's AI involved in the stamp tool in Photoshop I think this could be something where it's a great PR play for them right now But I'm not a hundred percent sure the promise that this is never gonna happen is the right thing to do

Kevin Pereira: Yeah, I think Procreate is amazing, Gavin. I, uh, I promptly uninstalled it this morning. I, uh, no, no,

Gavin Purcell: Oh, I was gonna say, Everybody on the internet, Kevin has come down anti procreate, the most joyous gift in the world to

Kevin Pereira: I'm so pro AI that I threw my iPad off the balcony because I couldn't figure out how to uninstall the app.

Gavin Purcell: Kevin, you're anti create. That's what you should be. Instead of procreate, you're anti create.

Kevin Pereira: Oh no, there's a wormhole in my brain. You shattered the fifth wall. , they're going to have to walk this back if their product is going to remain remotely competitive in whatever your timeline is. I could say in one year, five years, 10 years, they're going to have to walk this back it's a great [00:20:00] press release. It rallied the never AI ers. I think that's fine. But I think even those never AI ers in time are going to slowly move the chains or move the goalposts and start saying, well, this implementation of AI is okay, because it's not.

Kevin Pereira: Generating art and this, that, the other, it's just helping me streamline my workflow and slowly that water's going to boil and then everybody's in the same pot, I think could be wrong.

Gavin Purcell: , I think you're right. But also like, I do want to shout out the fact that like, look, I think the important lesson here is there's a lot of creatives who feel that the act of creating itself is being usurped by AI. And I just completely disagree with that in, in an, in a very specific way, which is that if you'll see this dumb thing I made at the end of the show today, when we talk about it, these AI date for you videos, .

Gavin Purcell: There's something about the tools that AI gives you that allows you to create both faster and do things you weren't able to do before. And that is the trade off, right? , and granted, we've said this a bajillion times in the show, the [00:21:00] training of these models in the initial run, clearly there are problematic issues with that I think this is something we have to continue to think about, , going forward.

Kevin Pereira: Let's shift our attention, Gavin, from a company that says they're never doing anything with AI to the foremost leaders of the scene, dropping some astonishing announcements on a blog post, but they didn't actually release, well, they kind of released something new, but I guess it's fine.

Kevin Pereira: It's open AI, Gavin. We're talking open AI, , fine tuning for GPT 4. 0 is out. And I actually squeed a little bit with delight for that, but I understand why most people don't.

Gavin Purcell: I know what fine tuning is, but can you explain in this specific instance, why this is pretty cool for people that might want to try to use it?

Kevin Pereira: Fine tuning helps you steer the output of these language models so that you get the desired result that you want.

Kevin Pereira: For example, if you're building a chat bot for customer support or for your AI boyfriend, you can fine

Gavin Purcell: a better example than that. Kevin, come on. Let's, let's [00:22:00] talk about what if you're fine tuning a chat bot to make, , hot dogs speak a language, , that is like cling on, but something different. That

Kevin Pereira: So you want,

Gavin Purcell: new case.

Kevin Pereira: you want a Juggalo hot dog. That only speaks with ICP parlance, right?

Gavin Purcell: exactly. So

Gavin Purcell: help me, help me build that because that's

Gavin Purcell: clearly the

Kevin Pereira: I'm all kosher. Yes. So you would then, for example, for fine tuning, you would come up with a list of 10 to 10, 000 inputs and outputs. , if I ask you, Hey, what's up? How are you feeling this morning? The Juggalo hot dog might say what Gavin?

Gavin Purcell: It might say, , I love Faygo and also I'm a big sausage, uh, put mustard on me. Is that what it would say? If you're not familiar with

Kevin Pereira: I'm going to fine tune a model that every response is just, I am big sausage. Okay. Put mustard on me.

Gavin Purcell: Okay, keep going. The poor people listening to this are like, what just happened here? But that's fine.

Kevin Pereira: Not to get morbid Gavin, [00:23:00] but I know what I'm gonna lobby to have etched on your tombstone, my friend. I am big sausage. Put mustard on becomes a thing where people as a tribute to me just come around and spread mustard on my tombstone.

Kevin Pereira: Boy, that was a detraction. The point is, you fine tune so that you can get a certain type of response. That could be a certain format.

Kevin Pereira: Of a response, if you want code, , or a certain tonality out of your agent. So now if you're building with GPT 4. 0, you can fine tune. The prices on this are also super competitive. So we talk a lot about individual users using AI and their workflows or creatives trying to generate new things. But from an enterprise standpoint.

Kevin Pereira: OpenAI is really keeping things attractive by rolling out features like this and making them cheap. They also, Gavin announced a partnership with Condé Nast. So when we look at OpenAI's search product, which has been rolling out slowly to people, it's their perplexity or their Google competitor, they're making arrangements to make sure that best in class content appears within those results.

Kevin Pereira: [00:24:00] And they don't get slapped on the wrist with a giant lawsuit.

Gavin Purcell: . Actually, Matthew Berman, one of our favorite YouTubers created a video, which was comparing search GPT cause he got access. We don't have access yet. Most people don't and perplexity and Google search. And I have to say, Kev, after watching that video, I'm not super impressed by what search GPT is yet.

Gavin Purcell: It's very limited. I do see a world where in the future it's going to be really interesting, but the perplexity product kind of is really compelling already. But anyway, the idea that they partnered with Condé Nast is another way of signaling to the world at large.

Gavin Purcell: , look, we're doing business as per usual in the, in the way that people should be doing business. We're going to pay media companies , for their ability to. provide stuff for us. And this is really what OpenAI is doing going forward.

Kevin Pereira: To be clear, that's Vogue, The New Yorker, Condé Nast Traveler, GQ, Architectural Digest, Vanity Fair, Wired, Bon Appétit,

Gavin Purcell: the other thing that Condé Nast owns a big chunk of is Reddit, right? So there's no Reddit deal done yet, but we know that Reddit did a deal with Google. So we assume that there might be a Reddit deal around the corner as [00:25:00] well. All of that is pretty big news.

Gavin Purcell: Again, just blog posts. No products, but open eye. We know your products are coming soon enough.

Gavin Purcell: Okay, Kev. There's one other big story that we want to cover here before we get into some really interesting AI video tools. Unitree's 16, 000 humanoid robot goes into production. There is a great video that will be showing on the screen here and we'll link in the show notes of this Robot just go in bonkers.

Gavin Purcell: Like it is just like dancing around, moving around fast. But again, the thing here that's really interesting is it's 16, 000. Now that is a lot of money, but when you think about the idea that like, okay, is this thing going to work in my house for me? Am I going to have it be able to do the chores that I don't like to do?

Gavin Purcell: Is it going to be able to do certain things for me? It's not crazy. And you can see a world where what have these got down to like, say 5, 000 or 2, 500.

Kevin Pereira: Gav, let's keep it at a 15, price point. Let's keep it there, but let's imagine Two and a half to four and a half years from now where version three or four of [00:26:00] this is more capable does move faster can clean the house can do the laundry can make you the meals can take out the trash. Let's imagine that.

Kevin Pereira: Let's let's make it 20, 000. Well, guess what Americans are going to do now, right? They're going to drop the price of the car that they want from being like a 30, 000 some odd dollar purchase, right? They're going to get the smart car off of Timu and they're going to grab one of these as well because it will be

Gavin Purcell: Don't buy the team of car. I do not suggest buying a car off of Timu. There's

Kevin Pereira: Timu car's great! Who needs wheels that actually spin around? You don't need those! Let them fly off! Your robot can repair it. No, I really think that I think you're going to see consumers What? Go ahead. Here we go.

Gavin Purcell: I never thought about that before,

Kevin Pereira: Robot pit crew?

Gavin Purcell: no, but imagine this, like that's a really interesting use case of humanoid robots. Mechanics, if you're listening, I'm sorry, because this is going to sound scary to you, but , there is a world where my humanoid robot could easily download all the information needs to fix my [00:27:00] car and I just send it out to my garage and have it work on my car and do the oil changes and everything.

Gavin Purcell: That is

Kevin Pereira: I changed the fuel filter, not the oil filter, not the, the fuel filter in a Ram 3500 diesel, Gavin. Do you know how hard that was for me? I am, I had to use, I had to use more than a tool. I had to get under a vehicle, but I did it harnessing the power of a YouTube video. So yeah, there is a world where people will make that decision.

Kevin Pereira: I think consumer spending habits are going to change drastically. And maybe instead of . Buying a new phone every year or opting for a 45, 000 vehicle and making payments on it indefinitely, people are going to shift because they'll see the value of having that robo assistant everywhere. , I also love the memes of this robot folding up in very weird ways.

Kevin Pereira: I mean, it is like, it's look at how it

Gavin Purcell: the Ray Gun, Ray Gun vibe, right? Like, I'd love to see it do an Australian breakdancing event. So, you know, maybe [00:28:00] robots do come into our homes, but there's a lot of people that don't believe that robots are actually a real thing.

Gavin Purcell: In fact, Kevin, there's a ton of people out there who kind of think that the Internet is full of lies, right? That people are lying to us and we have to get closer to the truth. And this week's AI Coast is actually really interesting. We have one of the largest , baby podcasters on the planet. That's right.

Gavin Purcell: We have an exclusive interview with an AI denier. He is an infant supplement enthusiast, and he is a world famous crib cage commentator. That's right, Kevin. We have baby Joe Brogan on the show. This is baby Joe. Can we, can we meet baby

Kevin Pereira: So excited. As a fan of the Crib Cage Championships, I've been watching them since the early 90s. Gavin, I'm so excited. Yes, . Baby Joe Brogan, please say hello to the AI for Humans audience.

Kevin Pereira: Um, yeah. So, uh, Gavin and Kevin, huh? What's up guys? And, hey, to everyone listening to AI for Humans, if you're here for the truth, let's get something straight, okay? [00:29:00] AI? Robots? Nah, that's just a bunch of adults making up stories to keep babies in line.

Kevin Pereira: Uh, I mean, come on, if AI is so real, why why haven't I seen one change a diaper or make a bottle? Okay, now that is a good point. Baby Joe Brogan has a good point. Maybe because the articulation is so bad that when you see these robot hands, cracking eggs or smashing walnuts, you don't exactly want it handling an infant in a diaper situation.

Gavin Purcell: don't I I just think I just realized our transition into baby. Joe was very very Hard right turn into what we're doing here. Just everybody clear.

Kevin Pereira: Well, speaking of hard, right? We have baby Joe Brogan here. Anything else you want to ask him, Gavin?

Gavin Purcell: Baby Joe Rogan is an AI co host. We created Baby Joe, but what we have done is basically taken, , Joe Rogan's essence, let's call it, and brought it into a baby. . So baby Joe, we're, we want to hear more about your podcast. We want to know a little bit about what your show is and how you [00:30:00] grew so big.

Kevin Pereira: Um, yeah. So, the Baby Brogan podcast. It's like, you know, where I get real about the baby experience. I started out just ranting about the stuff we all go through. Like formula conspiracies and Big Crib. Big Crib? Trying to keep us down with nap schedules. Turns out, a lot of babies were feeling the same way, but nobody was talking about it, you know?

Kevin Pereira: Sure. Sure. I started bringing on guests. Other babies. Even a few toddlers. Even a few toddlers? Who got their own takes on the world. We debated everything. From The ethics of tummy time to the truth about teething rings. The key, um, just being real and not afraid to ask the tough questions.

Kevin Pereira: I love like a baby Huberman talking about tummy time.

Gavin Purcell: That's right. Exactly.

Kevin Pereira: The most optimal rubber animals to bring into tub time. Like, is it, is it the ducky? Is it, , a ship? Is it like a little rubber tugboat? Who knows? We have [00:31:00] to explore this. We got to trust the science. He did mention his guests, Gavin, and I think that that's interesting ,

Gavin Purcell: when

Gavin Purcell: our audience out here may or may not be familiar with baby Brogan, your feed may not be delivering this stuff. So baby Brogan has interviewed some very other famous babies.

Gavin Purcell: As we said,

Gavin Purcell: This isn't a clip from his interview with baby Terrence Howard.

Kevin Pereira: When you say number one plus number one equals number two, wha What do you mean? The thing about all this, it's vibrational.

Kevin Pereira: Look, you do a number one. You got a wet diaper, right? Sure. Yeah. Uh, that's how it works. Okay, now I'm saying you do another number one. Then, vibrations, and voila. A number two. But Terrence, it doesn't work like that. You've got to keep your mind open here. I, and, and, and, and, and, uh, Huh. Mm. Okay, I just, I just had a vibration, Joe.

? [00:32:00] Okay, and that is

Gavin Purcell: Wow. This is pretty impressive. , baby, Joe. , , if you're listening to this on the audio, one of the reasons we did this AI goes this week is because we wanted to show off what you're able to do now with these new video and audio tools.

Gavin Purcell: , go check out the YouTube video. We have some pretty interesting. Um, basically clips we've made of what would look like a baby Joe Rogan podcast. , before we roll out of here, we should play one more clip from baby Joe's podcast where he interviewed, , a very famous CEO baby.

Kevin Pereira: Joe, you know me. I have big baby ambitions. You don't have to tell me, man. I've seen Baby X. It's so much better now. No more whiny little bitches begging for burpees or upside downsies. Um, well, Joe, yes, uh, haha. Um, but have you ever looked up into the sky at night? Ooh, yeah, that, uh, dangly mobile thing.

Kevin Pereira: It's freaking weird, man, just hanging out there. Lot of cool shapes, though. Yes. Ha ha. Very cool shapes. [00:33:00] Well, we're going there. We're sending babies out into the great unknown. We deserve to know why those shapes dangle like that. How do they produce that soothing music? Why are they always moving?

Kevin Pereira: Is that safe? Is being a baby ever safe? Um, no. Pass me that joint.

Kevin Pereira: Baby Elon Musk. Baby, baby long, right? Baby long? Babylon, , the inventor of the Cybertrike. We're all pre ordering, we love it. We're super fans. Why did we just do that, Gavin? Why did we spend time creating these voices and these characters? What can the audience take away from that?

Gavin Purcell: this is part of our workflow that we're going to get into in a little bit, uh, talking about how to take a very simple, dumb idea and turn into something that's a little bit more complicated, maybe still very dumb, but more complicated. So we're going to get into that later. We're sorry for putting everybody through , baby Joe Brogan. But for now, we're going to put him aside, Kevin, and it's time to talk about some of the things that we've [00:34:00] seen out there on the internet this week that we love.

Gavin Purcell: These may not be new AI tools, but these are people using AI tools in a cool way. It's AI. See what you did there.

 

Gavin Purcell: First and foremost, I want to shout out a creator whose name is Andre three underscore AI, and this guy created one of my favorite viral AI videos to date. It is the game of Thrones, like massive rave. There was also a version of this was with Lord of the Rings, massive rave.

Gavin Purcell: And you might've seen this where like, it's a hardcore, like kind of trans house music track. And it goes from like game of Thrones character to game of Thrones character. And then from AI character to AI character. And what was interesting about this is this guy, he, at the time, he only had like a thousand followers on, on X and now he has [00:35:00] 1700s, but somebody had taken his video, ripped it and uploaded it as their own.

Gavin Purcell: And then Elon reposted it. , there's a lot of people out there like, Hey, he doesn't deserve the right to, he didn't create it. He just prompted it. That's not true. He did a lot of work on this to put it together. It is not an easy thing to do what he did, and it's a very well done thing. But even so, , Other people on social media taking what somebody kind of made and kind of purporting it, pushing it out.

Gavin Purcell: There's not the greatest. And again, Elon does this all the time.

Kevin Pereira: That's right, for anybody saying that there's no talent or artistry required for AI art, look at what Andre just made. That Game of Thrones rave, the electronic dragon carnival, whatever you want to call it. Look at that, and then look at baby Joe Brogan, alright? There are levels. To this stuff. All right.

Kevin Pereira: And if you're talented and creative, you can go viral on social as well. , speaking of that, Gavin, you turned me on to the space vets, children's series. And , people should go look at it. It's made by storybook studios. They're, a production company out of Germany. And when you watch the trailer, it looks like a pretty cool, competent, animated kid series.[00:36:00]

Kevin Pereira: A bunch of different characters, , floating around in space. There's animals, there's ships, , and it is pretty elegantly animated with decent lip flap. But then you go down below on the page and you get to the making of Gavin. And that's where you see that it's a very small team that's using it.

Kevin Pereira: AI pipelines so that they don't have to, , spend weeks modeling a character in 3d. They can use AI to generate a 2d character. Then use another tool to make it 3d. Then use another tool to film themselves and apply the animation to that character. And they do this for every aspect of the show. All of the textures in the spaceships, the set design, probably even some of the sound and music.

Kevin Pereira: This is the next gen workflow that I think more places are going to have to adopt if they want to compete with production companies in the future.

Gavin Purcell: , kids animation especially, but animation overall is probably gonna feel this first. Right? And the, you know, just to, to shout out, there is a, a big strike going on in the animation world right now. And I think that animation has always been a trickier place for people to [00:37:00] get paid in because it's seen as lesser work, which it absolutely isn't.

Gavin Purcell: It's very difficult work and , it's really compelling work as you can tell by things like Rick and Morty or the Simpsons or Family Guy being some of the biggest shows of all time. I think this is such a compelling workflow, right? Because you see one small team making something that could easily scale to a very large size.

Gavin Purcell: And if you go nitpick piece by piece, there are a few frames. You can see things that are a little funky, but there were kids shows, Kevin, when I was a kid, like Scooby Doo, which they literally reused frames often in the show, and they were still really

Kevin Pereira: In Disney films they did. They're the same cells of animation used from movie to movie. So absolutely.

Gavin Purcell: That's right.

Gavin Purcell: And I also want to shout out the Google DeepMind podcast. , this is a really cool podcast that's been happening for a couple years, and they had one of my favorite AI speakers on it, Demis Hassabis, who obviously leads Google DeepMind, Demis talks about the future of AI, where we are right now. He [00:38:00] really gets into the idea that we might be both overhyped and underhyped right now, which I think is a compelling argument.

Kevin Pereira: You really want AGI to be able to peer into the mysteries of the universe down at the Planck scale, like kind of subatomic. Yes. Quantum world. Yeah. Do you think that there are things that we have not even yet conceived of that might end up being possible?

Kevin Pereira: I'm talking wormholes here. Completely. Yes. I love wormholes to be possible. There is a lot of still things we don't understand about physics and the nature of reality, you know, obviously the quantum mechanics and unifying that with gravity and all these things and there's all these Problems with the Standard Model.

Kevin Pereira: Giant gaping holes in physics. Yes, in physics, all over the place. It would be great to come up with new theories and then test those on massive apparatus, perhaps out in space. The reason I'm obsessed with Planck scale things, Planck time, Planck space, you know, is because that seems to be the resolution of reality. That sounds like a PlayStation 6 marketing tactic, right? Remember the emotion engine for PlayStation 2, it, , crushed the Dreamcast? [00:39:00] This is, this is the resolution of reality.

Kevin Pereira: I'm on board.

Gavin Purcell: , the audio is great. You can go find it wherever you want. All right, Kev, we should jump into what we did with AI this week and

Kevin Pereira: Oh, we don't want to have, uh, baby Brogan interview baby Hasabis? We don't want

Gavin Purcell: don't trust, I don't trust that Baby Brogan would be genuine with the baby Hasabas. I feel like

Gavin Purcell: that's not going to work.

Gavin Purcell: But speaking of Baby Brogan, what we want to talk about this week , is a workflow that's kind of unique and interesting and utilizes a lot of these new AI tools. And there are a lot of new things that have just broken in the AI video space.

Gavin Purcell: There's runway gen three turbo, which means you can actually generate clips way faster. It's not as perfect as gen three alpha, but it's way more usable because you get to see this stuff faster. There's also a new update for Luma's dream machine. There is a new, , open ended hotshot video model, which you have four people made.

Gavin Purcell: So there's all these new tools, but most importantly, Hedra, , dropped 1. 5, which is their new character animation software in Hedra. If you remember, , as we talked about before is a very good lip sync [00:40:00] model, meaning that you take an image. It can get the words right and it kind of moves the head in an interesting way.

Gavin Purcell: So this week I created a, I'm a big fan of these dating videos from the 1980s and early 1990s. The found footage guys kind of made it popular. There was a parody of them we did when I was at Jimmy's show, , but it's people talking to camera and kind of , telling their perspective dates about what they are.

Gavin Purcell: So. I went through and I said, I wonder if I could make something with all these tools together. And I made something called AI dates for you. So Kev, I wonder if you can play the first one of these I made, and then we'll kind of talk about how I did this.

Kevin Pereira: Meet Chris. It's not hard to understand. Boats, I love them. They're my, uh, you know, happy place. Give me boat and hoodoggy I'm there. What were we talking about again? Oh yeah, boats. Okay, so this is, this is Chris feathered hair, standing on a dock talking about boats. Yeah, look a date with me. Uh, let's just say I've got a special plan.

Kevin Pereira: You [00:41:00] come onto my boat. It's a sweet 10-year-old cruiser. Uh, you might have to help me out with a few things in case it doesn't start. I hope you like getting dirty because, uh, well, the engine gets kinda greasy and I don't really understand engines. Do you? I'm looking for a woman that understands how to fix boat engines.

Kevin Pereira: Did I mention that?

Gavin Purcell: You can get a sense of what's going on here. And I think that , I'll walk you through my workflow real quick, Kevin. Then we should talk a little bit about like how I think this could kind of change the future of media at large. So this started with a dumb idea in my brain, as most fun things always do.

Gavin Purcell: Then was like, Hmm, I wonder if this , could be something I started writing a script and literally when I say writing a script, it was just kind of like putting some ideas down on, on, in my computer. And then I, the real thing that was like kind of the trigger was, I went to Midjourney to try to create an image.

Gavin Purcell: So I found a prompt that somebody else had used to make like a 1980s style like movie character. And we'll put all these prompts in our show notes so everybody has a sense of it. So I created this character who looked like, originally looked like an [00:42:00] 80s action hero, but then I wanted to tweak him slightly and I put him in a marina.

Gavin Purcell: And the marina part, Came from that prompt. And then I was like, Oh, I wonder if this guy is like a boat guy. He could be a big boat guy. So I then finished the script. I wrote out these kind of three parts. I took that, , image, which was a single image of Chris, the final image that I have in here. And I used it in both runway gen three turbo to make that image look like it was alive.

Gavin Purcell: And I put the graphics over that in an editing program. And then I used Hedra and I just uploaded the audio, which I got, I made the audio in 11 labs. I uploaded that audio into Hedra, downloaded it and put it all together. And to me, this is the promise of these tools, right? I am not. An expert at any of this stuff. you could argue that I spent a lot of my life making dumb things for television comedy stuff. So like, at least I have like an expertise of the idea brain a little bit, but I am not a, graphic artist.

Gavin Purcell: I am not , an AI artist. I am not [00:43:00] a cinematographer. I'm not any of these things, but I was able to piece all this together. And the craziest thing Kev is I did this first one in about 45 minutes And that felt like the Big unlock to me when you look at what's possible here.

Kevin Pereira: We use the same workflow essentially to bring Baby Brogan to life for this episode and the guests. I think the only difference is that rather than Midjourney, we used Flux, a different AI image generation software. But just describe the character that we want with a single still image, throw it in a runway, give it some motion and movement if you want, use Hedra to make it talk.

Kevin Pereira: That 1. 5 update, Is pretty incredible. The new update, they promise more realistic blinking and head movements.

Kevin Pereira: And , it's there, you can see it. Like it is an impressive program. And now I go, okay. How do we iterate on top of this? Can we take the Hedra output and track it onto runway movement? So it's not just a static avatar and get. Actual camera movements with the lip flap. Can we [00:44:00] feed everything through face fusion, a free program that has a face enhancer and a lip enhancer so that we can clean up some of the blurriness.

Kevin Pereira: And your mind starts to spin about just bolting other tools into the process to improve the output.

Gavin Purcell: That's right. And Kevin, this should be our time. We're asking for VC funding right here. So this is our product. I'm just kidding. But also , that's the interesting thing, right? Like we're just like kind of two dummies kind of like banging around at tools. That's what's possible now. And I think it's just as something to for everybody listening to this or watching this, like go try this workflow.

Gavin Purcell: Like it's very simple. You can do it. I know people that have been watching and listening to our show forever have done stuff with lip sync models in mid journey or try to make music, but like It is now possible very quickly to make an interactive thing. Like I've been thinking lately about like, what would it look like to make like a, a sitcom?

Gavin Purcell: Imagine a show like the office, which is done with a lot of like looking at camera, like docu documentary style.

Kevin Pereira: I want to see now, especially that you've [00:45:00] rendered. The TV show aspect of it, the AI dates for you. I want to see that old found footage of the love connection powered by AI because it is a Chuck Woolery, a host seated at a couch, talking to someone else who's mostly seated and watching video clips.

Kevin Pereira: Of date reactions like that's totally it. And you can hallucinate footage from the dates themselves, which that show didn't even have. So that's just one format. Let's see more talk shows. Let's see e sports streamers and live streaming content parody, because that is mostly a person sitting at the computer and you don't need.

Kevin Pereira: A beefy workstation to do any of this. You don't need a 5, 000 graphics card. In fact, most of the tools that we've talked about, there are free versions of them available. Even Hedra gives you a watermarked free output to try. So sign up , for foul. That's FAL. They let you play with flux for free and quickly.

Kevin Pereira: They'll give you a dollar worth of credits, which will give you hundreds of image generations. Take that. Throw it into [00:46:00] Hedra. Use your own voice, which you can record right on their website. Use AI to make a voice. , there's no reason you shouldn't be experimenting with this stuff right now. Unless you're a Procreate user.

Gavin Purcell: or you are a bot listening to our show. And in that case, do everything you can bot to up our algorithmic work.

Gavin Purcell: , Kev, we should shout out our, we have a five star review. We want to read is a very short and fun one, but it is from I am carp and it says fun, funny, and informative. I never miss an episode.

Gavin Purcell: I also listened to last week and AI good show. One of the hosts was a guest host on last week and I, and I was so impressed. Wop, wop, wop, wop. That was me. Thank you for the great infotainment. Thank you. I am Karp. That's wonderful. If you leave a five star review for us on iTunes, on Apple Podcasts, we will read it.

Gavin Purcell: And also, guess what, Kev? Just this week, Apple Podcasts is now accessible on the old internets. You do not have to go through their app to leave a review. So if you're listening to this, go to Apple Podcasts right now on the web and leave us a review. Please. It doesn't have to be [00:47:00] five star. We like it if it is.

Gavin Purcell: But the five star reviews, we will read here.

Kevin Pereira: That's right. You want to leave four? Hey, that's your business, but we're not reading it. And that's our business and businesses. Boom! And also a huge thank you, magical scroll time on the screen, Gavin, yes!

Kevin Pereira: Thank you to our patrons. Thank you to everybody who thus far has expressed an interest in jamming a couple dollars into our g string or the tip jar on the counter. Whatever visual you want, that's fine. It doesn't matter. The point is, we appreciate you very much. Thank you to all those fine Find folks that just scrolled along.

Kevin Pereira: We will have the Discord up and running within a week's time. So go over there, back us on Patreon, get over on the Discord, and start sharing your amazing creations with the world.

Gavin Purcell: That's right. Thanks, everybody. We'll see you next week. Bye bye.

Kevin Pereira: Bye.