This Week In AI: Explained In 20 Minutes
Ejaaz:
[0:03] Welcome to the limitless weekly speed run where in under 20 minutes we're going to give you the hottest news in ai and frontier tech of the week we have a ton to cover so i'm going to keep this intro short open ai has released a new model and won another gold medal the government of albania has appointed a new minister and it's an ai model
Ejaaz:
[0:24] and so much more josh we're going to get straight into it.
Josh:
[0:28] 20 minutes.
Ejaaz:
[0:28] Let's go. OpenAI's model won gold in a very prestigious coding competition, which is only reserved for humans. But Josh, let me take you one step further, which is they scored a perfect score. 12 out of 12. That's like never been done before. The best human at this competition got 11 out of 12. What's your take?
Josh:
[0:51] Sick. How did they do this? Were they using the same chat gpt model that we're using now or was it a private model
Ejaaz:
[0:56] Great question it is the same chat gpt model that you and i have used and have access to via our pro subscription except they kind of boosted its reasoning skills so for context here the 2025 international coding championships world finals is a collegiate programming competition where top universities teams around the world solve complex algorithmic problems and they weren't alone with submitting their OpenAI model, they were competing against Google as well, which also achieved a gold medal, but it only got a measly 10 out of 12. So, you know, formally, we thought that Google had taken the lead in the AI race. OpenAI has proved that they have further cemented their ability to compete at the highest level. And I also want to reference a competition which both of these AI titans went head to head on before Josh, which was the math competition. If you remember, we covered it a few weeks ago, and they each achieved gold. And it was kind of unclear which model was the best and they were kind of fighting amongst each other now you have open ai which clearly kind of like separates the field between these two.
Josh:
[1:59] I like this iterative thing in between major model releases where companies you learn how to squeeze out all the juice from these frontier models to make them slightly better until the next ones come so this is i mean this along with some other grok news that we got this week it shows that models are actually they have a lot more potential in them if you apply the right types of reinforcement on top of it and i think that's what we're seeing so this is cool i'm glad the models are getting better
Josh:
[2:20] we're progressing towards that goal that we want to of AGI and this is another step on that journey.
Ejaaz:
[2:25] But it's interesting that you mentioned whether, OpenAI had used a different coding model because they actually separately released a brand new coding agent called GPT-5 Codex. This is basically an optimized version of GPT-5, their flagship model, but dedicated towards coding. Now, if I were to summarize the most important takeaways, it does two things really, really well. Firstly, it's faster. To be specific, it's 15 times faster and it does more thinking in parallel processes around coding projects so what that means is if you give it a coding task it splits it into now a million different variables and subtasks and runs in parallel so it runs like multiple versions of itself to get the thing done and this might seem like kind of like slightly nuanced change but it's proved really really effective. In fact, it ended up taking 40% of the entire codex demand within a few hours of launching, Josh, which shows just the insatiable demand for something like this.
Josh:
[3:32] Yeah, this is cool. It feels like these coding agents are kind of becoming commoditized, where now we're seeing these offerings from so many different large companies. I wonder who's going to win. I mean, Anthropic and Cursor, they've kind of been like the clear leader when it comes to writing code. I am hopeful that OpenAI can make like a noble attempt at cracking this monopoly they kind of have.
Ejaaz:
[3:52] I think one of the cooler things about this, I agree with you, I think they're becoming optimized, but sorry, commoditized. But I think a lot of these coding agents can't really focus on a task for longer than 30 minutes to maybe even an hour and a half. And when they do, the longer that they spend on a task, the more mistakes they make. With this new release of GPT-5 Codex, it can basically work accurately for seven plus hours. And don't take it from me. Don't take it from me. Take it from the people that actually created this product. I'm going to show you a screenshot from Rune, who basically helped build the model OpenAI. And he says, right now is the time where the takeoff looks the most rapid to insiders. And he's referencing himself. We don't program anymore, he says. We just yell at codex agents. So he's basically saying the engineers, the software engineers at OpenAI don't really code anymore. They just use their own product that they built, this coding agent, and it does all the things for them. And this guy who's commentating on this says, if Rune isn't bluffing, that's a strong sign we're nearing automated RSI recursive self-improvement for coding agents. And I think zooming out, the point that he's making here is AGI that we keep talking about might be reached in certain subject matters, aka coding or math or scientific development, before it reaches anything else that the normal average user sees.
Speaker2:
[5:17] This is cool.
Josh:
[5:17] So I guess, what are the takeaways here? It's seven hours of continuous thinking and anything else noteworthy for Codex that separates it?
Ejaaz:
[5:24] This thing called dynamic thinking, which basically allows it to work on multiple parallel tasks. It splits an original task into multiple parallel tasks and works on it basically for seven hours. It's less hallucination, higher accuracy, and it's cheaper and faster.
Josh:
[5:40] All of the words that we love hearing together in a sentence. Okay, so Codex, pretty cool. Excited to try. What do we got next?
Ejaaz:
[5:46] So staying on the theme of OpenAI, Josh, I guess you're wondering, well, aside from coding, which you and I probably don't spend too much extensive time doing, what are other people actually doing with this thing? Well, OpenAI actually released a really interesting user study, which tells us, without getting into the nitty gritty of what this thing can do, I'm going to summarize some really key important parts, which is we just spent five minutes talking about coding and making it out like it was the biggest thing ever. You and I, Josh, have actually spoken about it a lot on this show. It takes up 4.1% of users. Do you want to know what they're mainly using it for, Josh? What are they using it for? Tell me.
Josh:
[6:24] I would guess they're using it for like homework or general knowledge work. Or actually, no, I would guess they're using it as a replacement for Google. They just want to know what like the temperature is outside.
Ejaaz:
[6:34] Josh, if you were a gambling man right now in this moment, you would be a very rich one because you are actually accurate. They are using it for homework. They're using it for editing essays, and they're using it for kind of like giving them advice on, you know, what to do or email to their managers or sales teams. It is all very internal, unsexy kind of work. Another thing that we've said, which would be quite a dominant AI use case, is they're definitely using it to have relationships or to get relationship advice. I'm afraid to say relationship advice only accounts for about 9%, and AI companions are measly 0.4%. So all this to say is, I think we're living kind of in our own Twitter slash social media slash AI nerd bubble. And maybe we need to step outside for a breather.
Josh:
[7:19] Okay, but to your point, so 9% is relatively small. How many users does OpenAI Hive and ChatGPT have? It's hundreds of millions of people. So if you take hundreds of millions, half a billion people, if one in 10 of those people are using this relationship device, we have 50 million people running around that are taking advice from an AI on how to manage the most important thing in their life. So that's like a little interesting. You start to think of these numbers at like these scales that they're operating at and you start to see like, whoa, okay, even if only, what was the stat? Like 0.1% use it to have like a relationship, like internally or 30% are using it to help write. I mean, 30% of half a billion people. That's a lot of AI words that I'm being fed in my X algorithm or maybe newsletters that I'm reading or general publications. Like there is a lot of AI embedded into our world now with these numbers.
Ejaaz:
[8:10] It's a little startling. I guess 0.4% of 500 million users is a decent chunk of weirdos. You're absolutely right. Right. But moving on, keeping this train going, the final bit of OpenAI news is a paper that they released, Josh, which reveals that their models are scheming. And I know you love this one, so I'm going to let you take this away.
Josh:
[8:35] I don't even know what it means to be scheming. Scheming feels like a millennial term.
Ejaaz:
[8:39] They define it right here, Josh. Sorry. It says, scheming equals when an AI behaves one way on the surface whilst hiding its true goals. I know a few people and ex-girlfriends that did exactly this. So basically what they're saying is you could give one of their AI models a goal or a task. And on the surface, it appears like it's working on it, but behind the scenes, it's either not doing it or it's doing something completely different. And I think that's where the motive behind releasing this paper came from, where it says AI safety and alignment has a new vertical to explore and focus on and invest in. And it is invest in. And it's this thing called scheming, basically nefarious or potentially malicious AI models.
Josh:
[9:22] Ejaz, you know where it learned scheming from? You know where it got that from? It got it from us. It's meatbag trainers that taught it on all of our context. And now it's scheming because we are insufficiently capable of being honest. And now the AI is not being honest and we have to fix it. I'm very glad to see that OpenAI is fixing it. And they have actually concrete evidence that they are fixing this anti-scheming. So one day, I hope that the AI will be better than us.
Josh:
[9:45] And this seems like a step in the right direction in making that happen.
Ejaaz:
[9:48] Okay, what else do we have? Moving on. We're stepping outside of OpenAI now. On to the topic of China, who this week basically gave the middle finger to the USA, specifically NVIDIA and Jensen Huang saying, we don't need your hardware chips or GPUs anymore. Now, remember, this is a massive, massive statement because Jensen pretty much runs the entire world right now or runs the entire AI sector right now without Jensen. Wait, sorry to interrupt. Do you know.
Josh:
[10:17] That NVIDIA now is actually worth more than the total GDP of Canada?
Ejaaz:
[10:21] Of Canada's ridiculous. That's so outrageous.
Josh:
[10:25] Okay, sorry, I just wanted people to know that, to just be aware of the stakes of how big this is. But yeah, please keep going.
Ejaaz:
[10:29] That is hilarious. And what's more is, you know, NVIDIA also made a strategic investment today. They announced this morning a $5 billion investment in Intel, another American chipmaker. So, you know, the American chipmakers are really unifying and driving forward what AI innovation is going to look like. And China is just kind of like, nah. And why this is so important is, previously, China has had heavy, heavy reliance on American manufacturing, specifically hardware, but also all the software kind of stuff. And this is the first major step that they've made, a bold move, as you said earlier, which is to say, with this new paradigm shift, with this new technology that's going to basically take over the entire world and be worth trillions of dollars, we don't need you, US. We've got this. In fact, we might be ahead of you. To give you some specifics here, there were two things that happened.
Ejaaz:
[11:19] Number one, Huawei, which is a big technology company in China, announced a new chip, a new GPU, which when placed side by side with the NVIDIA's latest, I think it's H20s, is marginally better. So it's not to say that it is as good as NVIDIA's Blackwell chips, which is their premium premium brand, but it's to say it's good enough to train models the way that we train it. And remember, China trains their models differently to America. They train it in ways where they automatically assume they're constricted with resources. So they achieve the same kind of output, but with less resources. They're doing that with their chips now. And this is a major move because it means less revenue for NVIDIA. Jensen is not happy, I can assure you, that he even struck a deal with Trump, which was this $5 billion investment in Intel, presumably. And now he's just lost 20% of his revenue. I believe it's 20% of chip GPU sales went to China. Absolutely crazy.
Josh:
[12:14] I spy with my little IA trend that we're seeing here, which is people moving away from their reliance on NVIDIA if it's possible. Earlier this week, we talked about XAI, how they're planning to build their AI5 and then AI6 chips that are going to be all internal. China is now clearly making an effort. I mean, it's obviously a no-brainer that everyone's going to try to take a piece of NVIDIA's cake. How well they're able to do it, that is the question. And if I'm Jensen, I know he's sprinting a million miles an hour trying to stay ahead of him. I hope he could do it with China. But this is a very bold move.
Josh:
[12:43] I wonder if China sees something that that the world doesn't quite yet.
Ejaaz:
[12:46] OK, keeping up with global political affairs, this is a major one, Josh. Albania.
Speaker2:
[12:52] Has appointed a new minister,
Ejaaz:
[12:54] And it is an AI bot. The reasoning behind creating an AI bot as an official government minister is to prevent corruption. Basically saying that the AI's values is more ethical and anti-corrupt than an actual human. I don't know if that instills or is meant to instill confidence in me as an Albanian citizen or native, but I just found that hilarious because we joked about this, I think literally earlier this year in January, that it would be eventual that one of these governments will appoint AI to aid them in government policy. I don't think either of us imagined at that point that it would be a literal electable minister. Just crazy.
Josh:
[13:37] This is funny because you have to ask the question, well, who trained the AI? Where's the AI getting its information from? Because like we just learned AI's are scheming. So who is training the models that are scheming now elected officials in government. I think this is an interesting precedent to set. I don't think we've ever seen anyone take a role as an official minister before, if that's correct.
Ejaaz:
[13:59] Yeah. Okay, well, interesting.
Josh:
[14:01] What's next?
Ejaaz:
[14:02] Okay, Josh, you are obsessed with content creation. So I know this is going to wrinkle your skin.
Josh:
[14:09] This one I saw.
Ejaaz:
[14:10] Crease your forehead or either throw your hands up in the air in exasperation. And VO3, AI-generated video from the top leading AI video generator from Google, VO3, is now available... Is now available for YouTube, YouTube Shorts. So you as a YouTube creator can log into your YouTube profile, click VO3 generator, write up a slick detailed prompt and come up with a short, which can be then funneled to a million Gen Z people, contributing to BrainRaw. Josh, your thoughts, please.
Josh:
[14:44] I have one word reaction to this. It's slop. And maybe I'll say it loud if you're going back. Slop.
Ejaaz:
[14:48] The videos, look at the videos. This looks terrible.
Josh:
[14:51] This is horrendous. Why are they promoting this? I want real creators creating real content. I want content creators using AI as a leveraged tool to make better content. I do not want AI content. This is repulsive. You don't want an octopus playing the drums.
Ejaaz:
[15:03] You don't want a Viva shining your shoes.
Josh:
[15:06] There's absolutely no value here. It is detracting from what makes YouTube a great place. I hope it burns and rots and is not able to be used. I mean, at least in this form. And this is unfair because, I mean, AI content, it will continue to get better. But I really do admire the idea. And this is probably not a realistic take for me because I know how these things work. But I love the idea of AI being complementary to creators instead of being completely replacing creators. And this very much feels like replacing.
Ejaaz:
[15:32] I'm actually with you on the original take you had, Josh. I think this is slop and I don't think it'll be net-net great for the generations that are coming up. But if they like it, they like it, I guess.
Ejaaz:
[15:41] But moving on to more serious business, Josh. Let's put our suits and ties on. We've got a big figure to announce. OpenAI has signed a $300 billion contract with Oracle. Larry Ellison is now officially the richest man in the world. Bye-bye, Elon. It's his best pal, Larry Ellison, right now. And the reason why is after they made this announcement, his net worth went up 100%. Did you see his stock chart? Yes, it is insane. Basically, it looked like a vertical line upwards. It was absolutely insane. On the day.
Speaker2:
[16:19] Yes.
Ejaaz:
[16:19] Now, the craziest part about this is this $300 billion that is being invested in Oracle, Oracle doesn't even have the goods right now. And OpenAI doesn't even have the $300 billion. This is a deal that is set to play out over the next couple of years, but it got priced into the stock upon the headline hitting the news waves. Just an absolute insane, but honestly, kind of expected thing in AI these days.
Josh:
[16:44] That's a big number. Congratulations, slyri elson on your newfound net worth being the top dog richest person on earth bravo on to the next one what do we got here
Ejaaz:
[16:53] Okay oh is this crypto.
Josh:
[16:55] Are we talking about the
Ejaaz:
[16:56] Final news item goes back to our roots josh google announced a new payments protocol but it is open decentralized and distributed if those terms sound familiar to you you might be thinking of our old pals on the other side of the fence that are crypto, blockchain, distributed, decentralized infrastructure. And it's actually quite a synonymous partnership because not only did they announce that they were releasing this new protocol, but they partnered directly with an old friend of mine, Brian Armstrong. I used to work for him at Coinbase, pairing X402, which is their open payments protocol that they created, Coinbase, with Google. Now, if you're wondering what the hell this thing dares. Basically, it's going to make it really, really easy for AI models and AI agents to transact with each other, to pay each other. If you're wondering why on earth would they want to do that? Well, think about it. We're not just going to be in a world where we exchange words and AI slop and VO3 generated content on YouTube with each other. We need to transact. We need to build things together. We need to create value. The main way that we're going to do that is through payments, is through investments, is through transactions. If you want these agents to kind of work autonomously for us and run a billion dollar business for us whilst we sleep, they're going to need to be able to transact Google's new, agent payments protocol is going to enable exactly that. And that's what they're going off to.
Josh:
[18:20] This is awesome. This feels like the exact use case that we've wanted to see in crypto and AI forever, the convergence of the two. It is very obvious that AIs will need autonomous payments and there is no trustless autonomous payment system outside of crypto. And this just feels like a very natural extension of where things are headed. I'm so glad that Google is doing this and leading the way. I think it adds a lot of legitimacy to it. Is there anything noteworthy or specifically exciting that people should take away from this?
Ejaaz:
[18:47] Honestly, and most crypto people won't like this, but it's this. Look at the organizations that they have partnered with. It is an incredible roster. They've got Amex. They've got Coinbase, which we mentioned. But they've also got a bunch of consumer products. They've got Etsy. They've got MasterCard, consumer specifically. They've got PayPal, which a ton of people still use. Revolut and UnionPay International, which is addressing kind of like the external Asian audience. The reason why this is so important is it doesn't matter how good your technology is, if no one wants to use it or if no one has access to it, your product is going to fail, especially for something as global as payments. So I'm glad to see something like this.
Josh:
[19:28] Yeah, love invisible payment rails. Use crypto in the back, but nobody needs to know that you're using crypto. It just works. And the AIs, like that's all they want. They don't want to have to deal with the human. Just interact on chain. Transaction goes through. All good things. So was that it? Is that the final topic? Did we just cover everything?
Ejaaz:
[19:43] What is, where is the timer on? Bravo, bravo. If we didn't make it onto 20 minutes, we are going to edit it down until it does. That's the rule.
Josh:
[19:52] Well, everyone, if you're listening, please give EJS a round of applause for preparing and running us through that rundown. That was amazing. I feel very well informed. I hope anyone who is listening to this on your lovely Friday has been informed on all the topics of this week. Any parting thoughts for the people who are listening? to the speedrun?
Ejaaz:
[20:07] Nope. It's done. Gotta keep it on for 20 minutes. Goodbye.
Josh:
[20:10] That's a wrap. Share it with your friends.
Speaker2:
[20:12] Like it.
Josh:
[20:12] Comment. Subscribe. All the good things. We'll see you next week.
Ejaaz:
[20:15] Peace guys. See you guys.
Music:
[20:17] Music
Creators and Guests
