Assembled co-founder John Wang on building a AI native support system for enterprises

Beskrivelse

In this episode, I sit down with John Wang, the co-founder of Assembled, to explore how AI is revolutionizing customer support. Having transitioned from a Stripe engineer to an AI startup founder, John shares his unique insights into the evolution of support tools. We delve into how these tools have shifted from being mere cost centers to becoming strategic assets that enhance customer experiences. John and I discuss the impact of AI on support volumes and staffing, highlighting how integration is reshaping the landscape. He emphasizes the importance of talent density and assembling high-caliber teams to drive success in the tech industry. Through his experiences, John provides practical insights into AI's current capabilities and limitations in support operations. We also explore the strategic considerations for future AI support ecosystems. John shares his thoughts on the role of support in driving revenue and customer satisfaction, and how AI can orchestrate with human support agents to create a seamless experience. His perspective on building high-performing support organizations offers valuable lessons for anyone looking to innovate in this space. Every episode, I bring in a guest with a unique point of view on a critical matter, phenomenon, or business trend—someone who can help us see things differently. Season two will host a series of guests from early-stage investing, as well as builders, founders, and product managers. For more information on the podcast series, see here. [https://aiproem.substack.com/p/launch-of-differentiated-understanding] To find the previous episodes of Differentiated Understanding, see here. [https://aiproem.substack.com/podcast] Chapters 00:00 The Journey from Stripe to Assembled 02:25 Understanding the Importance of Customer Support 05:29 Lessons Learned from Stripe 10:25 AI in Customer Support: Current State and Future 16:04 The Economic Impact of Support Operations 18:25 The Role of AI in Transforming Support Jobs 24:30 The Future of Support Organizations 26:58 Guardrails Against Fraud in AI Support 32:42 Navigating the AI Ecosystem 38:00 The Value of Long-Term Commitment in Careers AI-generated transcript Grace Shao (00:00) Hey, John, thank you so much for joining us. I just recorded your bio already. It’s extremely impressive. And you’ve done quite a, you’ve had quite a few different roles now as the co-founder of assembled, right? To start, can you just tell us about your story? Like what inspired you to leave Stripe, you know, go into, you know, right now what you guys are doing, which is a software for people who run customer service support operations. You know, now you guys are pivoting into AI as well, or at least leaning into AI. Tell us about all of this. John Wang (00:28) Yeah, great question. When we, well, when my co-founders and I started, we were all at Stripe. We worked on a bunch of different things at Stripe. And one of the last things that my two other co-founders worked on was a support tool, an internal support tool. And I remember pretty clearly that they were making a bunch of headway. It was really, really cool. And... They had gone to this really, really high up person and product. And this person was basically like, why are you guys wasting your time on this? Like you guys are kind of like, you’ve been at Stripe for so long, you know all these things and you’re doing support. Like I’ve got this really cool Bitcoin project that I would love for you to work on instead. And I remember my co-founder coming to me and being like, hey, like pretty bummed this is what happened. And then I was like, wait, you just saved Stripe, you know, quite a few million dollars, increased customer satisfaction by 40%. And still they don’t understand the value of this. And that’s when we were like, hey, ⁓ there’s something here where there’s a market opportunity. So that’s what got us really, really excited about support. We were doing it at Stripe. We knew it was an undervalued place. We didn’t see any very good tools out there to do support well. And so we decided to go build something really, really great in the support space and just like make transform and elevate support is our mission. Yeah. Grace Shao (01:50) Do you think it was just that stripe was too rich? They were just, and they just didn’t care about saving a couple million dollars? Or do you think it was actually a blind spot for people? John Wang (01:59) I think Stripe was definitely very rich at the time. think it was also a blind, it was a combination, right? Because most people, you think of support, you think of it as just a cost center. And I think recently that started to change in the sense that like, hey, this is actually a really important part of your business. But for a lot of companies, like if you look at FinTech, if you look at like a lot of health tech companies, their entire product is their relationship with their customers. And so support’s actually really, really important for that. And I think a lot of people underappreciated that for quite a while. And now I think people are starting to understand again, hey, if you piss off your customers every time they come and talk to you, that’s not going to be a very good thing. You better be a monopoly. Otherwise, you know, they might not be coming back. Grace Shao (02:46) Yeah, definitely. I think I want to kind of lean into that later in our conversation as well. It’s like people are trying to replace support and customer service AI first. But if anything, it’s not the best experience when you’re frustrated with a product and you keep on getting a robot, right? But I want to kind of talk more about your experience at Stripe. You were there quite early. What do you think it taught you, you know, as a very early employee at such a successful startup now? if even considered still startup and then like what were things that you think you learned there lessons even if soft skills that you kind of took away to to your current role like as a founder. John Wang (03:21) Yeah, it’s a great question. You know, it’s really funny actually. I just met up with someone where, so when I was starting out of college, I had applied for all these jobs. I was able to get a lot of them, except for this one company that I really, really wanted to go to. It was called Meteor Development Group. They built open source software. In college, I had built open source software at Ruby on Rails. I was really big in that community. was like, wow, it’d be awesome to go and make this something I do day to day. And I didn’t get the job. I was really bummed about it. And then I was like, I’ll just fall back on my second here, which is Stripe. And Stripe was the obvious second choice because just the people were really, good. And now like 10 years later, I like think about that and I’m like, the business model is really important. because Meteor was not a good business. Like open source frameworks is not a good business, but Stripe, really boring. Honestly, it’s just like payments. You process payments, you go talk to Visa. You literally have to like, we had a server in the server room that would send like a specific file with specific tabs and spaces in order to get it out to Visa. Really boring. Really, really core infrastructure too. And so like the big overarching thing that I learned was like one, business model is unbelievably important because if you can just make a good product when the kind of like market is there and when there’s a really big need, then this can scale like unbelievably fast. Two was the people. I remember talking actually to a few people, Greg Brockman was maybe the second or third person I talked to. who’s now the co-founder of OpenAI. And I remember just talking to him and being like, wow, this person is so, so smart. This is awesome. And I would talk to kind of like, I would go to the lunchroom and be talking to people at Stripe. that was just, people were talking about all sorts of things. And I think like talent density was a really, really big part of like what made Stripe successful. And It wasn’t any one thing over time. was one, Stripe was in a great market. And then two, it iterated really, really fast on a lot of little things over and over and over again. So I thought that was a really good place to learn a lot about like what makes a company great. Grace Shao (05:49) Yeah, I think it’s interesting you’re talking about talent density and a lot of the AI labs I speak to actually also talk about that. But I’m curious, what does it mean when you have really strong talent? Is it like that they are technologically superior, like they can code better? Or does it mean actually that they can think outside the box, they’re more creative, they can pivot faster? Like what does it really mean to have really high caliber talent on your team? John Wang (06:11) I think it depends on what company or like what you’re trying to solve, right? Like talent density for Los Alamos national, like Los Alamos, like building the atomic bomb is like very different than talent density for like Bell Labs, which is very different than talent density at early Stripe, which is very different also than talent density at OpenAI Research. Like I think for Stripe in particular, the type of talent density that was there was really high curiosity. Grace Shao (06:31) Right. John Wang (06:38) really high product thinking, really technical people, and people that could dive deep on certain problems and weren’t afraid to go talk to a bunch of customers. You saw so many conversations about like, how do we make this particular API parameter better for everyone? And like hours and hours and hours of like making sure it was a really, really good product. And people who weren’t afraid to like, you know, take a week of work and just like dump it away because it wasn’t quite there. So it was like, This combination of like, they worked really hard, they’re really smart, and they care a lot about the end result and have a high quality bar. That was Stripe’s version of kind of like talent density. But I think like, you know, if you look at the labs, if you look at different research institutions, maybe it’s just, you know, I don’t know, the raw ability. Yeah. But. Grace Shao (07:26) research capabilities or whatnot, right? No, that makes a lot of sense. Yeah, I wanted to ask you earlier on in our conversation, you said, you know, look, a lot of people overlook support. It’s not that glamorous. People kind of think it’s like a back office thing. But, you know, is that was that your view back then? How does you kind of, I guess, lean into this? And did your perspective or support change over the years? Now you say it’s very important, right? Did you understand the category correctly? Do you think? John Wang (07:52) You know, I think that when we looked at the category, we went at it from like kind of the lens of, Stripe was this company that worked in this unsexy space and did really, really great things. And we thought very similar things about support. It took us a long time to really grock support. And we talked to hundreds and hundreds of different people across different parts of the support stack. And I think early on, honestly, it was good and bad in certain ways. It was like, we thought we could build a piece of software really quickly that solved everything. Or like, you have that problem, we can build that in two weeks. Not a big deal. And we could solve the specific problems that they had in two weeks. And I remember talking to actually a few people, which was like, the system that you’re trying to do, which is called Workforce Management for Support. that’ll take you seven years. And we’re like, no way. Like we can do this. We can do this so fast. It’s going to be done soon. And now like seven and eight years later, we’re still working on it. We’re still uncovering more and more things. And that was probably the right, you know, that was probably the right call. But also there’s like some importance to naivety, which is like, if we had known that we wouldn’t have started. like we, yeah, like. Grace Shao (09:06) That’s why a lot of people say, yeah, as founders, right? John Wang (09:09) Yeah, so I think it was the right thing to do, which is just like start building stuff. Grace Shao (09:14) It’s amazing. ⁓ Why don’t we pivot into actually understanding your product bit better? So for someone who has never worked in support ops, what is the simplest way to explain to them what Assemble does? Because even between us, we had calls, we had back and forth emails. I was like, John, I don’t understand what you guys do. I’m trying to read through this material. I’ve listened to a few in the interviews. I don’t know what’s happening. Can you just dumb it down for me and explain to me what exactly you guys do? John Wang (09:37) Yeah, for sure. Let’s say you have like 10 people on your support team and you only do email, then you probably just staff them nine to five, right? Like there’s no big deal there. Once you start having a few more people on your support team, let’s say you have hundred people now and you might want to chat to your customers because AI chats, AI chat bots are a really big thing. Then you actually need to start thinking about when do these chats come in and how many people do I have in order to handle those chats, right? Because like, if you were talking to a chatbot, you’re getting instant responses back and forth, back and forth. And then you’re like, I have to wait 48 hours for the human response after I get handed off. That’s a really bad experience. So the problem is, you you’ve got a bunch of people who are calling in to support, writing in, who are chatting in. and they’re coming in at all different times of the day, they’re calling in for different types of problems, right? You, on the kind of like back end, you have a bunch of people and those people might be able to do different types of things. Like I might be a really good person to handle, you know, where’s my money kind of issues, but I might not be as good at like ⁓ fraud issues, right? Like if you’re having problems with fraud on your account. So there’s a lot of ways in which you can actually put people to the actual incoming tickets. And what our platform does is it tries to match those two things up. if you think about supply and demand, supply is the people that you have and demand is the people, like your customer is asking for questions. And if you don’t match those up well, you’re gonna either... spend way more money than you need to because you’re just going to staff everything way above what you need, or you’re going to have a terrible customer experience because it’s going to take you a really, really long time to get back to people. So it’s really an efficiency play. How do we make it really, really efficient for you to answer questions? In the last few years, we’ve also added AI agents, which is, you know, how do you actually respond instead of just with people, but also with AI to go and answer a chat or answer a phone call directly using AI. Grace Shao (11:50) That’s amazing. I really didn’t know there was so much like science kind of going behind that. I just thought kind of like you’re on a chatbot usually you have to have your frustrating like get me someone, get me someone. I’m one of those people who like no pages pressed zero all the time. I’m like, get me a human. But it makes sense. actually once you can match the talent with like the issue, it can be a lot more efficient in solving the issue and the customer experience will be much better as well. On the AI agent side. What’s the kind of, I guess, consensus right now? Like, are they really actually good at solving issues? Are customers complaining about them? Like, ⁓ how sophisticated are they at this point? He’s like, in my day to day, you know, obviously calling the banks or DHL for pickup or package returns, whatnot. None of those agents are really a pleasant experience, frankly. John Wang (12:35) Yeah, I think this depends pretty drastically on what tools you give these agents access to. I would say that the standard experience right now is fine. It will answer knowledge questions for you. And these can solve anywhere from 30 to 60 % of incoming issues, depending on how many knowledge questions you get. the place where it really is important is when you actually give it access to say your backend database and you can like make a refund or you can look up in order or you can identify why is my what’s going on with this error, right? And that is actually the hard part that prevents most of these banks and airlines and etc agents from being very good is because like that Access to data is a thing that they need to actually run and be able to perform actions. And then also the evals for that are really, really hard and not something that you just like launch without really thinking about it. So I’d say it’s in a progressive, like it’s in a progressing state, not at a place where it’s like, this is absolutely solved, but there are also some of our customers who have 90, 95 % of all issues who are able to be completely automated. because they’ve spent the time to give access to all of these systems and spent the time to validate that the agents are performing. Grace Shao (14:00) Very interesting. ⁓ I want to pivot into the to be kind of angle. Who are you guys actually selling to? Like who are the people inside companies that are managing this? Is it the head of support operations? And when they are buying and assessing your product like yours, is it really winning on price? Is it like over, you know, other maybe large softwares? Is it winning on speed and service? Like help us understand essentially how you guys are succeeding winning over customers. John Wang (14:26) Yeah, we generally sell into the head of support. Sometimes that person rolls up into the COO or there’s a head of operations or something like that. But generally there’s some group that is working on unsupport related things and that’s who we sell into usually. I think generally our differentiated, like the way that we actually go and sell this is one, we know all about workforce management, which is like a really, really nitty gritty detail about how you make your systems really good. And it can save you millions and millions of dollars. Almost actually, and this is one of the things that’s really funny, it’s like using our AI agents versus using our workforce management, we actually see somewhat similar gains across those two. Because to use the AI agents, you’re usually doing it so that you can reduce head count, right? And in order to reduce headcount, you need to know how much can I reduce headcount without hurting my customer experience. And for that, you generally need something like Workforce Management. So what we do is we go in, usually we have Workforce Management helping you understand how your system is set up. And then what the AI agents that we can also bring in is a relatively easy sell because our AI agents are really, really connected to kind of like how you staff and when you pull in people. The thing that you were mentioning, which is like, hey, my bank still doesn’t have a very good experience, that’s true of a lot of places. And getting access to information is really hard. So escalating to a human actually happens pretty frequently, sometimes 20, 30, 40 % of the time. So getting to the right human or the right or figuring out when to escalate to the right human is a really, really important skill to have. If you spend a million dollars a year with me, I should escalate you much more quickly than if you are a free user and you haven’t spent any money for me ever. Similarly, depending on the topic, depending on what kinds of things you’ve already previously talked to me about, I should be able to get to different types of agents and I should be able to have different levels of thresholds. that send me to a human. And I think because we have and handle the workforce management side, our ability to do the handle time and to make sure that you’re getting to the right person is much, better than a lot of our competitors. Grace Shao (16:45) So there’s an unspoken tier system then I guess with customer service as well that we don’t realize. How should we think about the economic importance of support operations? In terms of, we always think of it as like we said, back office support, but how much, do you have any proof that like basically better customer service equals better revenue? John Wang (16:52) There is, and sometimes it’s spoken. But yeah. You know, that’s a good question. should probably have some specific proof here. I guess the best anecdotes I can find are usually the kind of like medium to long-term anecdotes where companies that do not invest in their customer support tend to, you know, regress to the meat, right? Like if you are really trying to bare bones your way through customer support, ⁓ Your customers will understand that and it’s not going to affect your revenue right now, but it will likely affect your revenue in 6, 12, 18 months the next time that purchase happens. have seen actually some of our customers, so in our AI agents, we have a configurable setting that’s like, do you want to be containment focused or do you want to be escalation focused? And how good of a customer experience do you want? And we’ve generally seen actually that there’s a strong correlation. Obviously we haven’t run a ⁓ natural experiment or a true A-B test with this because it’s pretty impossible. But you see a general correlation between the customers that spend more money on support, the customers that spend more on trying to have a high quality experience, and the revenue growth of those companies. actually most of the customers that we spend a lot of money on care so much about support that they actually have, you know, executive briefings every week about these, about what’s going on. And they’re the people who have the largest support teams and they’re the people who kind of like make the most, make the most changes with their team. Obviously this is a very biased perspective from our customer, like set of customers, but I think that there’s still something to that where if you spend money and if you want to make a your support really, really good, that does tend to pay off with customers because they do tend to notice and it makes it easier from a product perspective to paper over all of the things that aren’t so great. Grace Shao (19:06) Yeah, no, totally makes sense. think even as consumers ourselves, we would be likely turned off by certain brands or experiences if the customer service really bad, right? Unless, like you said, they’re a monopoly and there’s nowhere you can go. All right, let’s talk about AI. You kind of touched on that earlier, but the naive view is that AI automate support, you know, a lot, a lot of the conversations right now about, my God, jobs are going be taken, especially the first batch is probably in roles like operationals and customer support roles. Second batch, people are saying are maybe in like more repetitive execution roles like junior consulting roles, a lot of junior training up roles, right? How do you see that? Because at least from where I sit in Hong Kong, a lot of stories are coming out saying markets like India, the Philippines, know, across Southeast Asia where they traditionally served as those telephone call centers or operational centers, they are getting caught. Is that going to be a trend forward? you know, how should we understand this? John Wang (20:02) Yeah. Yeah. I think there’s a few things. There’s like, like with all things, there’s a lot of nuance to this, which is I think your trend on seeing, you know, what we call tier one support, the first line of support who are traditionally humans outsourced. That is a place where we’re seeing a lot of change. And I don’t think that trend is going to slow down. That said, there’s a very interesting other trend that we’re seeing, which is that total spend on humans and headcount isn’t necessarily going down by that much. And it’s kind of like Jevin’s paradox where we see a lot of our customers and a lot of customers of other AI users ⁓ who have amazing resolution rates. They’re like answering so many questions, but that’s causing actually, or maybe there’s some correlation here of the number of tickets they’re getting and the number of chats they’re getting is like way, way, way higher than before. And I think there’s a few parts to this. One is you see way more ability for your AI agents to answer questions. And so obviously people are going to ask more questions because like, Hey, it used to be really hard for me because I had to literally type out an email to a human, wait a few days and get an answer. And now I can just like get an instant really good answer. Right. So I’m going to try asking more questions. The second thing is, as these companies do better and better, you actually just have this natural induced demand of increasing usage, numbers of people who are asking for support. So the higher amount of support that is automated is also, the general number of how much support is coming in is also very high. And so that actually offsets a very, very large portion of the head count. The head count is changing though. It’s not going to be the typical tier one support where it’s just like, answer an easy question. That is mostly going to go away to AI, think. The types of head count that is coming in are like, know, internal agents, people who are really good, people who can provide white glove support and like... actually go talk to people and provide like a human experience because like our companies still want and really crave giving that experience to people. And that’s just not what the kind of BPO standard really is. So I think it’s changing in the type of what you would see. Grace Shao (22:31) Yeah, I was actually going to ask about that, like, as in when AI agents start resolving more tickets, if we’re just going to see reduction of headcount. And I think you answered him when you once wait, whereas like, yes, in initial stages, but later on, there will be new jobs created, right? Essentially, people who will be managing more critical issues or even managing the agents. I want to understand. So for your company right now, essentially, are you a are you like a middleman between the human agents and the AI agents and becoming the orchestration layer, like you’re providing the service, the training and the orchestration. Like, how do we understand that? John Wang (23:05) Yeah, that’s a great question. So we think of ourselves as how do we get you to the right way to answer your question, right? In our view, there’s kind of three main types of people that can answer a support question. One, it’s AI. Two, it’s a tier one BPO’d outsourced agent. And three, it’s an internal agent who’s like super well-trained and like super, like, you really carrying about, like really trained on customer support. And what we are trying to do is make sure that you get placed at the right area, depending on what kind of issue you have and who you’re talking to and like what is the kind of like a cue that is backing up the set of people who need calls. So for us, what we’re trying to do is really provides you that ability to choose across a bunch of different options. So we don’t actually provide any, like we don’t provide any BPO agents, we don’t provide any internal agents. All we do is provide the software that routes you. And we also provide the software that can do the AI agents, or you can actually plug into a different piece of software if you want to have your own AI agents too. We’re trying to make sure that we are kind of third party and that we are making it really easy for you to optimize your support regardless of what specific providers you use. Grace Shao (24:30) So in your view, what does a well-run support organization look like in, let’s say, three years as AI adoption becomes mainstream or more more mass market? John Wang (24:38) I think you’ll probably want to have all of the different types of support using AI. So voice AI, chat AI, email AI. I think you’ll want to have a lot of nuance between the different types of customers that you have. You can’t generally provide the best level of support for literally everyone. Though this depends on also your customer base, right? Like a consumer customer base versus a super enterprise customer base with 100 very large customers is completely different. But let’s say for a standard company that might have ACVs that are in the, I don’t know, the 100 to couple tens of thousands range, then you’re probably going to have a combination of AI agents and human support. And you might have different tiers of human support, right? Some human support that’s really good at answering support questions and other tiers of human support, which is like, you’re just managing the agent. I think the other thing that’ll happen a lot is you’re gonna start to see more like, supporting agents acting in a simulation where right now, like the kind of typical flow is like a supporting agent gets a ticket and they answer it and it goes back. I think as the agents get more like, get more and more training data, get access to more information, really they’re only gonna come to humans for escalations. And similar to how Waymo works, if you’ve ever taken a Waymo, it’s a great experience, you’re like driving, driving, driving, and sometimes you get kicked out and a human operator in the Philippines is like, hey, I need to move you around this truck, right? And similar to support, That’s probably what’s going to happen. A human operator is going to come in and be like, hey, I can give you a refund right here. And then what’s going to happen is the AI agents are going to train on that, right? They’re going to like learn and get better. And you’re going to be able to use that whenever you have an interruption to understand like, why did I have this interruption? How do I make my model better for the future? And then you’ve got your closed loop. So I think in the future, you’re going to see much more of that happening than people who are just like, coming in and their job is to solve as many tickets as possible. I think the change is gonna be like, okay, people are gonna start to need to provide the best possible response in that particular instance so that the models can train on that and be as good as you are. Grace Shao (26:58) actually just on that, do you think then we’ll see more and more fraudulent activity or people trying to exploit that? like if say you know the models trained on, I say this one buzzword or one keyword and it triggers like refund. What if I just go on the call like on the phone all the time, just to be like keyword, keyword, you know, and then like how do we prevent something like that? Or do you guys kind of get involved in that building this guardrails as well? John Wang (27:21) Yeah, no, that is a age old question. think like, wouldn’t say there is going to be necessarily more or less of that, but I think like, it’s kind of like the cat and mouse game of like, everyone has always been doing that. And so like, and the methods always change every, every few months. I think the methods will change every few months here too. Our AI agents have a lot of guardrails put in place to automatically detect that. And we also have kind of like post-hoc guardrails which are like scanning through our logs and trying to identify situations where that might have happened. And we’re also training on those examples, right? So I think, yes, people will definitely start to exploit this and be like, hey, how do I get a refund faster? But there’s a ton of guardrails that you can put in place. For example, each account, you can have one or two, have like refunds without looking until that actually gets flagged and it needs to go to a human or. You can set good policies, for example, like, you know, if it is within policy of 30 to 40 days after purchase, like automatic refund, otherwise, you know, flag it and do something with it. So there’s a ton of stuff that you can do to actually like reduce the possibility of that. And I do think that it will end up being cat and mouse game like over and over again, as people get more sophisticated. Grace Shao (28:39) Right, right. And they’ll start using AI to trick AI. That’s what’s scary, right? So as we talked about, different gender standing the podcast does not have to interview anyone related to China or Asia, but we do have kind of an Asia angle to a lot of how we view the world. So my question for you really is because you’re like out in San Fran and like your company actually has no sales in China or anything. But I actually had a curious question. How does SF John Wang (28:43) Totally, yes. Grace Shao (29:05) as a whole, the startup ecosystem kind of view the current rise of a lot of Chinese AI. And have you guys yourself or your peers, you know, tried to use Chinese open source models over the years? Is there any view on the open source models given that, you know, you previously said you were very involved in open source and I think it’s part of your philosophical belief as well, right? So just kind of like the high level vibes. John Wang (29:28) Yeah, our vibes might be different than at the model, like the Frontier Labs, honestly. Our vibes, we love the Chinese open source models because it adds more competition. And I think the open source models are actually very, very good. I think from my friends at OpenAI Anthropic, they don’t like it quite as much because it’s competition. But for us, we have no allegiance really to any of the Frontier Labs. or any of the models that are out there, we want to provide the best possible experience to our customers at the best possible price. And that has meant, you know, over the years, like making changes in our models, making updates and to figure out what is that frontier of cost or performance. The Chinese models tend to perform really, really well on that, especially Grace Shao (30:12) Mm-hmm. John Wang (30:19) kind of like the latest series of models, we’ve actually spent a lot of time in the last six months kind of like pulling out a lot of our tokens. We have tens of billions of tokens per day. And a lot of it now goes to models like Quen or Kimi. And like that has actually started to really, really increase over time, mostly because you can find to them, you can do RL on them, you can... have better latency on them, you can run them on your own hardware. There’s just like so much more stuff that you can do with it. And also, you know, the cost performance latency trade off is really, really good. Now, most of our most of the like the strategy we take is actually one where we try to understand the use case and the problem and what type of model is necessary for that. So for kind of like the main model that’s actually answering questions. We’re actually usually using a frontier model for that. But actually the majority of our tokens come from out of secondary processes, processes like detecting if I need to escalate, detecting if there’s a fraud here, detecting if there’s an adversarial intent, making updates to large swathes of data in batch, like all this other stuff where you really don’t need frontier level intelligence and where if you have a a well-tuned prompt and an open source model or an open source model plus a fine-tuned model, you can get at or better in terms of frontier performance. We’ve really seen that and we’ve actually been able to save millions on our token costs in just the last two or three months by being very smart about how we use our models. And we’ve also seen a 15 to 20 % increase in quality. ⁓ Just because like when you go and you have evals, you can make things much, much better more quickly with these open source models. Grace Shao (32:14) Yeah, I think that’s like the general sense I kind of get from a lot of startups, right? In a known day, it’s like, you guys are obviously more cost conscious. What is the best price to get to what you need? And there’s like a tier system where how you use the models, you might not use the most frontier models for everything. I think that makes a lot of sense, business sense, especially. Is there anything you would like to share with us that we haven’t touched on in terms of, just the overall AI ecosystem, any thoughts on, you know, where we’re going with this AI agentic push right now? ⁓ you know, are we really going to see that, you a giant moment, like just kind of some high level thoughts. John Wang (32:49) Yeah, that’s a good question. Recently, I’ve been thinking a lot about Opus 4.7, which got launched a few days ago. And it’s actually kind of similar to what we were just talking about in terms of this price for performance ratio. And it seems like, based on my usage, based on our evals, based on other people’s usage on the coding side, that it’s a better model, but it is also more expensive. than before. like, you’re really it’s literally like a trade off in terms of dollars and intelligence. And it’s really interesting because, you know, a year ago, every model would just be like, this is strictly better, and it’s probably cheaper, and you’re to get more context and like, everything’s better. And you could basically just bet that you’re just going to like get better models across the board. And now actually, you’re just like kind of moving from this part of the like the frontier curve to the other part of the frontier curve without actually shifting the entire curve. And that’s happening with a few more model releases. You still see general increases in the frontier, but it’s less stark every single model release that you see that. And so I think it’s just an interesting area to look at because when you get into that world. Gross margins has become really important. Gross margins for ourselves as a startup, but also gross margins for Anthropic and OpenAI. One of the funny things that I’ve seen, just talking to people who are working at Anthropic and OpenAI, and also people who are trying to invest in those companies, gross margins are actually incredibly important. One of the OpenAI right now is becoming a much more... hand investment than before. And like, it used to be like six months ago, it’s like, you have it, you have shares of OpenAI, like, how do I get in? Now it’s completely different with, you have shares of Anthropic, how do I get in? And I think part of that’s because like, OpenAI wants to spend $100 billion on infrastructure. And Anthropic is a lot more measured in the way that they’re spending money. And I think gross margins actually do matter a lot right now. And that’s where I think actually Chinese open source models are making a big difference because just at the end of the day, you still have to make money. And if you’re losing money on a per token basis, that’s really bad because if you go to infinity, you lose infinity money. And if you make money per token, great. Ramp usage up as high as you can. Grace Shao (35:03) Yeah. It’s just so crazy how the sentiment shifts like so every three months I feel like and then to your point like whenever I speak to investors like oh my god I got my hands on some anthropic shares and last three months earlier. Oh my god I got my hands on opening I like it’s just like and like oh no one would invest in opening right now like I don’t want to do that like people just completely go like black and white on these things it’s pretty crazy how the pendulum swings I do have a question actually on the infrastructure side doesn’t it actually make sense for open AI to eventually own their infrastructure because otherwise they have to become continuously constantly pay the hyperscalers for all the infrastructure like so in the grand like scheme wouldn’t it make sense? I mean although obviously how much you’re spending is like absolutely crazy. John Wang (35:55) I think it actually does. And I think that’s like part of the problem, which is like, you know, if you think about what their compute costs are, I think actually doing all of these things that they were doing makes perfect sense. And it makes especially perfect sense if you have investors who are willing to bankroll this. But it’s almost like the, ⁓ what’s that paradox? It’s like the St. Petersburg paradox, something like that, where it’s like, you keep doing, your expected value is infinity and you keep doubling your money basically, but at some point you need to not double your money because you don’t have enough money. Grace Shao (36:32) That’s such a mo- I’m like, I’m still confused when you’re saying, go back. You keep on doubling your money. John Wang (36:36) Sorry, So I think the I think it’s like Let me let me look this up st. Petersburg paradox is Okay, it’s a coin flipping game and You start at two dollars and with every tails you double the pot and you can basically decide to like take your money at any time, right? and so you you’re doubling exponentially as you go up and If you compute the expected value, you should basically just like, keep going forever because your expected value is like infinite, right? Like because the doubling of the pot is better than kind of like what your losses are. You just got to, you got to run. And I think OpenAI is in this St. Petersburg paradox where it’s like, well, in theory, double everything, keep going. But in practice, you don’t have enough money. Grace Shao (37:17) Yeah, I see what you mean. John Wang (37:25) and resources to be able to do that. I think that’s actually what’s happening is like, there’s not enough money in the world, not enough investors with liquid cash who are willing to invest in a business as big as OpenAI while the gains and the returns are still, yeah, having improvements. So I think it’s both rational, but also, you know, actually practically very hard to make what they’re doing. Grace Shao (37:41) haven’t been proven. Yeah. totally. Okay, I want to ask you one last question, which I ask every single guest. What is one differentiated view you have? It could be on your own sector, industry, life. John Wang (38:00) man, have a really like, I have one that like is very controversial. I don’t know if I should talk through that one. ⁓ Grace Shao (38:07) You’re get doxxed and to hate it after this. Okay, tell me that one after, I wanna hear it. John Wang (38:17) Yeah, yeah, Let’s see. Like... I would say, I don’t know if this is differentiated now in the market or not, but the thing that I’ve been thinking about recently is that you should stay somewhere long enough where you see your mistakes through. And I think it’s like slightly differentiated right now, because like you’ve got in Silicon Valley, at least you’ve got people who are jumping between big labs, who are jumping between different startups where it’s like, Hey, I can make the next, you know, $5 million. by going to this next thing. And there’s just a whole huge amount of opportunity and there’s like a ton of opportunity costs to staying somewhere for a long time. And at the same time, think like long-term staying somewhere for a long time is actually one of the best things that you can do for your own learning. And it gives you that a better shot to make like the long-term massive gains that you could have like $5 million. is amazing for someone. But if you want to build your own startup, if you really want to like change everything, if you jump around between companies every year or two, like you’re probably not actually going to learn a lot. And you’re probably not in the position to make really hard decisions and then have to see those hard decisions through and then, you know, be able to learn and see that feedback from those hard decisions. Especially if you’re jumped like Especially if you’re like at OpenAI and you’re like, no, investors don’t want this anymore. You jump ship to Anthropic. That’s like, you know, I don’t think you’re going to get that, the learning that you really need to get. Grace Shao (39:46) Yeah, yeah. actually agree with that. think I also took some time and like experience to realize that because when we’re all young, like you’re really excited, right? It’s like, this looks cool. That looks cool. this person hates me. hate that person. Like you take everything very personally and then, you know, we’ve all heard these stories from peers, even ourselves. But what is the threshold though? Because then the other side of the argument is that like you see people who’ve been in a job for like a decade and clearly they’re frankly not. moving up in a very corporate structure way or even intellectually growing or even, you know, excited about their job anymore. You know, the joke is like you get the like, okay, this sounds on PC, but you know, like the 45 year old VP that’s been a VP for the last 15 years at banks, we have a lot of these. So what happens? Like when is it best for them to actually maybe jump or some say, that was like a lifestyle decision where they want to take it easy because they have some more time for kids. Fine. But taking that kind of considerate way, wouldn’t it sometimes be better that you jump to try something new to take risks? John Wang (40:50) I think if you are in a place where you’re unhappy with, so I will caveat this with, have to, you should only stay if you’re excited about what you’re doing and you’re learning continuously and you’re surrounded by great people. If those three things aren’t true, yeah, it’s really hard to fly. Grace Shao (41:05) Which is so hard to find. you were very lucky at Stripe, right? Like you said, you were just surrounded by very high caliber, high agency people, but not everyone can get all those things at the same time. ⁓ But no, that’s great. Thank you so much, Sean. ⁓ I had a lovely time chatting with you. I still wanna follow up on what was the unspoken differentiative you later. All right, thank you. John Wang (41:17) Yeah. ⁓ Let’s do it. Let’s do it. Thanks, guys. Get full access to AI Proem at aiproem.substack.com/subscribe [https://aiproem.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4]

Nathan Lambert Reflects on China’s AI Labs: DeepSeek, Open Models, and the 'Race' with the U.S.

Joining me today is Nathan Lambert [https://substack.com/profile/10472909-nathan-lambert], author of Interconnects AI [null] and a post-training lead at the Allen Institute for AI. Nathan recently returned from a major tour of China’s leading AI labs, where he met with researchers and teams building some of the most impressive open models in the world. In this conversation, we discuss what Nathan saw on the ground: how Chinese AI labs differ from their U.S. counterparts, why open models have become such an important part of China’s AI strategy, and how labs like DeepSeek, Alibaba, ByteDance, Kimi, Z.ai, MiniMax, and others are navigating compute constraints, data access, and commercialization. We also dig into some of the most debated questions in AI today: Are Chinese labs really 6-9 months behind U.S. frontier labs? How meaningful are distillation accusations? Can domestic chips like Huawei’s make up for restricted access to Nvidia GPUs? And is China’s AI ecosystem actually government-directed, or is the reality more fragmented and commercially driven? Ultimately, this episode is a more nuanced look at China’s AI ecosystem that looks beyond simplistic narratives about subsidies, copying, or geopolitics, and instead examines the technical, cultural, and economic forces shaping the future of open models. Check out his two recent articles here: * Notes from inside China’s AI labs [https://www.interconnects.ai/p/notes-from-inside-chinas-ai-labs] * How open model ecosystems compound [https://www.interconnects.ai/p/how-open-model-ecosystems-compound] To find the previous episodes of Differentiated Understanding, see here. [https://aiproem.substack.com/podcast] Every episode, I bring in a guest with a unique point of view on a critical matter, phenomenon, or business trend—someone who can help us see things differently. Season two will host a series of guests from early-stage investing, as well as builders, researchers, founders, and product managers. For more information on the podcast series, see here. [https://aiproem.substack.com/p/launch-of-differentiated-understanding] Chapters00:00 Insights from the China Trip11:51 Cultural Differences in AI Research18:15 The Role of DeepSeek in China’s AI Ecosystem25:26 Overview of Major Chinese AI Labs30:56 The Future of Open Source in AI37:50 Market Dynamics and Consolidation in AI42:28 Distillation and Model Convergence Controversies51:58 The Gap in AI Performance: US vs China61:09 Monetization Strategies in AI: A Comparative Analysis62:32 Government Influence and Misconceptions in AI Transcript (AI-generated for reference only) Grace Shao (00:00) Nathan, thank you so much for joining us today. Yeah, really, really excited to finally hear your thoughts on your big China trip, on what’s happening between the Chinese AI labs and the U.S. AI labs, what you think the potential compute constraints might mean for these labs and their performance in the future, and obviously the open-source ecosystem. So before we get into all of that, could you... Nathan Lambert (00:02) Yeah, thanks for having me. Grace Shao (00:23) Briefly tell us about how you ended up actually working on post-training and open language models. Just a bit about yourself. Nathan Lambert (00:29) Yeah. So I actually started my PhD at Berkeley in 2017, not working on AI things. I was an electrical engineer by training in undergrad, which is funny looking back, because that’s the same year that the Transformer paper came out. And I was like, I think I should do this AI thing, and tried to get the famous advisors to mentor me. And they’re like, we can’t take you. So I had my PhD as this wandering path to become an AI researcher. And then I ended up at Hugging Face after that, which was, realistically, the only industry research job that I had, but also a very hot startup and very fun to learn kind of at the intersection of these tools that people use a lot for AI and research, which is what I was doing. And then when ChatGPT hit, the kind of RLHF thing blew up as the hot word on the technical side of things. My PhD had ended up being in reinforcement learning, which is just the first half of reinforcement learning from human feedback. So it was kind of a natural pivot to be like, well, I might just do that. And Hugging Face was a good place for doing that, because the whole company is kind of all for that, which is like: figure out how to support the community on the hot thing and build platforms there. So they were very happy about that. And I helped build a team at Hugging Face. And then I was kind of burnt out on the remote-work time-zone thing and found out that the Allen Institute was doing such similar stuff. And I was like, wow, I have people that could be in-person friends and do similar things. I was like, quality of life — I need to do this. And a few years later, I ended up building a bunch of models. And I think being at a nonprofit opened me to this ecosystem vacuum of information, where there aren’t many people who can talk about what they’re doing. So then, with some luck and committing to write every week, I just feel like my influence filled the vacuum of nobody saying reasonable things. And it is this nice synergy between what I write about and what I work on in my day job, and it just kind of got bigger and bigger in a very fun way. I think that, generally, at the highest level, I’m motivated by wanting AI to go well on this trajectory. And I worry about a lot of near-term things, whether it’s social unrest in the U.S. and just kind of the massive hatred for AI — I think is a very big near-term problem — and then, medium term, concentration of power, because I think AI will be super powerful in ways that people don’t expect. So generally, open models are a nice way to curb both of them by being a bit more transparent to people, and it naturally is a hedge against concentration of power. There have been different reasons throughout that, but that’s kind of a recurring theme in my life in the last few years. Grace Shao (02:50) Definitely. I love your work because I think you help non-technical people like myself really understand what’s behind what’s happening in these labs a lot better. And then I actually just spoke to your former colleague, Tiejin Wang, and he was with APAC Hugging Face just last week. He was saying the same thing. Open source, in many ways, is kind of the best way to go forward as we know that this technology will not stop evolving, but it’s the best way to kind of put up guardrails and checks and balances for the monopolies. Okay, I don’t want to take up too much time on that side of things today because our focus really is about your China trip. Before we get into the weeds of all that, I want to hear about the trip itself. Most people who are writing about Chinese AI are getting their information secondhand. You really went there, you spent time with the researchers, you met with people who are building the models. Tell us about what you meant when you said you came back with great humility, right? Your eyes are a bit more open, whether it’s the good or the bad. Tell us about your trip. Nathan Lambert (03:50) I feel like I kind of went in — I mean, I had this horrible English phrase in my writing, which was like, “I knew I knew nothing about China,” which kind of tried to indicate that I knew going into the trip that I knew nothing. And it was still the fact in my current writing. This is a horribly written sentence that I had in there. And I only talk about it because somebody called me out on it. It’s like, what is this? And it’s like, leaving, which is knowing that it’s such a big country, there are just such vast amounts of talent working on these problems, and how unpredictable it is as a human to model people with very different worldviews and upbringings and training systems. Realistically, the way that people are trained in China is very different. And I just think that even being there, you can’t fully grasp: what are the pockets of three to six researchers doing that is actually a bit different than in the West, even if they’re working on the same goal? I think you could get down to that level of granularity and a sociological study and actually see differences in what they’re working on, and that’ll always change the output. I didn’t get to that level of granularity, but it’s just to start having real experiences and understanding how people explain how they work on these problems. And for me, realistically, a lot of it is coalition building, which is just like: I want there to not be vitriol at the level of the technical companies doing things in international bodies. So just meeting all the labs on both sides is really nice, because you need to do that for them to talk to you about more sensitive issues in the future. I got some criticism on the piece, which is like, this is how you shouldn’t visit China. And it’s like, well, what are you going to do if you’re going on an official visit to a bunch of companies? How do you expect to get in the door without being nice? You have to start somewhere, and I think it’s important to be respectful. Grace Shao (05:31) I think the piece was, frankly — I don’t think the criticism was fair, to be honest, because I think you were really transparent with the fact that you’re not a China person, right? It’s not like you’re going there and exoticizing everything. And if anything, a lot of people, even with China backgrounds, like to use certain dragons and tigers to describe things. I feel like you actually were really humble going and being like, I’m just a technical dude meeting with these labs, talking about their technical research, right? And then because you were physically there, you had observations of the culture and the people. So yeah, I actually thought your piece was quite good. And yeah, sorry. Nathan Lambert (06:05) I agree. I was willing to let that sail past, but I think it’s important for people who listen to realize how actively these companies are trying to court Western audiences, which is why we could get in the door. I mean, we had some prominent people on this trip, but that’s why we got all of them in the days that we wanted them, except for DeepSeek. So essentially some, like Catherine Rintel, who works with me at Interconnects, and some other creators... Grace Shao (06:23) How did you get everyone? Yeah, how did you get everyone? Nathan Lambert (06:29) He used to live in China and has connections in China. So he kind of orchestrated the mix of his connections and leveraging my connections to labs. We had some bigger names on the trip as well. Just stringing all of these together to get all the various labs in place is a few months of networking to make sure the trip lines up with people with established networks and contacts with the various labs. But these people want to look good to Western audiences, so they’re only going to say yes to the right researchers. And the researchers know that there are two to four comms/ops people in the room, hanging out, making sure that it goes well. Especially the bigger the company, the more comms people. You go to Alibaba and there are three to five various people, from the head of comms to some special offices. You’re not going to get these people in the office, or at all, without accepting the cost of these types of handlers. It’s the same thing in the U.S. You’re not going to just plop a senior executive into a chair. So it’s also good because now I have the WeChats of a bunch of researchers from China that I could just text about things. It’s like, hey, congrats on the new model release. It’s like Lay Lee works at Xiaomi, Xiaomi MiMo. It’s like, talk to this guy for an hour at a mall — I don’t remember the name of the tea store — but it’s like... Grace Shao (07:27) No, of course. No, of course. Nathan Lambert (07:49) Now we have these relationships, which is very useful, and that helps information spread across the ecosystem to these trusted parties, which doesn’t really exist. There are not that many, I think. And the opposite direction of the trip is very hard because Chinese researchers can’t really enter the U.S.; the visa purgatory is too complicated. A lot of us on the trip were either Canadian or entered on a transit-without-visa entry, which makes it very easy for American technical talent to go to China right now, which is why I think there are so many trips. I think there’ll be more of them. We’ve got a lot of inbound from VCs and open-source labs in the U.S. that want to establish collaborations with these various labs because they’re the best open-weight models, and they want to build a stack for companies in the U.S. building open-weight models. So I think there are going to be more prominent, but not gigantic, U.S. startups going to try to build these relationships, which I think is a really interesting technological development because we’ve never seen this type of professional work trip in China from U.S. tech companies. Most tech companies have a “bring a device to China, it auto-bricks itself, and you have to hand it into IT.” So to actually proactively send people in a professional capacity is a really big change. There are a lot of angles you could take this, and I think it’s cool to see how it unfolds. This isn’t even really about the trip. This is the follow-on that we’re hearing from people that are like, hey, how’d you do this? We want to do this trip. Grace Shao (09:06) Yeah, definitely. Actually, from my end, I hear about VCs or investors always being quite active going to China because previously American funds were very, very active during the internet era. People were kind of always trying to find a way to either get into these good deals or potentially keep their pulse on it. But I think it’s really, really positive for the whole AI ecosystem to have this kind of fair, transparent exchange in some capacity. But to your point, there’s no way that star researchers can come out and talk to you off the record without any compliance, because that doesn’t happen in the U.S. either. That’s just companies protecting themselves. I just think your trip was quite meaningful, and I want to bring it back to your observations. You talked a lot about the cultural aspects of it. You talked about how you felt like in China there was less of this star-researcher celebrity status around people. People were more humble, or there was more humility. It was very focused on execution. You argue that Chinese labs are particularly well suited to the current LM-building game because they’re very focused on meticulous stack-level work. And there’s less ego sometimes to work on the dirty work, or the non-sexy work. So kind of unpack that for us. Why do you think that is? You kind of touched on it — you said they were brought up differently, they were taught differently — but what’s so different? Nathan Lambert (10:27) So essentially, an interesting part that synergizes on this trip is that we stopped by some academic institutions. I think it was like AIR and Tsinghua and stuff. And you hear all of these academic leaders talk about how they’re pushing hard to try to change it. So yes, they know China is producing more papers than anyone else, but they still think that it’s not as transformative of research. And they think that they’re trying to cultivate the academic domestic ecosystem to change just the type of work it works on, and the distribution, and take more risk. And then you would talk to some industry leaders off the record behind closed doors, and you would hear things like, it’s never going to change because the education system is so structured. There are so many layers of the funnel that reward things like memorization and stuff that they’re just like, this research culture is not going to emerge. And then the follow-on with the AI labs is that these labs are doing fast-following. They kind of have a proof of concept, and they know what it needs to look like. Therefore, in that domain, you’re not trying to invent the new paradigm. You’re not trying to make the model that is o1 or o3, or the first model to work in Claude Code. You’re like, I see it, and I’m going to try to do that and make it the best thing. And I’m going to try to make it cheaper and just maximize that goal. A lot of companies don’t need to invent the new paradigm. OpenAI has done this so many times. That’s their bread and butter: never doubt OpenAI’s ability to release a blog post and a plot that changes how people think about AI. I still think it’s going to happen a few times in this massive boom over the next four years. OpenAI just kind of has that sense of what is the thing that you can push on a bit earlier and just transform things. But I don’t expect — and other people wouldn’t expect — the Chinese companies to do that as much, because it’s just such a culture of, I guess, building. I don’t know how to describe the positive version of this. Maybe it’s slightly more practical-minded, in terms of: it’s your job to build this thing. A lot of the researchers, maybe because they knew their managers — some of them had managers in the room — see their role in the company as being to make the models excellent. And especially for students, I work with students and that’s what they say. I work at the Allen Institute and we have students that will co-lead our language models. It’s not that surprising, because if you do an industry research job in the U.S., a lot of mentors will tell you that you’re kind of free of the burden of bureaucracy and politics. So the naivety of students, and the simplifying, is actually so good at just getting a lot of technical work done. There’s also the life-stage side. If you’re younger, you don’t have as much family, and you normally haven’t built up as many habits and other things you do with your life. Language models are so complex, and the amount of context that you need to absorb to understand what the bottleneck is — there’s so much information, and you have to be able to pick what the bottleneck is and break it. If you just don’t have the mental space to absorb all the context, you kind of end up doing things that are cute but don’t make breakthroughs on the model. So that’s kind of a difference that I’ve seen in people who were both very successful academically before language models. Some of them are able to pivot to this practical mind, which is: what is the state of the system? How do I improve it? And then some try to make kind of these abstract frames of what’s happening and approach it like an academic, and it normally doesn’t improve the model as much. So I just kind of see, if the academic system is a bit more practical-minded, a bit more structured, and the work you’re doing is structured in the language model — make this kernel implementation faster, make this idea work — then maybe it can be... I think it’s an oversimplification. I push on that a bit in the piece just to really contrast what you could think a U.S. lab would look like. And I have a few anecdotes. I’ve heard a U.S. lab paying off a researcher to be quiet about their thing not being in the model. All of these one-off things are more storytelling devices than anything, because most one-off things don’t matter at all. But also Llama 4 imploded, and that was because it was described as a Game-of-Thrones political-style environment, with all the VPs vying for influence and showing that their thing made the benchmarks go up. It kind of fell. Many, many people will tell you that. And we’ve had the Qwen turnover, but it doesn’t seem like it was quite the same type of thing as Llama 4 or xAI. xAI barely exists now. There have been some dramatic things in the U.S. with how these companies have kind of come and gone out of the fold. Grace Shao (14:55) Yeah, I kind of agree with you, but also I would push back on that. I think there’s obviously a more rigid and competitive academic system, which by default in East Asia results in a culture of students following the bureaucracy and authority a bit more. So I agree with you in the sense that they’re very pragmatic. They focus on the task that is given to them. However, I wonder if things will change with how AI will disrupt education. That’s number one. But also, a lot of the young researchers that you’re working with today seem quite different. At least a lot of the entrepreneurs I meet today are born in the ‘80s and ‘90s, some even younger and born in the 2000s. And I think there’s a kind of aura or confidence coming from them. If anything, you want to say they’re a bit more individualistic-minded. You went to Shanghai, right? They are dressed very, very uniquely. They have these outrageous outfits on the streets. People are seeking individual ways to showcase their personality. So I wonder if that will shift. But for sure, for the academic institutions like the Tsinghua and the Beida of the world, they are still very old-school. But I would say that is the same maybe in some academic institutions in the West still. Okay, I think on this topic we can go off on a tangent on academics, but let’s go back to China’s ecosystem. When DeepSeek V4 came out, we talked about it offline, the two of us, quickly about a piece I wrote saying how DeepSeek is starting to look a bit more like a base layer for China. And if anything, some of the labs kind of admitted to that. They’re like, we have very limited resources. And to your point earlier... Nathan Lambert (16:11) Yeah, you could take that in so many tangents. Grace Shao (16:34) Limited people — these labs are tiny. They’re run by 100 to 200 people max. Limited capital, obviously limited compute. They have constraints all around. And in that sense, in a way, the ecosystem’s looking less like a zero-sum game and more like different players optimizing their own strengths. So correct me if I’m wrong, but DeepSeek is providing a base layer where a lot of labs will quickly follow and basically adopt a lot of their engineering breakthroughs. And then Zhipu, Z.ai, will focus on the coding; MiniMax focusing on the multimodality, et cetera. There are a lot of these different players. ByteDance, obviously, very, very focused on their video models. And Qwen, like you mentioned, had the whole open-source saga break apart with Lin Junyang leaving. But in general, they’re still kind of the leader in hyperscalers on that front. So everyone’s doing their own thing almost, instead of really... Nathan Lambert (17:27) I agree with the people specializing, which I think is normal business evolution. You figure out a bit where you’re good at. And there’s so much opportunity that they are like, okay, I’ll follow this because they see that they’re good at it. I just am more skeptical of DeepSeek as a base because I have no idea what DeepSeek is doing. And some of the labs when we were there, because DeepSeek V4 had just come out, were like, yeah, we look at the things they’re doing, but they seem more intricate than needed. And if you read the paper, there’s just so much going on in this model. As a researcher, I’m like, some of it seems a little fake or a little dependent on their setup and not necessarily going to work in every model. Grace Shao (18:04) What does that mean? Break it down for me. Nathan Lambert (18:18) Essentially, I will say that building an LLM is dependent on where you have your GPUs, your pre-training dataset, your intended deployment setup, and stuff like this. So you make decisions based on your constraints, and you build the model. DeepSeek has these constraints and they end up with their model, but Moonshot and Zhipu have different constraints, maybe more flexibility, and they ended up building a different model. They will test the DeepSeek innovations. So they’ll say things like, X innovation doesn’t improve our model. These two organizations are on different development paths that have core similarities, like these large mixture-of-experts models and the general methods are similar, but a lot of the parts end up being a bit different. That’s why I’m like, I don’t know exactly. If DeepSeek was a base, you would see the Chinese labs just do post-training. We just take the base model that’s out there and we adapt it to our domain of specialty. And we have users that do that, which is something that I think about a lot. I’m thinking about starting a post-training lab and how to format post-training research better. So I think about this a lot. I think about what a shared base actually would be. They go through — some of these labs put an extreme cost on creating their base model. And if they didn’t need to do that, they wouldn’t. One of the labs told us how long their pre-training run was, and my jaw dropped. I was like, that’s way too long. Any U.S. advisor would be like, you’re taking way too much risk on this pre-training run. If they didn’t land that pre-training run from one of these past big MoEs at a Chinese lab, I don’t know if the company’s dead, but that’s a huge amount of time. Most U.S. companies now know that you don’t want your big pre-training run to be more than a few months because it’s just so much risk and time to put all your eggs in that basket. That’s a sign that, in that case, they don’t have as big of a peak-size cluster. Essentially, pre-training time can come down a lot when you have a bigger overall cluster; you can just get more throughput on it. But if your biggest cluster is smaller, it’s harder to get a certain amount of throughput, so you use that one for longer. That’s a compute constraint. To loop it back, I think the specialization is real, but I’m more like, I have no idea what DeepSeek is doing. I know they’re raising money now. I don’t know what the plan is there. They seem the most without a specialty in the Chinese ecosystem. Grace Shao (19:59) Dependency on. Yeah. Mm-hmm. No one knows, though. No one knows. They’re secretive. But that’s my point, right? I feel like they’ve been kind of nationalized, whether willingly or not, because they’re taking the Chinese government’s money. They’ve kind of gone secretive. And it’s not like there’s a secret that they prefer Chinese-educated researchers. They’re keeping a very domestic stack, from talent to capital to the whole stack. So to me, it seems like they’re being Huawei’d, in some ways, because they did well and they got their name globally, and then by default they’re becoming the next Huawei, willingly or not. Nathan Lambert (21:01) I don’t think nationalization makes you a base for the other companies, at least not at this stage. There could be something, but it’s hard to force. Grace Shao (21:06) But then you have some incentive, right? But then it is some incentive. You’re like, well, if you can propel one of the teams and propel the whole industry as a whole, it could be in your KPI or some kind of unspoken expectation. Nathan Lambert (21:17) The coordination problem is so hard. Essentially, both in the U.S. and China, even the open labs, what they do is they fork open-source code and match it to their internals, and every company does this. Therefore, all the improvements that could potentially be going to the open code and forming this base that is far more efficient — they’re not completing the feedback loop. I think China could be closer to it. If people really lean into DeepSeek as a standard architecture and DeepSeek shared their training code and all the specifics and how to do this, from a Chinese economic perspective, that would be a huge win because you’re just saving compute. But I think it’s too decentralized and too competitive to have that happen. It wouldn’t happen in the U.S. either. Grace Shao (22:04) It’s so cutthroat. Yeah. Nathan Lambert (22:08) Even though I think for open models to be closer to the frontier, it would be better. I talk about open models in the U.S. needing a consortium. But there’s definitely enough money to make a consortium in the U.S.; then you fail because the model won’t be good because you’re feeding too many asks into the model. That’s the only way to create a shared base. Grace Shao (22:25) Interesting. So it’s not really just commercial. Yeah. It’s not the commercial reason. Okay. So if you had to give a high-level commentary on each of the major labs, what would it be? If you look at ByteDance, Alibaba, Tencent Hunyuan, if they’re relevant, DeepSeek, Moonshot, Zhipu, MiniMax, Meituan, Xiaomi now being part of the ecosystem too. Nathan Lambert (22:46) You might have to prompt it or say more, but I could just kind of ramble through them, which is kind of fun. Alibaba: cloud-focused, understands that open models can enable more usage of platform. So I would say Alibaba is very, very cloud-focused. ByteDance: mostly characterized by everybody else being intimidated by them, and very user-focused, including multimodal. Kimi: vibes of the office were great. It would be one of the best startup vibes that you would visit among U.S. or China. Zhipu: very AGI-pilled, surprisingly cautiously excited about being entity-listed, even though they have no idea why they are, because they’re like, it stamps them as a big deal. And then there’s some... Grace Shao (23:27) I think they previously worked with SOEs. That’s the main reason. Or they still do, but that was one of their main sources of income. And unfortunately, because a lot of these labs spun out of Tsinghua, and Tsinghua is, for people’s context, in Beijing. It’s really close to the government, obviously. But the thing is, when it’s close to the government, it could mean there are three layers of agency underneath the actual government apparatus. But then people like to link it to the fact that it’s taking government money, so therefore they are suspicious. It’s very unfortunate, I think. A lot of companies get thrown into that category. Even companies like Lenovo and a few other Chinese companies have previously been called out by U.S. senators saying, they’re taking Chinese government money, but really it’s that their scientists or their research labs spun out of a certain government-affiliated or government-funded academic institution. That’s what it is. Anyway, yes, go on. Nathan Lambert (24:23) Yeah. Some more would be: Xiaomi — surprisingly great research vibes for a new team at a random company. They seem to be crushing it. Grace Shao (24:31) What do you think of Luo Fuli? The star researcher. Nathan Lambert (24:31) I didn’t get to meet her. I think she’s as close as they have to a star researcher right now. There’s the tier of star CEO, which there are obviously others — Dario and Sam, the analogies are there — but the star researchers, like the Sholtos of the world in the U.S., obviously you can come up with many more. She’s the closest you have to this. I need to watch more interviews. We’ll see. But she wasn’t in our meeting. But they just seem to be doing the right thing. They’re making general models. They don’t really have specialization yet. Florian, the person who helps me write about open models on Interconnects, and I took a detour to go see Meituan because we’re like, why is Meituan building these models? And they’re very practical about it. It was a less glamorous visit at a normal tech office. It wasn’t an official visit for them. They were like, yeah, we’re a major online platform. We obviously are going to use LLMs everywhere once we need to build our own LLM and specialize it to our products, which, surprise, is very practical-minded. I’m guessing there are many more companies in China like this. Grace Shao (25:39) That’s what Tencent’s saying too. It’s because they want to serve their existing consumers and optimize their LLMs for their own distribution and their own basic interface or activity loop. Nathan Lambert (25:52) Yeah. After I left, some people in the group went to Xiaohongshu, like RedNote, and they’re there. They’ve released some language models that are multimodal. They’re like multimodal data-processing things. So a lot of them are not that surprising. The startups just have different cultures. I have met some MiniMax people before, so I left the trip early before MiniMax on this one. But MiniMax was quirky. They have a ton of women in their company, which was very fun. And they have products. They’re maybe slightly more product-focused, but I feel like the quirkiness of the company kind of matches maybe Western confusion over what their products are doing and what they’re trying to do. But it kind of matches their language models that are a bit more efficient. Grace Shao (26:35) Well, they came out with a lot of very consumer-focused applications, right? They had Hailuo and Talkie, all these character companion-bot products before. Nathan Lambert (26:45) Yeah. And then the last one I went to was Ant Ling, which is also very corporate, but in a less intense way, because I think they see it as serving their own products, whereas Alibaba Cloud is like, this is the gold mine we have to win. It’s a much bigger deal for them than Ant Group. But a lot of these things, when you list them — I don’t know, eight to 10 companies — they’re all pretty reasonable with respect to the age of the company and what the company does best. There’s not as much confusion. Grace Shao (27:14) Yeah. And Ant is low-key best at medical chatbots right now, which I guess makes sense because everyone has access to Alipay. And then for seniors, apart from WeChat, it might be the only application they’re using on a regular basis. So it became the default medical consultation app, which is really random, but it’s their niche now. Yeah, I think you’re pretty spot-on. It’s pretty cool that you got those takeaways, even just meeting with them for a couple hours. Nathan Lambert (27:41) I have been reading about them for so long, so a lot of these priors are easy to confirm when they kind of fit with things you have seen. The Chinese showroom culture is so interesting, and also one of the most surprising things to have at software companies. It’s so funny. They’re definitely appealing to Western audiences. Z.ai had poorly translated merch. What was it? Something so — it would be borderline inappropriate translation in the U.S. It was like “ship big, go hard,” or something. Just some really weird translations. And they have live API statistics in their showroom. So Z.ai was like, we’re serving 5.5 trillion tokens a day. All the U.S. companies are so closely watched for when they announce token statistics. I know at least one of these numbers is wrong. It’s something like Fireworks does either 30 or 300 trillion tokens a day — or I meant Together for that one — and then one of Fireworks or Together, and the other one, are like 100 trillion tokens a day. Don’t take these as sourced; go look them up. There were some public announcements recently, but those were the first updates that anyone has on major infra companies in the U.S. Inference is a huge market. You don’t hear anything from Fireworks because they’re just struggling to demand and they’re making bank, because inference is a much better thing to sell than bare metal. Essentially, inference is selling the software implementation to serve tokens more efficiently, and you can just get more margin when you improve the stack for a fixed model. So a model comes out and you host it, and then you can make your stack more and more efficient on that model. You just get more margin and hopefully growing usage. That’s way different than GPUs, where the best case is that you lock in a huge commitment for a long term. Just being able to walk into an office and learn about their API is interesting because they also had geographic distribution, which was like: China was, I don’t know, two-thirds; U.S.A., 20%; and then the last percent was Singapore, Korea, Japan on the Z.ai API. So that’s cool. This is s**t that I always want to know about the companies, and I have no idea. One of the things I always want to know is: how are open models being used outside of the U.S. and China, and has this decades-long process of technological diffusion started to kick in in a way that any company can measure? I don’t think anyone has good data on it yet, but I think it’s obvious that at some point, open models that are cheap to run are going to have some interesting playbook across the globe for the long tail of countries. Maybe I’ll just walk into the front door of a Chinese open-weight company and get my answer. Grace Shao (30:30) But actually, I think the culture of these labs — a lot of them, because they’re run by really young, passionate people — you would feel like they’re a lot less commercialized or less corporate, or at least less sleek. They’re not sophisticated with, you can say, the capital-market side of things, but you can also say that they’re just really naive and open-minded and passionate about the product they’re working on, with less of a corporate guardrail built around them. Nathan Lambert (30:56) Yeah. Grace Shao (30:57) Okay, I want to talk about... Nathan Lambert (30:57) Yeah, go ahead. It’s like one of the people at Z.ai who’s known on X — I don’t know, 9,000 followers — it’s like Lu. She came up and was like, hi, I’m a student, I’m 20. I’m Lu from X. And I was like, that’s hilarious. There was a lot of s**t like that. It was like, oh, okay. I don’t want to call her a kid, but it’s like... Grace Shao (31:06) Yeah, yeah, yeah. And I think the one that runs Moonshot’s developer ecosystem or something is literally a girl fresh out of school, right? And she just posts hilarious memes all day long. There’s no filter on her social media. It’s funny. Okay, we go on these tangents, Nathan. We need to come back on track. Open source, open weight. Why? Why do you think Chinese labs are adopting it or embracing it, however you want to put it, especially after visiting them? Is it because they simply have to, because of what we talked about — they are leaning on each other because of all the constraints they have? Or do you think the philosophical drive is actually bigger in that ecosystem? Or is this a bigger strategic thinking for diffusion in the long run? Nathan Lambert (31:54) I actually don’t feel like it’s that special ideologically. I think it’s easy to say the ideological line when you are doing it. Now you can look at Zuckerberg: he said the ideological line when he was doing it, and then he stopped. I think it’s mostly just that, for one, distributing within the U.S. ecosystem, especially to enterprises, is the highest-value market, and they can’t sign many enterprise deals. And the closest best thing is things like Cursor adopting Kimi’s model. Even if Kimi doesn’t get paid for that, they’re happy. That’s the biggest sign of credibility for them, and they can figure it out in selling tokens or whatever in the future. Practically speaking, one, the only way to influence the U.S. market is by releasing these models. And two, it seems like they don’t feel like they’re losing as much if they release and share things. If the model was closed, they just think they would get less influence, they would be seen less, fewer people would use the model, their actual paid offerings would be adopted less. It just seems almost overwhelmingly obvious, because there are all these benefits and not as obvious of a drawback. There will always be better models, and just keep going. But I think every scientist loves... Grace Shao (33:08) Then why are so many U.S. labs against it, or not willing to? Nathan Lambert (33:12) Because they can make as much money without it. Anthropic and OpenAI make more money by not releasing them. They can just make so much money, so why bother thinking about an open model that doesn’t make money? There are different scales of influence. Same with Google. Google’s making so much money. I think Meta will make a lot of money by having good AI models in their products, if they get their act together. Even Google could release more models. They have so many surfaces other than Gemini that need AI to be commoditized and used, like the cloud and all of this. Meta could release the models. It’s just not worth the effort for some of them. They’re like, we need to do this high revenue target; it’s too much of a pain to go through legal and make it ready to release. Why bother? I don’t know, maybe it’s a little bit of a cynical take, but I think Microsoft and Meta could release their best models openly because they benefit if it’s a commodity layer. But I don’t expect them to, because it’s just kind of like the benefits of focus are so high, and they just kind of see it as something they don’t have to do. Grace Shao (33:56) And it’ll be good for them. But then eventually, we will see some consolidation in the market as well, assuming — because you can’t really have 10 labs in each dominant country right now all exist. Nathan Lambert (34:26) I do expect consolidation. I think this is potentially a subtle cultural point, which is that the U.S. labs are more likely to buy into “we’re special, we need to go fast, keep it closed,” and the Chinese labs are not. There could be something there. That’s also who the decisions funnel up to. I don’t know. I talked to the Alibaba people that make these decisions. I can’t say all the things that they say about them. Some of these were two-on-one and off the record, so I can’t say all these things. But at all the other labs, there is a person that makes the call, I’m guessing. I think those are senior leadership that we’re not talking to. So it’s kind of hard to know exactly what they really think. I definitely expect consolidation. My thing is that I expected it in China faster because the capital markets aren’t as strong as in the U.S., but I don’t have a model for that. I think you can model it, which is: what do you think the revenue growth would be? What do they need to do to raise to keep training bigger models? What is the compute cost? Then you look at the potential raises and think about which country would not be able to do that race first. But also, it’s this wild thing with OpenAI raising $120 billion. Are you kidding me? What is that? Grace Shao (35:47) Yeah, the valuations in the U.S. are not really understandable by anyone else right now. I think in China — so on your point on that, I’ve been writing about this and I think it would make sense for Tencent just to buy out one of the labs. They have the money, they need the capabilities, and frankly, they’ve really been struggling to compete with their LLMs, with all the labs talked about just now. So my... Nathan Lambert (36:05) Their licenses are so bad. They release all these models that have horrible licenses. They’re not that good, and the licenses are just horrible. Grace Shao (36:13) So I feel like it financially makes sense for a company like that to optimize and just buy out a lab. Then the labs can also lean on their distribution, because at the end of the day, how are they going to win consumer mindshare or distribution in China right now when it’s really just dominated by Alibaba, ByteDance, and Tencent? That’s my spiel. But when I spoke to some of the researchers... Nathan Lambert (36:33) I think big companies have a lot of inertia, and the senior leadership has the call, and they can have inertia. I still think Apple just ends up buying some lab for $25 to $50 billion. It’s not the worst thing. Just golden-handcuff the researchers. Some will still quit. Grace Shao (36:43) Yeah. But I think right now they don’t want to. The labs still have a dream. Some of the researchers still have a dream. So when I spoke to a lot of them, they’re like, no, we don’t want to do that. We want to commit to our own frontier research. If I wanted to join one of the big tech companies, I could have. So why would I want to sell? That’s what the researchers think. But to your point, we don’t know what actually the one person or two people at the very top think, especially if they continue to have hurdles with compute access and capital access, which brings me to the question. Nathan Lambert (37:14) It also depends on your view of inference. You can ask your next question. I don’t need to cut you off. It depends on your view of inference. If these agents are just so much inference, I do think it’s going to be an oligopoly-style market, not a monopoly-style market. And what’s the difference financially between two and four or five big companies with great models? Is that actually not sustainable if there’s so much demand? There are a lot of cases where we have two or three, like the cloud, but what’s stopping that from being four? Grace Shao (37:40) I think they will be the infrastructure providers. Yeah, yeah. And they would kind of lean into each of their existing ecosystems or distribution, whatever you want to call it, and serve certain specific models for specific uses. So enterprises can choose what matches their needs the best as well. I do want to bring the conversation to a more contentious topic, which is on distillation and model convergence. You raise the question of whether Chinese models are structurally different. Often we are hearing claims saying a lot of these labs are about three to six months or six to nine months behind U.S. labs. There’s obviously a lot of noise or allegations and accusations from certain U.S. labs saying Chinese labs are distilling them. How do you actually see that accusation or that kind of dynamic? Nathan Lambert (38:36) The biggest unknown that I don’t have an answer to, which actually has a lot of sway, is how much of the Chinese companies are actively trying to hack APIs versus just showing up as a customer and paying. If you’re trying to hack the APIs, normally you get reasoning traces out so that you can create a reasoning foundation that would be similar to the model that you’re trying to do this from. That’s very different than the API standard form, which is just the output of the model, which is a less direct process for learning from. I don’t know the magnitudes. If it’s more just like, I walk up to an Anthropic API and I use it as intended, but I’m making a competitive model, I’m not very sympathetic to Anthropic. They could ban it if they want to. And I think the impacts are kind of a standard practice. You can do it with many different models and so on. The evidence Anthropic provided is not large enough scale where I’m like, this is industry IP theft at mass scale going on 24/7/365. So there’s definitely some gray area to what is actually happening in distillation. That’s why, on the policy side, I try to push people to not call all of it the same thing. Essentially, using any API endpoint to make synthetic data to train your model is some form of distillation, but it’s very different if you’re trying to break this model so that it gives us a different behavior that is hyper-useful for training and not get caught. Those are pretty different actions, and they’re all looped into this common phrase of “distillation” right now. That’s my biggest problem, which is that academic researchers and small companies use distillation extensively as the core of their business and the core of research methods. So if the U.S. government nukes that as a thing that could be done in the AI ecosystem, it’s mostly bad for small players, bad for U.S.-China tensions, and bad for academics. That’s my primary concern. And then trying to get the labs to actually say more. There’s a distillation side and then performance is the other side, which on benchmarks, it does seem like the Chinese labs tend to be six to nine months behind. When it comes to general use, I’ve always found the closed models to be better in ways that are hard to measure. So I go very back and forth on whether the closed models are better. I think we will especially see Anthropic and OpenAI pull ahead on knowledge-work tasks like legal, healthcare, financial services, because I just don’t see the Chinese labs paying for that data. All that data is going to be people that charge hundreds of dollars an hour to annotate and create these environments. So it’s a whole new capital build-out that goes on there right now. It’s going to be billions of dollars if you’re going to buy a billion dollars of data and a billion dollars of compute and a billion dollars of talent to train your model. Grace Shao (41:30) They don’t have the money. Nathan Lambert (41:30) I don’t think they have that. Mercor has some of these evals, and I think there is a bigger gap there. So it’s very interesting. Florian, the guy that helps me, and I disagree on it. It’s this fine line between, yes, the evals — coding and lots of these things, and even random evals that surely the Chinese labs aren’t training on — the open models really are genuinely crazy impressive scores. So I think there’s also a tester’s bias, where I don’t use the open models as much. Maybe it’s hard to ground in my head what I was doing with AI six to nine months ago. I wasn’t even using Claude Code as extensively. I guess the question is, at the end of this year, can I use an open model in something like Claude Code and feel like it works at all? That’s the test on the performance gap, starting in June, June to August, and whether or not that hits. I don’t think the open models have hit that yet. I think it would be way more of a narrative if all the companies spending billions of dollars on Claude are like, oh, we can spend 1% and just use DeepSeek. These CIOs and all the big companies — some companies spend more on tokens for their employees than on headcount. These are normally startups. But they would happily reduce that token cost to 1% expenditure if it really was that similar, because then you could just use 10x the tokens. I don’t expect that to happen. And I expect things like the latest Claude and GPT-5.5. I expect more of these things through the year, and we’ll see if I end up being right. Both are right at the middle of us, as a world, getting more clarity on them. They’re like 18-month-long stories unfolding, and I feel like we’re just in the middle of performance gap and distillation and learning more. Grace Shao (43:25) Yeah, it’s interesting. You mentioned — it helped me recall a conversation I had with other people as well. The point on distillation is that I just had a conversation with your former colleague at Hugging Face, who leads APAC, called Tiejin Wang. He was just saying, look, the distillation accusations don’t really make sense because we’re all distilling off of each other as we speak. I’m learning from you; you learn from me. We’re distilling. It’s so vague of a terminology to just use that to accuse all these various behaviors. So to your point, I think people in the technical world who understand what’s happening actually want more clarity on what is the gray area, what is actually black and white, and what is not appropriate or unethical. That needs, I think, the industry to come together to really put guardrails and rules around. Now, number two on the compute side and the data side. Something anecdotally will be interesting to you is that when I spoke to one of the lab researchers in Beijing, I think in February around Chinese New Year, they were saying, look, they want to get better data, but they can’t because usually a lot of American labs would pay tens of millions, if not even more, like a hundred million dollars, for a set of very obscure or niche datasets, but they would have an exclusivity contract. What the Chinese labs will do is that they will literally wait out the exclusivity contract and then, say two or three months later, pay for it at one-tenth or one-twentieth of the price for that same dataset. So then once they start post-training on that dataset, that’s where the three to six months or six to nine months come in as well. Yeah. On that note, I want to... Nathan Lambert (45:00) Yeah. I think the data industry in the U.S. has two things. One, the lab asks the data vendor, we need this specific type of data. And the data vendor is a network that connects the people to the lab. The other thing is the data vendors know evals that are important, so they try to create good data for hill-climbing on specific evals. That data could be sold to multiple people, but is less expensive because they make it once and expect to eat margin or take margin on it. There could be a pipeline where once OpenAI is at the cutting edge, creates this new thing, they create deep research, then the data industry is like, let’s make things that are a little bit cheaper to sell. So there is time lag in these various things. But I heard the same thing on the ground, where they have a negative view of the data industry. It’s like, quality is bad, we don’t really have access, we do some in-house. That’s a very big difference from today, which is that you have the data companies in the U.S., which is insane. Grace Shao (45:53) Yeah, the American data companies are so mature. It’s its own sophisticated ecosystem. Before we get into data, I actually want to ask you this question. I think recently a lot of the narrative is now saying, look, Anthropic and OpenAI have kind of proven that pre-training scaling laws continue to hold, especially with the recent models. There’s an obvious compute constraint on the China side that we talked about. And then it will likely be even more amplified with the absence of Blackwells in the coming months. So as we move forward in this race, per se, if you have to put it in China versus U.S. in that sense, will we see a wider gap between the performance and benchmarks between the Chinese labs and U.S. labs? As in, will we see the gap going to 12 months, 24 months, as Chinese labs are very, very constrained on compute for pre-training breakthroughs? Nathan Lambert (46:40) I think it’s more of pre-training as a thing that you could actually finish. How big can you pre-train a model that you can finish and serve? The Chinese labs could train models that look like GPT-4.5, which is this giant model, but you can’t serve it. They end up training a model that is 2.5 trillion parameters and they release it, and no one can use it. They could barely serve it on their API because they don’t have Blackwell NVL72 racks or something — these racks that are definitely what are serving these large MoE models. They just don’t have the quantity of these. So there’s a difference between models that you can build and models that are actually useful. I think some of the Chinese labs are definitely like, we don’t need to release the gigantic models because nobody is going to use them in open weight. The biggest models end up getting served via API. So there might be some segmentation in that market. But I do think the inference and amount of economic resources that you have to serve your customers is becoming a thing that dictates what models are built. That’s why I think the gap will continue to rise. All signs point to GPT-5.5 being a bigger model, and I don’t expect that to stop. And then the economics of it is just the basics of: you need a certain volume to have the margin to support the research, because you can’t keep raising these ridiculous rounds forever. I think OpenAI, Anthropic, and Google are the only people with that AI usage volume to keep marching down the scaling laws to another 10x of training compute, which is mind-boggling amounts of investment in a model. That’s why, when the economic markets slow for fundraising, the model gap between these big three will just show a lot more. That’s the distilled way to say my prediction of when things will look different. It’s like these labs can’t fundraise, they go public, they can’t generate revenue more on their paid services, and then it’s just: look at how much training compute can be allocated or can’t be allocated. Grace Shao (48:41) Yeah. Basically, we’ll see a bigger gap, I think, in the coming months. Then what can make up for that? Domestic chips, or, like you said, better data. And why is it that sometimes people assume China has a very strong data ecosystem or data products, but actually the data vendor ecosystem is very weak in China? Nathan Lambert (48:41) So generally, I think I agree with what you said. I don’t know on the data side, but the way domestic chips could help is that if Huawei chips are fine for inference, and if they have sufficient volume to support the inference economics, which then trickles back into revenue, my read is that they just don’t have the volume of the chips, especially spread out across the amount of companies that they have. Essentially, the total FLOPs of Huawei, all the things produced, and it’s going to all these different places — it’s just not big enough. It could be something like ByteDance and Alibaba, with offshore data centers, can keep up a lot longer because they have access to Nvidia compute and have for a long time through this kind of offshoring. Maybe that stabilizes the ecosystem, and we’ll see what the AI startup, the younger startups like Kimi and Z.ai, end up doing. No one wants to do this, but if they pool resources, they last an extra year. You get another order of magnitude if they all pool together, but I don’t see them doing it. Grace Shao (50:00) But that’s the thing we were just talking about, right? MiniMax and Zhipu, how can they possibly compete with the hyperscalers at this point if you need offshore data centers? And the fact that Zhipu is on the Entity List doesn’t help, right? It’s not going to be easy for them to access these data centers either. Nathan Lambert (50:12) Yeah, I think they can’t. I think they won’t. Human nature will make it so they won’t collaborate. They’ll just do something smaller. They’ll just have successful businesses that are different. Grace Shao (50:22) They just have smaller ambitions, want a smaller piece of the pie. Yeah. Okay, so you wrote something like, nothing’s a secret, but everyone wants Nvidia chips. They want it, they don’t know how to get it, they’re fighting over it. Nathan Lambert (50:34) Yeah. They’re the only thing that works for training. All the models are trained on Nvidia. I don’t believe the DeepSeek propaganda that it’s trained on Huawei. The only models that are trained on Huawei are tiny. Inference on Huawei works. Every lab is like, inference on Huawei works. The labs that don’t have meaningful inference are like, we are told to get Huawei, so we buy them, but we don’t use them. Earlier research labs are like, we don’t have any inference and we don’t have a need for Huawei. Any company that has meaningful use of their models has figured out how to run them on Huawei for inference, which, to Jensen’s credit, is like — it’s happening when he said it was going to happen, but it’s not that surprising. Grace Shao (51:11) Yeah, the Dwarkesh interview. I don’t actually understand why he got so much hate for it because even without your political stance, what he said actually made sense logically by saying, if you don’t sell them the crappier versions of what we have, they will have an equally quite crappy version to serve themselves, or they would just want... Nathan Lambert (51:28) I think they would buy both. Buying both is actually true. The amount of Nvidia chips that you would have to sell to China for them to stop buying Huawei — because Huawei is almost surely way cheaper because Nvidia margins are insane — when would they actually stop buying both? Grace Shao (51:43) But then you have to go on CANN. You have to reroute everything back on CANN. The developer ecosystem is not there. That’s Jensen’s point, right? Or the habits are not there. So I think that’s what, when I talked to a research lab... Nathan Lambert (51:50) Yeah. But I’m saying they would also use Huawei. I think they are so supply-limited, they would use both. Anthropic uses everything. A lot of companies in the U.S. will use multi-platform. Meta is a huge buyer of AMD. Demand is so high that any chip that is potentially viable on the models within a few generations is very valuable. And the fact that you can run some reasonably large model on any Huawei chip is a big line crossed for Huawei. I don’t know if they can produce the volume of chips and scale that quickly, especially as they try to move to lower nodes. That’s the standard semi debate. But the question is: can Huawei scale production? That’s the only question. And if Huawei can manage to scale production, Jensen will just look really right. If Huawei can’t scale production, Jensen will look a little bit like a lunatic, but it will be outside of his hands. Grace Shao (52:43) And we don’t really know what happened during this trip. It seemed like nothing really substantial happened after this big Trump delegation. It was more like a high-profile tourism trip versus an actual deal trip. Okay, I want to ask you something you wrote about that’s a bit niche, not something you usually write about. It’s on the SaaS side of things. You said that there’s a common argument that China struggled to monetize AI because they’re unwilling to pay for enterprise software. We looked at how China tries to monetize on consumer AI, but clearly that’s not really been proven yet. In your piece, you push back on the claim and say that there’s a distinction between SaaS spend and cloud or inference spend. Tell us about what you think about that ecosystem and how Chinese AI labs are trying to make money maybe a bit differently from American AI labs. Nathan Lambert (53:32) I don’t know if it’s necessarily different, but I ask a lot of researchers about this. They say that everybody is trying the new AI tools when they come out. If they don’t like them, they stop using them. If they like them, they keep using them on the consumer side. So something like Claude Code would be an example: tons of people tried it. I’m guessing lots of them churn in China, just like in the U.S., but consumers are very quick to adopt and try new things, but won’t stick if it’s not actually serving them. And then the enterprise is like: there’s definitely cloud that exists. Digital services are gigantic. They essentially think that there’s more runway for making money on AI models that falls into that. And they all use coding agents; they all use Claude. It’s a hilarious thing. They’re all very Claude-pilled. There’s almost no mention of Codex, where in the Western media, Claude versus Codex is this whole thing. They all use Claude. And that is obviously a paid service. So I think there are cracks in the argument, and I expect AI models to be seen as a bit of cloud, but potentially it is the thing that changes some of the expectations, where it’s just so transformative because they’re so competitive, and it could be seen as a bit of a phase shift. Grace Shao (54:41) Yeah, and I think it’s a generational shift, a phase shift. Also, actually, recently Doubao raised their prices on Seedance usage and whatnot, and it’s a shift into trying to capture the prosumer market. You can say the average uncle and auntie on the streets still don’t want to pay for a consumer app, but I think there’s more prosumer market share that could be captured in China, maybe not fully enterprise either. I want to ask you about government roles and geopolitics. I know there is a common narrative that usually people assume Chinese AI labs are heavily subsidized. Actually, when I was in San Fran in March, I was at a dinner with a couple of investors, mostly public investors, and one guy asked me, “Hey, are all labs just basically subsidized by the government?” I was like, definitely not. The majority of them are not. If not, they frankly don’t want to take money from the government. It was really hard for him to understand that, because I think the misconception is all Chinese labs or Chinese tech are just funded by the government. Kind of to our point earlier, where any affiliation to any government agency, just by default, is assumed to be therefore backed. First of all, the government, I don’t even know if they have that much money to give out. Number two, I don’t think that’s how competition works, right? So what’s your thought on all of this? Nathan Lambert (55:55) It seemed more like a provincial government trying to help the companies do stuff, which is like get offices, get talent. I don’t know what the provincial government can do. In Beijing, there’s Beijing Academy for AI or whatever, which is a real research institute that’s just funded by a certain neighborhood in Beijing. It was like, okay, the U.S. could do that. But much less of the Ant Group-style thing, which is government takes major ownership stake in an investment round and goes on. Maybe Kimi’s latest round, there were mentions of government-backed VCs, and I don’t know how that kind of intermediary works. So I still think it’s very indirect. And because the government system is so competitive across the different layers, each of those layers are competing to help the companies, but they don’t have piles of cash sitting around to buy GPUs. Grace Shao (56:44) No, they don’t. And they frankly don’t know what they’re doing half the ti

19. maj 20261 h 3 min

AI x education, a contentious but unavoidable future. Designing tech for children with Dex's Reni Cao

I spoke with Reni Cao, the CEO and co-founder of Dex. Dex Camera is a language-learning camera for kids. Reni is a dad, a former product lead at YouTube, and on a mission to build technology that does good for kids and gives digital autonomy back to parents. We dive into his personal story from his high school days that drives his passion for AI, and why he believes the current education system is a “cookie-cutter” that fails curious kids. We get really into the nitty-gritty of what makes “good” tech versus “bad” tech for kids and why the category of ‘children-first tech’ is very overlooked. Reni explains why most children’s apps are built on an “attention economy” model that forces them to compete with addictive content, and why his team needed to build physical hardware to break that cycle. We tackle the hard questions, including the pushback from parents who believe in “no tech” childhoods. And he shared his most non-consensus view: that the era of standardized, industrial education is over. He believes we are entering a golden age of “scaled homeschooling” where AI meets kids where they are. Whether you’re a tech investor or an anxious parent, this conversation about nature versus nurture, “nei juan” (involution), and raising resilient humans in an AI world is a must-listen. Every episode, I bring in a guest with a unique point of view on a critical matter, phenomenon, or business trend—someone who can help us see things differently. Season two will host a series of guests from early-stage investing, as well as builders, founders, and product managers. For more information on the podcast series, see here. [https://aiproem.substack.com/p/launch-of-differentiated-understanding] To find the previous episodes of Differentiated Understanding, see here. [https://aiproem.substack.com/podcast] Chapters 00:00 Reni’s Journey to Dex Camera 03:48 Designing for Children: Principles and Insights 08:05 Technology’s Impact on Child Development 12:09 Bridging the Gap: Business and Product Design 15:36 The Role of Parents in Tech Development 25:20 Leveraging AI and Language Models 29:48 Value-Driven Pricing Strategy 32:05 Defining the Product Category 34:33 Subscription Models and Content Delivery 37:58 AI and Parenting: Balancing Technology and Safety 43:29 Unexpected Use Cases and Impact 47:29 Personalized Education and Parenting Philosophy AI-generated Transcript Grace Shao (00:00) Reni let’s start with your personal story. Who are you and who are your team members? Because when I met you in SF, I was so enamored by the product and I thought your story was so interesting. So please share that. Reni Cao (00:11) Hi everyone, my name is Renny, CEO and co-founder of Dex. We’re a technology company in San Francisco, almost all parent company, which is pretty special in a startup setting. We’re a bunch of parents that having trouble with the same kind of like a reality where like our education system is a sort of like cookie cutter and our entertainment is also cookie cutter for children. So we’re like, can we harness technology, especially the latest development of the AI, in different way for families that really gives children a chance to become the best version of themselves and ⁓ give the digital autonomy back to parents themselves rather than accepting the fact that they have to struggle between technology versus no technology. So yeah, we’re the parents, of like a bunch of missionaries in this journey together to explore how can we make the best use. of the AI and our first product is called Dexta Language Learning Camera where kids can take pictures and turn the whole world into language immersions. And it’s a product targeting young children three to eight. And we’ve sold 10,000 pieces so far and ⁓ ratings has been high and we’re pretty excited about this. But yeah, this is pretty much about us. Grace Shao (01:24) But Reni, tell us a bit about what you did before Dex actually. What kind of led you to this path? I know becoming a parent really did inspire you. You have a young daughter, I think similar age to mine, around three years old. But before that, what really led you to this path? Were you always passionate about children’s tech or education? Reni Cao (01:41) I actually have been a product management guy for the last decade in Silicon Valley, some big companies like YouTube and LinkedIn, some smaller s***, ZFS, Wish. But I have been a builder since the beginning. I would actually say that my passion for decks actually originated much earlier than I started my career. It actually started right when I was at school, but happy to say more if you’re interested. Grace Shao (02:07) Yeah, no, do tell us a personal story there. Reni Cao (02:09) So I was always this random kid with tons of questions back in high school. And very unfortunately, I think the education system, especially in East Asian countries, is not designed for meet kids where they are. So every time when I come up with a random question, my teachers are usually a little bit impatient and will be like, can you just go back and finish your quiz, et cetera, et cetera. So the moment I saw when GPT-4 comes out, I was thrilled and I posted a long like blurb on LinkedIn. Basically saying like, you know, if I had, have this as a kid, I would have grown into a more complete human. So this kind of like, I feel like this like generative AI’s capability to meet kids where they are, especially meets your needs for curiosity. It’s game changing. So. I feel like I’m building this product first and foremost for a younger me that could have benefited so much from this. That’s pretty much the story about me. yeah, I know we see it and of course our parents right now we see there is a tectonic shift in terms of the skill landscape and what the future of workforce is going to be and even the existential challenge of what does human mean in a future society. So we do want to build something that’s centered around children, centered around the family to help them find what they love and build agencies around it at the end of the day. So yeah, that’s the two main driving force of me coming to Dex. But I would be honest about it. It’s like very random. When I want to start a company, a lot of my colleagues are very surprised, being like, oh my god, Renny, you’re getting into this field. But yeah, I guess I finally find the work of my life. Grace Shao (03:48) I love it. think you need to understand the passion and the personal reason behind the businesses to really understand why the design was frankly so intuitive and why you’re so passionate about building this and leaving such a comfy, know, like cushy corporate role. I think that’s the one thing that stuck out to me. The product itself is actually so natural to how children behave to your point, like my three year old. from morning to night, know, morning she wakes up, it’s like, mommy, what’s this? What’s this? What’s this? What’s this? How do say this? Why do you know that? Sometimes she gets angry at me. If I don’t know something, she’d be like, but you’re an adult, you should know everything. But the reality, especially with languages, it’s really difficult. So for example, yesterday she was coming back from her Mandarin class and she said, liu shu, she was pointing at random tree. And I was like, that’s not liu shu. All I know is not liu shu, but I actually don’t know what liu shu is in English because I think it’s only really common in mainland. I’ve never seen that kind of tree. Well, I guess it’s a willow tree. You don’t see it very commonly elsewhere. And then she kept on pointing at trees, but in Hong Kong, you clearly don’t have liu shu because Hong Kong is like tropical. And then she got really, really mad at me. And that moment I was like, wow, if we had a Dex camera, that would have been perfect. But I was literally trying to take a picture of it while we’re moving car and try to upload it to GBTB, like what tree is this? What’s the name of it? So anyway, I think it’s really great product design. And I want to kind of get into that a little bit. When you were designing it, what was the thinking? Like, what does it mean to be children first? Reni Cao (05:10) I think there are three layers of children first as a principle. The first layer we already touched upon that. So young children, their hand anxiety is very different from adults. they tend to use one hand to operate a device and another hand they want to use for sensory explorations, like they want to touch. Sometimes they want to just move things around. So this requires a different form factor that one handed use, very tactile, very intuitive for young children such that they can explore a world while harnessing the power of AI in this case. So this is kind of like the user, the special things about the user. And it’s a different design. think that’s layer number one. I think the layer number two is also that the device itself is a metaphor for the market as well. And in the market, we want to build something that’s drastically from the so-called adult-centric smart devices, namely the phones and tablets, to send the market a message that there could be a different option. There could be a good technology. There could be a family-centric technology. And we’ve picked this form factor utilizing the metaphor of magnifying glass. It is something you use to see some hidden wonders, otherwise you cannot see. I do think that’s the ⁓ second layer of the things, which is like metaphor and category creation. And at the end of the day, I do think we intentionally make the device kind of worth finding this fine balance between engagement and learning or kind of like a healthy aspect of the technology, meaning like we add a assistive screen, but we make it really kind of like limited and not the center of the whole kind of like a user journey. And we want to kind of like find a new way to put all the components in our consumer electronics world in a way that it strikes a more delicate balance and ⁓ let the device itself to be kind of like, you know, retentive. for children without getting them to be addicted. So it’s kind of like we intentionally make it a little less stimulating, actually much less stimulating than a lot of a thought-centric ⁓ product. So that’s the main three kind of like principles around the product design. There’s a lot of conflicting constraints here, as you can see, but we do our best trying to find what is the answer. And here we go. Like what you see right now is our first, you know, ⁓ answer we have thought through and I think the market validated the answer quite well so far. Grace Shao (07:39) Yeah, definitely. think exactly to your point, know, like a lot of times, I think when we as young parents looking at introducing technology to children, really worried about the big screens, addictive nature, or even the parental, even though a lot of them allow parental control, it’s the unlimited access to a wild, wild internet out there. Like all of these things are basically concerns and or reasons why we hold back technology from our kids. So actually on that note, do think you kind of mentioned it, right? Like technology over the years, especially big tech frankly, has garnered a bit of a bad reputation. And I think that was really tied to the rise of social media and all of this mental illness that came with it. And obviously like you mentioned the addictive nature. So what do you think is actually harmful to the children’s development when we are looking at tech? What are areas actually we can really embrace technology? I think you kind of touched on it lightly, maybe explain it to us in an even deeper, more technical way. Reni Cao (08:37) Yeah, our thesis is that why a lot of parents think technology is negative for a good reason. And the reason is that all the main status quo technology for children are built on top of the attention economy, as we call it. Everything revolves around time spent and how much attention, how much engagement in terms of like a minute, seconds, sessions you can get. That, is the reality because, think about it, you build an app on an iPad, immediately you’re entering a competition with Roblox, with YouTube Kids, with all the videos, all sorts of things out there. You could do well. You can try to do good for the society, for the families, but you’re effectively competing against more like... addictive kind of like a form factor of information and it’s a losing battle and as we call it is a rat race. So no matter what type of like educational apps or content you’re trying to deliver at the end of day you have to deliver them in more and more engaging way more and more gamified and more and more animation used etc etc. That’s I think that’s why it’s another reason why we need hardware at the end of day. I think the first step how we can create alternative reality is that we need to create a new world, a new kingdom where the business is built upon outcome rather than attention. Meaning like it’s not the time spent logic anymore. It’s like, can you use this device? For example, for Dex, you can use the device and you see the child speaks better after two months or your kid starts to have a like a love to speak Mandarin and not preserve the rest of their childhood. I do think there is a business model there like that, but I believe that business model warrant a totally kind of like a different design of the experience from ground up, from the device layer to the software, to the content, all the way to like user interaction. So I do think like that’s why the current technology is considered bad because it raised towards attention. And I think ultimately, inside Dex, I believe the final answer to create that alternates like a reality is can we deliver something that’s purpose built for children before we build a general sort of like, you know, like time spent logic, like a product in, in, in the case of the decks, is something that, you know, purpose built around the languages. cannot do a lot of things. It cannot, it’s not a chatbot. It cannot, it cannot play videos. But I do think even do one thing super well with the Frontier technology already delivers so much value to the families such that you can build a viable business model on top of that while creating values for families. I think being courageous enough to limit our scope to something to begin with, like really hold onto our principle, deliver a promise, create values there. is another internally operating principle to get there in terms of how to harness the technology. And I want to say that it’s very interesting. What we noticed that is a lot of people are trying to use the AI in quite an all-in-one way. So you can see a little device with tons of features in there. can generate pictures. You can talk to celebrities at chatbot. You can talk to Elon Musk on that device. And we think, actually, that would be a very slippery slope. ⁓ in terms of harnessing the technology at the end of the day. yeah, purpose-built is another very critical principle we’re holding on to, to create a good technology. Grace Shao (12:09) No, I love that. But I mean, from a business perspective, sometimes people might not have purpose built businesses, right? Unfortunately, some are not. Then thus, how do we basically help the industry align the business incentive to the product design incentive? Because, know, like what you’re saying right now, it makes a lot of sense. And I think once I saw Dex Camera, I was like, wow, why is there not something like this on the market? but it does feel like there’s a huge gap where, like you said, there is a big devices and the big tech. There’s this tiny niche little products, whether software product or hardware for children’s ⁓ use, but it doesn’t feel like people are taking it seriously, even though we all know parents are willing to spend on children if it’s for their good. It’s not like the economics doesn’t make sense. So why is there’s that gap right now? Reni Cao (12:56) I think you’re hitting on one of our most recent realization that the parenting needs and the children’s needs are quite long tail or as we call it, very like a versatile, right? Different parents have different parenting needs. Even when you look at the language as example, there are tons of different languages and even more dialects you wanna learn. Like let’s say you wanna learn Mandarin, you still got so many like a dialects there. There hasn’t been a real... kind of like technology that can enable a venture scale business that attracts talent, that attracts a good backing in terms of like a capital to build something that’s like a generational. But I do think this is the moment AI is strong. We finally have to make sure we can build one system. that can consolidate all those long tailed needs. Even for Dex, very specifically, you can learn a lot of languages and even more dialects with just like a nine person team building the hardware plus software. I think it’s the catalyst that’s much bigger than Dex itself. And I’m really excited about that. But I think another very interesting angle is like, despite the technologies there, you have another question. It’s like why there is not more company like Dex. I have a personal opinion here. When new technology comes out, people will tend to use it in the most sloppiest way possible. They were trying to just like, OK, you can chat with the AI, so why don’t we just shovel AI into a little box and put it into a Talking Fluffy and call it an AI toy. And that’s it. That’s my business. I do think it is like a gravity that’s pulling people away. from deeply think how to harness technology and pulling them towards something that’s so trivial and it’s just almost like a shortcut. I think that’s kind of like also, I would call that a trap on the entrepreneur side, that the technology is changing so fast and everyone’s a full mowing, everyone just wanna use it in some way. But I think in this sense, we as Dex, the company, we believe in that. we need to think very deep about how should we use this technology to meet users where they are and deploy like AI in certain ways so Shell can deliver the value. So that’s why we start small, but we’re going to expand from there. Grace Shao (15:09) Yeah. No, it makes a lot of sense, but I think I wonder if you guys all being parents like you just said have made a huge difference. I hate to overgeneralize, but like, you I’ve been in the tech space for 10 years, but usually either I meet men who are like 20 years older than me or they’re very young men who have not, you know, settled into a family yet. And I’m just saying when I tell people my mom, it scares people. They’re like, I don’t know what to say. I’m like, OK, like. I’m not trying to scare you off by telling my mother, but the reality is most of us one day will all have families. And when we do, we start thinking about the things around us very differently, our perspectives shift. And I think to your point when you guys had a lot of purpose designing this product, I wonder if it has a lot of, you know, reason because you guys are parents. Whereas if someone is an entrepreneur for the sake of being a business person, they might not have the nuanced understanding of what a kid needs and what they even think is good for a kid. So to your point, they create little stuffed animals with an L-I unplugged into it, which is horrendously scary. I would never introduce that to my kid, right? I’m getting very, very agitated about this. But you know, another one that we talked about kind of offline was like, I should be ambassador and be paid by Tony Box at this point, because I probably gifted at least like 20 of them out to friends with kids. I think they’re just like, on the surface, you think about it, they’re like, ⁓ a little box that plays music. You’re like, this is so easy. I can just use my iPhone. Reni Cao (16:13) Me neither. Grace Shao (16:32) to exactly to your point. It gives the kids agency, allows the kids to start navigating the world themselves and have preferences. For context for people who don’t have Tony boxes or kids at this point is you put these little miniature IPs, essentially they’re Disney or whatnot, and you can put them on the little box as a magnet. And then the box starts singing and has like seven or eight pre-programmed music or ⁓ stories. And then you can control with your little hands. And basically like you press the Reni Cao (16:54) stories. Grace Shao (16:58) big ear, the ear just like the volume goes up, small ear, the volume goes down. It’s like really, really great. So basically introduce technology to kids where they’re like, oh mom, I can control what I want to listen to today. But I don’t need to nag you about it to control the iPhone. I don’t get exposed to a screen. And I can sit there and be entertained for like half an hour myself. So I think Dext really falls into that category for me. Like, you know, we kind of skip the part where we explain how your technology work really and in a very day to day way. It’s basically like you hold a camera, you point at things, you click the button, you can say, what is this? And you default choose languages, right? You actually explain better than me, please. Reni Cao (17:35) So there are actually four questions here. So I want to actually react to all of them one by one. I think this is a lot of good insights here. I think Tony Box and Dex share one thing in common, which is they are children-led, or they are child-led in this case. Think in the POV of a child. The world is kind of like a scary place that you’re told to do this or that. you are brought to here or there, there’s not much quote unquote autonomy you could have. But now there’s a device that your parents actually are willing to let you operate and you can decide what type of content media or interactions you can get. That is just a huge reward to children’s like unlimited curiosity and their like a strong needs to be considered sort of like, you know, a big kid or a even grown up in a way. I think that’s the intricate magic that if you were not a parent, you haven’t interacted with children a lot, you will miss. So instead of saying like a parent’s made us a better product builder, I think at the end of the day, it goes back to the product 101 that you really need to know your user. You really need to know who are using your product. We spent such a long time with our kids every day. And early days, which is very funny, like ⁓ the first group of users using DAX is just our own children. And that gives us a huge edge there. Right. And I do think you mentioned that a lot of like startup founders in this category, sometimes they’re doing something with raised eyebrows of the parents. I do think they’re a little bit distant from the kids is one reason. And another reason is I do think there is a misconception that children are less. at the end of the day, lot of founders think, you know, those are toys or some gimmicky stuff. Kids, you know, you just give them something that can flash, they can make some sound, and children would love to use them. But I reject that answer. I think that assumption is completely wrong. Children are actually smarter than adults in certain ways. They just cannot verbalize it. But as I said, they already got their little taste. as the famous word, popular words, they got their taste and they sometimes can tell what’s a soulful piece of story versus it’s a very sloppy kind of story. So children actually knows that and they want quality experience, they want quality product, they can actually absorb something that’s really built well for them. I think that just gives us kind of like this endless. sort of motivation to polish our product as if we’re building this for the most critical sets of adult users because we think actually children are more and they deserve more. Now, coming back to how Dex works at the end of the day, I think the core loop of Dex is quite simple. You just take the little camera. I’m happy to actually send a video to be the bureau here. You just take a picture. ⁓ And they would just literally just tell you, let me actually take a selfie here. Hi. Let’s see what I can learn about this. Look at that big smile. It’s like spreading happiness everywhere. Can you say a smile? Just smile. smile. Yeah. This is like you get unlimited, like smile comes with some laughing too. It’s when you make happy sounds like, ha. Can you say laughing? Laughing. like the ones we use to listen to music. Do you like music too? Can you say headphones? This is actually English immersive mode. So you can, you can improve your vocabulary there. Grace Shao (21:06) how many languages you have now. Reni Cao (21:08) We have 16 languages and more than 30 dialects and it’s still expanding. And interesting observation here is like the smaller, the more niche the languages is, the stronger the demand is there, which we find is super interesting. Grace Shao (21:21) probably just harder to find offline solutions otherwise, right? Or like harder with the communities, assuming you’re an SF, finding a Mandarin community is not that difficult. You know, if you’re in England, finding a French community, probably not as difficult. if you go, you were saying like maybe like Arabic languages like that are not as mainstream, maybe in San Fran, you have people in San Fran wanting to do that, right? Or like people in Dallas last time you said, trying to learn Mandarin, which again, you don’t have a huge community. Very interesting. I’m sorry, I got very passionate about the topic. So I want to of swerve back to our conversation here about raising children with technology. I’m sure you get pushback. think people right now, there’s the other side of argument where everything should be organic. Everything should be very simple. Reni Cao (21:53) Yeah, of course. Grace Shao (22:08) And I myself, I’m a big fan of a lot of the Montessori toys. You know, they’re not buttons or not even power charged. They’re just little wooden blocks, but they’re designed very well for them to, you know, develop motor skills. So how do you kind of explain to parents today who are saying technology should be rejected in the childhood. Kids should just be reading physical books. should learn the way that we learned or even like previous generation learned. We should go back to touching grass only. So Like, yeah, what’s your argument there? Reni Cao (22:37) First of all, you are completely right. Every once in a while, we got a comment on our social media that, why don’t you talk to your own daughter to teach that language? Why do you need a device to do that? So your assumption is completely right. And my response to that is, first of all, actually, I respect that parent a lot. I believe in the most ideal world, organic human-to-human interaction and free play in the real world is great. There’s a lot of tech, like researchers actually Prove that right, right? However, I do think the parent miss out constraints here. Number one, you may want to talk to your daughter, but you don’t know Cantonese, for example. So there’s no way for you to teach some subjects or some skills that you want them to learn or you want to immerse them with. And second, all of us know that the contemporary society is more and more fast paced. Not all the parents enjoy this privilege. of saying, let’s slow down, set up a dedicated time for children to go out to places. All sorts of this ideal family style back in the 80s and 90s changed a lot, I would say. So we are, believe, rather than just blaming the parents, not spending enough organic time with their children, I do believe that technology should be introduced more as an option, as kind of like a gap stop. as one of the extra tools on the table. That’s why when we design decks, we don’t introduce chatbots, but we spend so much time on sharing the insights that what your children are interested in. What did they take a picture of? What do they want to geek on? What did they learn today towards the parent app? And just give them this little window to see the world through their children’s eyes. Give them good downtime topic. giving them a way to reconnect even as asynchronous. So I do think the concern is real and the overall kind of like, you know, judgment is very well reasoned. But I think what that’s the approach here is much more nuanced than saying like, let’s use technology to replace human. It’s not, it’s actually using technology to connect the humans, connect the parents and kids better. That’s the nuance I have to take a bit. Grace Shao (24:42) I see what you Yeah. No, no, I love it because actually I’ve seen some parents even give kids like little Kodak cameras these days and these little toddlers go around the world, take pictures of how they see the world and they’re so cute. My own daughter sometimes takes my phone and takes pictures around the home and I come back with a lot of selfies and pictures of her sister’s foot or it’s just very cute because you see the world through their eyes, right? And it gives like, it’s like technology doesn’t take all connection away. on technology. wanted to ask you about the technology. How do we understand that? Like how are you actually leveraging LLMs? How do you route through different LLMs or different languages? Is this something we talked about briefly? But I wanted to understand that bit more. Reni Cao (25:20) to share details. Where should we start? Grace Shao (25:22) Like how does it work? right now? So basically for the little Dex camera, can’t ask it, like he’s to your point, you didn’t build a chatbot. So I can’t ask a question. I can’t have a conversation. It’s not a companion, but I can ask it what’s this? How does all that work in terms of the back end technology and the guardrails you built up? Reni Cao (25:38) Yeah, I think in a 30K feed view, Dex are utilizing basically all the multimodal LM capabilities to understand what the children are looking at. And on top of that, we build sort of like a profile, interest profile for the children and the parenting need profile for the parents to help contextualize, you what responses should we give in that case? To give an example, if you’re a three year old, just starting to learn Cantonese and you are sort of like interested in a bunch of like a museum topics or you love like dinosaur skeletons and stuff like that, we will render you more challenges around kind of like hey let’s bring Dex to a museum and learn about different terms there and it will be English the primary languages teaching entry-level Cantonese things there. So basically like the visual understanding you certainly use like a multimodal LLM The response definitely use kind of a conversation API of a lot of like an ALM. And I think building out this context layer or this memory layer of like a children’s interest and parenting needs, that actually is more complex. That takes kind of like a full agent system to try to understand what matters, like condensing or distill insights into a profile and gradually kind of injecting that into our responses. I think that’s on a very high level. That’s it. We do use a wide range of LLM, mostly with Gemini and OpenAI. yeah, that’s kind of like the high levels. Grace Shao (27:08) I’m going ask a question you might not like, but I’m going to put you on the spot. When we talked last time, said specifically on Cantonese and Mandarin, you do use different LMS, but the accents can be quite funny. Like they’re a bit off. They’re not native sounding. Why is that? And how do you overcome something like that? Or other maybe non-English languages. Yeah. Reni Cao (27:12) No, ask me. First of all, you need to try again because we have a solution already. But definitely, hit. We are already squeezing. I’m so hard that we’re hitting the boundary of a lot of like, in this case, it’s a TTS of the leading providers. Because I think about it, I’m pretty sure you’re using English plus Cantonese. It’s basically using English to learn Cantonese. Is that the case? Grace Shao (27:50) Yes. Reni Cao (27:51) is a mixture of languages cases. The challenge there is that without fine tuning, there is very limited sample of someone that speaks very good English and very good Cantonese, and they mix them in like one sentences. So the data, the training data to begin with is a little flawed. Either you have accent English or Cantonese as the more common cases. That’s the fundamental root causes of this. And we’re having kind of like heavy lifting tasks to kind of like solve that. And with the foundational model getting better and better, think one day we’ll get there. And we can see that to be fully fleshed out in the next six months. You definitely hold us accountable. And I think this is right observation for mixed languages. It’s really hard. Yeah. Grace Shao (28:32) Yeah, I bet. how does it actually work right now? Like in terms of economics, like people pay you about $249, right? That’s the price of the product pre-tax. That’s not cheap. Like it’s much more expensive than a toy, but obviously bit cheaper than an iPad. How do I understand the pricing decision there and price? And then how does that relate to, I guess, how you pay for your token usage right now? Does that cover it? Reni Cao (28:57) Yeah. Yeah. Oh, big time. We actually have a pretty healthy margin and the tokens are getting incredibly cheap. Much cheaper than where we started. I’m talking about like in 96, 97. It were a fraction of the token cost of where compared to when we just getting started, which is back in 2024 February. At that time we don’t even have GBD4, we have GBD3.5. that’s the kind of like, that’s the kind of like, actually that time we have GBD4 but is we don’t have GPT-4.0. So it’s very expensive at that time. So now pricing. Actually, I have a let’s talk about the user-centric view and a business-centric view. On the user side, we’re actually adopting this value-based pricing model, which is like any enough day, language is a high value skill to acquire. I sent my daughter to a language immersion in the US. I’m very embarrassed to mention how much I spent on that school. Grace Shao (29:33) Okay. Reni Cao (29:49) And if DAX can offer 1 % lift or enhancement on top of that school, the price is fully adjusted and much more than that. So this is what I mean by like, and very funny that you mentioned toy, right? Toy is something that you get it, you play it for a couple of days, then you don’t see it, you don’t worry about it. And this is not what we’re trying to do. What we’re trying to do is we want to use a relatively high price to keep ourself honest about the value we’re delivering to the parent. Do we really teach a language or do we really get the kids to fall in love speaking that language? If we do so, that price is well-justed. If not, we’re going to give you 90 days of free return period. No question asked, just return it to us. I do want to use this pricing model to push us to deliver more value for the user. So that’s one aspect of it. And on the business side, very funny, you mentioned, I hate when people box us. into toy category. I don’t blame them. Natural reaction, but I want to send a signal to the market that if a team of talented people, hardworking parents, put their heart and soul in building a purpose-built device that harnesses AI and delivers concrete results, we could get out from the typical, stereotypical, like a toy average order value band and go much higher. Above that, it’s less about, I to keep my is more kind of like, want to send a signal to prove that the market, we have enough parents waiting anxiously for something similar to this and want to pay a perceptually higher price for it, a premium for it. But yeah, that’s kind of like we landed on that price. And it’s so funny that so many people in the early days tell us, you’re going to do $1.99, because anything that started with a one Grace Shao (31:22) Premium, yes. Reni Cao (31:36) is night and day different than like two, that it started with two. But I actually, I’m like launching a suicidal mission. was like, let’s actually make it start with two, but let’s deliver more value there because it’s never like, it’s not a retail business at the end of the day. We’re trying to create a new paradigm of digital parenthood and childhood. We need to hold a high bar for ourselves. And the price is very telling, like in that case. Grace Shao (31:59) No, I actually agree and I think would you categorize yourself in the same box as Tony box vertical? Would you? Reni Cao (32:06) Not really. ⁓ Tony Box is a, I would say they are a content business. they are, same thing with Yoto. Actually, their founders have deep backgrounds in labels, music labels specifically, and IPs. So they are effectively a distribution business that they are creating a new channel to distributing those IPs from Disney, from Spin Master, and et cetera, et cetera. And the other side, you can see that at Dex, we’re not Grace Shao (32:19) I see. Reni Cao (32:31) I think like IP partnership or putting characters on our device. And we actually optimize for value and outcomes, like I promised to you in one of our principle. So I would put ourselves in, I don’t know, the de facto smart device for families. Just very honestly, the family device, the family technology, maybe like this is where we’re trying to go to, but it’s a completely like non-existent category before we’re still exploring. Grace Shao (32:33) Yeah. Yeah, yeah. Okay, like family tech device. Reni Cao (32:59) and it may change how I call it. Grace Shao (33:00) think there’s some more similar things maybe in East Asia because the audio learning like you know even when I was very young like I remember my grandma had a 步步高步伏机 I don’t know if you know what that is it’s like those like tiny little yeah yeah basically what it is it’s like people learn English with it and I think it’s very very like mainstream in China for a while but like you know these things been around I think in East Asia because everyone is using it to literally learn English Reni Cao (33:11) The steps are fine. Grace Shao (33:24) But it’s very one dimensional. It’s like one language to one language. They basically embed a dictionary, make the dictionary into a digital one. And you can ask search questions. You can ask what this word is. might, more advanced one might be even like with images, but I think, I don’t know, in the 90s, I didn’t see any images. But yeah, it does remind me of that technology and that vertical. haven’t seen something like that too mainstream in the West growing up, you know? I think if I was when I was learning French and German growing up, that would have been so helpful to your point. But yeah, so I want to bring it back to sorry, I just want to bring it back to the the business. On the Tony box comment, I do believe their business actually could be really high margin because their product is only say like 199 or something like that, right? Like they’re the box. But each character is not a 20 bucks or 30 bucks. ⁓ My daughter is drying me up here because every two months she asks for a new figure. But my point is, it’s a great business, right? Like that thing just keeps selling. It’s like Spotify and a physical thing. So would you guys have add-on any services, software, hardware, anything? Reni Cao (34:32) We do. That’s a lot of investor has been pushing us regarding this razor razor blade business model. I think for us though, what we are ultimately delivering is a business more like an app store. It’s like where you can get personalized content and software for your parenting needs and for your children’s growth needs at the end of the day. We’re launching, not we’re launching, we launched two tiers of subscription so far to validate that. One tier, $10 per month, you got unlimited LTE, plus you actually got a curriculum packed in like a content library. Every day we give you one topic and in the topic you can explore a lot of new vocabulary, expression, know, new languages and it’s good kind of like content to consume. And I think what’s most interesting is our future vision is actually a $20 per month tier. In that tier, you can actually create activities for your children, personalize. Grace, can be like, I run this podcast. I’m a podcast host. How do I explain that to my kid and make it a little bit fun, exciting, and even adventurous as if the recording a podcast is a little journey? And by the way, my kid likes this way of storytelling. You could give a lot of like a prompt there. They’re actually based on the profile, the context layer, we’re gonna build sort of like interactive, like a content that involves taking pictures, speaking, and just like looking at the device for explaining what does podcasting mean. And this tier actually got really good like attraction. And when we look at their subscription retention, is above like 90 % in three months that shows early signs of product market fit. But this is what I mean by like our business setting of day. We are a channel to deliver like harnessed intelligence to parents such that they can build whatever content and software that adapt to their needs rather than just a purely search, then filter or control kind of like a timer. I really want the digital world to revolve around them, running out of way around. So in this case, Put it in a simple way, we give them a tool to build whatever they want, and we charge on the usage of the tool, pretty much. Grace Shao (36:41) No, I actually really see that. I love it. Because I think my husband was trying to use chat GPT for a while to create stories with my daughter. Like, add a pig, add a dog, add a whatever in this. And obviously, it’s not made naturally for this. So the stories don’t come out as, I guess, natively understandable for children. So I see where this can go. And the funny thing, you use my profession as an example. Reni Cao (36:49) Exactly. Grace Shao (37:05) example, like my daughter just thinks I talk all day, that’s my job, and she thinks that her dad sits at a computer and press buttons all day. So between the two of us, none of us are doing it much, just talking and pressing buttons. So it’d be really great if, you know, I can, I guess, lean on technology to find a better way to explain to children modern day careers, you know, that may be not as easy to explain as, know, mommy’s a doctor, and doctors go help people and save lives, which is like what my family has. you know, explained to us when we were growing up, you it was very clear. I want to kind of go on a little bit more about AI and parenting. I think there’s a huge discourse right now in the US, especially, I think from my point of view, where I sit in Hong Kong, in Asia, even yesterday, I was speaking to someone from South Korea, venture capitalist, they’re saying that parents and society seems to be a lot more open to bring technology into their day to day lives. They’re much more open to the idea of leaning into technology for personal use and less worried about privacy and you know these kind of issues I guess. So at a high level, what do you think, should we be concerned when we introduce technology to children? will they, you know, for example, taking pictures themselves that automatically goes into one of the LLMs. Is that something that... he should be mindful of or are there guardrails that can be built in? Reni Cao (38:26) We should be definitely mindful. That’s why we enforce ZDR, zero data retention across our stack for images. So even let’s say your kid take a picture of themselves, you cannot retrieve that picture even you want. You can ping me through my personal email. You cannot find that picture anymore. And OpenAI and Google signed a contract with us to burn a picture immediately, like zero data retention on all the usages. But overall, I do think Grace Shao (38:48) See. Reni Cao (38:51) It’s the company’s responsibility to introduce technologies to family and the family should hold a high bar there for sure. Because like the AI is so early and it’s way too powerful in certain way. And it’s like a kind of like a black box in certain way in a lot of different ways. that I definitely, I’m not a, I’m not that one of the technologies that wanted to like, you know, glorify AI and it is the future and stuff like that. comes with a lot of risk, especially like unproven. aspect how it impacts the children’s cognitive development and something like that. That’s also a reason why we work with researchers and professors ⁓ closely like in Mount Eucalon from UCSF and Harvard professors doing education and doing research using text. I do think there is a substantial risk here such that the And we as the entrepreneurs and we as the parents, we need to hold a high bar for ourselves and roll out things one by one. So I guess that’s why you will hear more about like, oh, that’s like, you could have done this. You could have made it more engaging. You will hear this much more often than you’d be like, oh, there is like an incident because, you know, we always prioritize, you know, safety first. We’d rather the device to be boring in certain way rather than introducing consequences that we don’t understand. So I think there’s a very interesting dynamic between the Western and Eastern in terms of their views about technology. And I don’t think it’s a family parenting only. It’s also even a whole society, general perceptions. Happy to chat about that, but maybe it’s a little bit off topic here. Yeah. Grace Shao (40:12) comes from the mindful design as well. ⁓ No, we can definitely talk about that a little bit, but I kind of just follow up on what you just said. So how should a parent evaluate an AI device or tech device when they are purchasing for children, right? ⁓ I’m sure there are different devices out there, maybe not exactly doing the same thing as what you’re doing, but other devices are tech native ⁓ or AI enabled for children. How should parents kind of go about this? Reni Cao (40:52) I’m not a parenting coach. I will share my views. Number one, do think we should bias, we should start from our needs first. Maybe let me put it this way. Don’t get carried away with all the possibilities of the AI. Ask yourself, what is the unresolved parenting needs you have and find solution there. Rather than, this AIX, then that’s just to buy that AI device and give it a try. That’s number one, I would adopt that. Number two, I do think it’s important to see what a company’s method is. They definitely put their methodology somewhere, their belief somewhere, their principles somewhere, they’re kind of like, like how you ask me about how we use ALM. I believe that the parents should definitely hold the company accountable to explain those details and ask, verify, and that’s crucial step. That’s the due diligence on them, right? And I do think that Finally, for any sort of AI product, I actually even think the parents doesn’t have to be getting into this searching and validating mental model. They could literally build their own in some sense. Given all the agent codings rising up and reducing the piece cost of software so low, I do think for lot of stuff, they should try. to accommodate their own parenting needs in certain ways. Like I saw tons of the parents go into cloud code generating like a, know, almost like a story writer for their daughter. That’s actually my previous colleague at Wish. And it was awesome. It’s just different blanks to fill in. It’s kind of mad lips type of like a story. I do think the parents can change also their way that they are in the autonomy right now to build whatever they want to build. Having said, it’s still a little bit of kind of like a Silicon Valley bubble type of answer, because honestly, in the world, the adoption of a cloud code is probably less than 2 % or 1%, I’m pretty sure. But I do think I would encourage parents to use AI themselves and explore a boundary, it can do, what it can does well, what it doesn’t. So then, the kind of I make a decision from there. Grace Shao (42:45) Yeah, no, I I appreciate that. It’s a very like thoughtful answer because it’s not just like A or B. of the day, think it’s parenting itself is so personal. It’s on how your family dynamics work, how you prioritize your time, how you want to parent. So when you want to buy technology for your children or incorporate that into their lives, it’s also a personal decision. I wanted to ask, actually, do you have any good case studies to share with us just a little bit? Reni Cao (43:11) We have quite a lot. What aspect, what, what type of case does he want? Grace Shao (43:14) Just like, I don’t know, like things that unexpected people use. For me, I mean, by default, just assume, yeah, people use it in urban areas, right? But then I think when I met you, you said, actually a lot of people use them, you know, in unexpected places, like orders come through all over. Reni Cao (43:19) Alright, I’ll give you one. One of, I immediately think about one thing, one, almost like ⁓ close to 5 % of our users, they bought Dex to help with speech delay. That’s something we never anticipated, but those parents are very frustrated with all the, as we call it, sometimes autism tech or the speech therapy tech there. It’s not meeting their bar and they saw Dex, they’d be like, I would try everything right now for my kid. And surprisingly Dex helped them. and it makes them real happy. And you can find actually all those real reviews in our review sections. Quite a few family mentioned that their kids refuse to speak certain languages or just even English, but that’s kind of necessitate the language as a fun activities. And all of a sudden, the kids start to open up and speak more, and the parents are really happy about it. This same exact story happened with my co-founder, who is really worried about his, at that time, two-year-old young son having speech delay. But I want to disclose the name, but the song he actually first time spoken like coherent like ⁓ Chinese phrases using Dex and he caught it on a video. That was one of the most wholesome moment of our kind of like a user feedback in our channel. And right now we’re actually ⁓ volunteering to develop this special need mode. That’s kind of like, you know, customizing to special needs children. especially like April is the world kind of like autism awareness month. And yeah, we just want to do it. And we want to donate to Dextre researchers and speech therapists to help us do it. This is a totally kind of like a side quest, but it just like give us it gives us so much kind of like energy. You’re thinking about technology can be used in a way that’s like immensely helpful. Grace Shao (45:00) That’s amazing. Yeah, and something unexpected, right? Okay, I think I want to wrap up our conversation because I don’t take up too much of your time, but I do want to ask you one big macro question. With you working on whether you like to call it physical AI or not, essentially like a physical product hardware time software, how do we understand that trend going forward? Do you think AI will be essentially integrated, plugged in to more more hardware devices? What’s your view on that? Reni Cao (45:33) I do think there is a consensus that every wave of software technology revolution, there will be kind of like a device revolution following that. We are at the tipping point there. That’s like people starts to reimagine, where is this? this cloud? Is this the recording card? Maybe it should be a separate, like an ⁓ AI. Or this is a sort of like a little pendant that can kind of like ultimately listen to your life, help you organizing. I do think we’re at the ⁓ the dawn of a next wave of hardware. But it’s less about we’re doing the hardware because of the, I do think this is a software or technology driven type of hardware revolution out there. I do anticipate that. I do at least what I’m 100 % sure is like smartphones are not designed for children. Tablets are not designed for children. Families deserve something built with their interest. their needs in the center of the spotlight. And I see that happening. And that’s why we started this company. And I bet there’s going to be tons more use cases there. Grace Shao (46:33) No, amazing. Thank you. I think ⁓ one last thing. Is there anything I missed or anything you would like to share with us? Reni Cao (46:39) By the way, I time. If you want to turn through all the questions, I’m happy to be here. I don’t have anything else after this meeting. Grace Shao (46:44) no, don’t worry. think it’s a lot of times like I use them as prompts. But you know, when we’re chatting, like we actually covered most of it, you know. ⁓ Yeah, is there anything you think we missed? But from my end, like I feel like I covered most of it. You know, we did technology, we talked about children, AI philosophy, talk a bit about your business model. Reni Cao (46:50) Yeah. Yeah. Yeah. I do think you would want to talk about. Yeah, go ahead. Go with one last one, and I have one for you. Yes, go ahead. Ask yours first. Grace Shao (47:04) I think one last one. You go. So I want to ask you one last question, which is a question I ask every guest that comes on the show. What is one differentiated view you hold? I feel like your whole thesis around devices right now on the market are not made for children is already a differentiated view. But is there anything else you think that you hold that’s non-consensus? Reni Cao (47:29) Yes, with this view, I got beaten up so many times, but I still got to say it, right? I believe that education should not be cookie cutter. It should be highly personalized. So is entertainment. So is the parenting software. And we’re about to enter the golden age. Finally, this is becoming the reality. And let me say it this way. You look at a school in the US, how you tell the school is good or not, you look at one ratio. It’s called a teacher-student ratio. One teacher taking care of less, but why? Because then the teacher can accommodate, individualize the needs. I actually have a very radical view in terms of our education system is definitely lagging, significantly lagging against how our society evolves, how the technology evolves. It’s still a one size fit all and industrial way. to handle education, handle like, you know, testing, standard testing. It hasn’t really changed in the past couple of decades, but the world is a different place now. And I guess my view is like, it shouldn’t be that. The default shouldn’t be that. The default is like every kid should almost have their personalized tutor and the playmate that deeply understand them. Unfortunately, that’s impossible before, resources-wise. But I guess we need to strive to get there. as a race, as a humanity. Because each kids just come up, come with their own spark. that will miss out the window to make that spark their lifelong journey. But I’m not trying to attack on educators or school systems, something like that. I just feel like there needs to be more forces from the society, especially from the tech side, to help together build this alternative, enhanced of like a system that really delivers individualized education. Sometimes I use the word scaled homeschooling. And you cannot imagine how much people hate that. people are like, homeschooling, you’re taking away the social aspect of it. People are very constrained on the vocabulary of how they describe things. But I guess when I say homeschooling, it’s not about keeping the kids at school and hiring a teacher. And that specific process right now, I’m talking about really meet children where they are in terms of their growth, in terms of their needs, in terms of the skills they’re going to develop. I call that a differentiator, but maybe actually lot of people will share the same views. I’ll be happy to know who shared the same view and please join us in the journey. Follow us along. Grace Shao (49:49) I definitely think that view is definitely, feel anecdotally a lot more prevalent in SF when I visit. I’ve met other people like yourself, other people in the tech space or, you know, investors who are embracing this idea of modern homeschooling. And they say the same thing. They’re like, we don’t like to use the word homeschooling because, it sounds like a bit more cultish, but it really isn’t right. Like it’s really focusing on individual ⁓ growth. I think it’s amazing because I also think it’s because Silicon Valley itself kind of harbors this kind of growth and mentality and that the fact that people can succeed without degrees, people can succeed by building different things, people can succeed in just being different and being themselves, but the best version of themselves have always been, I think, what drives a lot of people who want to go to Silicon Valley because it’s like in many ways, as a mayor, talk to see like the best version of my talk to see, right? I think in East Asia, even as I put my kid in school right now, I find people definitely a lot less like that minded. ⁓ I don’t know if it’s a cultural thing because like, know, for you, you know, I grew up in Canada for me, I always felt like, you know, having that freedom to learn, explore when you’re young, which is more Western kind of way of, I guess, education was good. But I think a lot of peers here actually believe that, you know, for the first like say eight to 10 years, that foundational education should be drilled in. know, ⁓ grit should be taught, discipline should be taught. But it’s very interesting because it does kind of, I guess, manufacture different kinds of stereotypes. And I think it’s fascinating. And I think one more comment on that, I know this conversation has been more personal than we thought it would be, but I love it, you know. I don’t really get to talk about motherhood that much in my podcast. It’s usually about tech and bros and tech bros and about, ⁓ and about finance. but I think even, you know, when you have kids, people talk so much about nature versus nurture. And what I realized is I was shocked to see the nature come through, as young as like six to eight months in a child. Reni Cao (51:36) You Grace Shao (51:53) their personality starts coming through and by the time they’re one to one and a half, they start kind of babbling, start demanding things. I realize 80 % of it is all nature. It’s like their preferences for how they socialize, their preferences of even noise, even you can realize like your point, your taste. You’re going to find a six months old who just wants to sit in a corner in a play group who just wants to flip through books literally and just undisturbed. You’re going to find someone who’s screaming in the middle of the whole group. you’re gonna find my daughter who’s rolling over everyone and just like trying to knock everyone out. And I don’t know why. You know, you’re gonna realize all of it is nature. And even I believe agency, autonomy, grit, and desire to actually succeed, that itself is nature. And I don’t think you’d be taught. And I think this is a bit controversial. But definitely I think my husband and I have been thinking a lot about this. We’re like, we can just provide them what we can. But there is no... point of even pushing them when they don’t want certain things. the best is to push them in a direction that they want to be pushed and they will tell you. I think this is like kind of the difference in our generation of parents. yeah. Reni, thank you so much. ⁓ Yeah, go on. Reni Cao (52:49) Exactly. Yeah, but can I, I know this is over time, but can I add one last comment towards what you say? But I think what you said, especially growing up in East Asian, like, you know, education system, it has been industrial for a good reason, right? At a time where stuff like AI doesn’t exist, the most effective way, Grace Shao (53:03) No, of course, of course. Reni Cao (53:19) to develop fundamental knowledge workers, plus finishing the job of dividing the children into different segments and give them different levels of education. That education system works perfectly. I entrance exam, as I’m talking about, taking standard tests and stuff like that. But all we know that is AI is sweeping through all the knowledge works. and specialized in knowledge works, honestly, Asian parents like favorite jobs, like being a doctor, especially radiologist, you know, and or being a lawyer, you’ve got to start somewhere as associate. Now it’s getting kind of like his hardest. The world has already changed. The tsunami already hits. But I don’t think people actually understand the level of the s**t. A lot of like everyday people in the world, they haven’t felt. this like a tsunami, right? So when you say you want to kind of like, you know, like find define your children’s nature and push them towards kind of like what they are intrinsically motivated about and give them resources to set them up for success, building grades are on the way. I do believe that I think I will 100 % agree with you that it will become the most fundamental aspect or element of education in the next like five years or even sooner to be fair. That’s why I don’t send my... to put it in a simple term. I don’t send my daughter to Kumon. I don’t want my daughter to do Russian math. I never benchmark her against like, oh, like the other kids can read at the age of like three and a half. Why don’t you? Actually, I don’t because I fully understand that kids have their own time zone. Kids have their own spark. All you need to do is think deeply to define that, to understand that, understand why my daughter sometimes is super sensitive, understand why sometimes she got frustrated and want to hit. Grace Shao (54:31) I feel very validated. Reni Cao (55:00) Don’t take that on a surface level with the other tools you have. Go deep, understand that, and build these programs that’s personalized to her and help her. And I think like this is why I if I, talk about the word of Nei Juan a lot. If I have to dream on anything, right? I have to like a rather like ruthless compete on anything. I complete the deaf understanding of my daughter rather than anything else. Grace Shao (55:03) 100%. Reni Cao (55:26) Because I actually think that’s the thing people gloss over. People must be like, education is just checklist. You got to check, check, check, check. And there is a better checkbox. Like Ivy League school, there’s a OK checkbox. There’s a worse checkbox. Forget about a checklist. That checklist is obsolete already. So I respect. I think we vibe together in terms of our schools of parenting. Grace Shao (55:33) Yeah. Yeah, 100%. No, I agree with you. parenting style. Yeah, yeah, yeah. Reni Cao (55:51) But you’re so fully intuitive. don’t know whether I’m right or wrong, but this is what I firmly believe in. And I believe someone’s going to join this journey. Grace Shao (55:58) I think there’s more people who are aware, especially people who are more plugged in with the technology because they realize how fundamental society will change. I just thought about when we were young, I’m sure your parents also told you to go to university, go to this, go to that, right? For sure there was a hierarchy in their mind, what kind of school you should go to, what kind of degree you should get. Now I really don’t think that’s the case. Actually, a lot of my readers would even know. my dad really forced me, well, pushed me, encouraged me to go into finance. And at one point he was like, if you don’t study finance and don’t work in finance, you’re not like following my footsteps and blah, blah, blah, blah. Right. And it was a very, it became a personal reason to do it. It’s not because I wanted to, or I was good at it. And there was actually a battle between us being like, I want to go into journalism. And he’s like, no, I was like, no, I’m going to go to journalism. He’s like, I’m not going to pay for it. You figure it out. But the beauty of it is actually found a way resourceful enough to get a full time, full scholarship. And I still want to journalism. Again, I recognize how lucky I was. I I found the opportunity to do that. But most kids actually just end up then doing what their parents told them to do and they never, and they never actually live their best life or become the best versions of themselves because they’re doing something not actually fundamental. Reni Cao (57:08) you hit a very critical, I think it’s a background or context. There wasn’t an abundance before, right? Growing up, let’s say in the eighties, It’s a relatively kind of like a society. It’s relatively kind of like not that sort of like, you you wouldn’t call it abundance at the time. Let me just put it that way, right? You still need to compete for stability, compete for resources. That’s why there’s a rat race in education, which I totally understand. That’s kind of like. It’s like a whole

18. maj 20261 h 0 min

Assembled co-founder John Wang on building a AI native support system for enterprises

Beskrivelse

Kommentarer

2 måneder kun 19 kr.

Alle episoder