Litigating the Pope's AI Encyclical with the Lawyers of Scaling Laws Pod

Beschrijving

In this episode of Justified Posteriors, we host Alan Rozenshtein [https://x.com/ARozenshtein] and Kevin Frazier [https://x.com/KevinTFrazier] — the law-professor duo behind Lawfare’s Scaling Laws [https://www.lawfaremedia.org/podcasts-multimedia/podcast/scaling-laws] — to take two of the most-discussed AI policy documents of the spring and subject them to an inquisition. Our disputors are probably not what Pope Leo anticipated: two lawyers, two economists, and probably 3/4ths Jewish. Talk about a crossover episode! First up is Pope Leo XIV’s 42,000-word encyclical (that’s Pope-talk for letter) on artificial intelligence. Magnifica Humanitas [https://www.vatican.va/content/leo-xiv/en/encyclicals/documents/20260515-magnifica-humanitas.html]: On Safeguarding the Human Person in the Time of Artificial Intelligence lays out 5 principles of Catholic social teaching, and then explains how this should shape Catholicism’s approach to AI. We focus on two in particular. The first is subsidiarity, which Seth summarizes as Catholic federalism, the idea that most decisions should be made at as local a level as possible. We discuss both the economic argument for this, but also what the Pope adds to Hayek: Decentralization not for efficiency’s sake, but a kind of ennoblement, the dignity of deciding things locally. The second is the universal destination of goods, which the encyclical extends to “immaterial goods”. This leads to the positive argument of the Pope - that AI should be undertaken as a communal project with decentralized power and discussion, rather than a technocratic “Tower of Babel” that will lead to ruin and division. Much of our disputation focuses on whether these principles actually resolve the important questions. Is the Pope rightfully cautious about an emerging technology, or was this an opportunity to take a stronger stand on what constitutes AI Sin? Interestingly, the Pope’s strongest stand is against transhumanism, which would be a plausible resolution to the dialectic of “Butlerian Jihad” vs. worship of a new machine god. Then we pick up DeepMind’s “Positive Alignment” [https://arxiv.org/abs/2605.10310] paper, and the economists get grumpier. Andrey complains that the paper is vacuous, failing to take a stand on actual practical goals or methods. But it sets us up for a good conversation about several issues: Such as liberalism of fear [https://en.wikipedia.org/wiki/Judith_N._Shklar], a type of anti-utopian liberalism; whether “flourishing” is something you can A/B test towards; and where the ‘constitution’ metaphor behind Constitutional AI works vs. breaks down. We also tease a joint project, “SCOTUS Bench,” a new benchmark for evaluating AIs’ ability to predict appeals court outcomes. Watch this space for more on that soon. Related Links * Scaling Laws [https://www.lawfaremedia.org/podcasts-multimedia/podcast/scaling-laws] — Alan and Kevin’s AI, law, and policy podcast at Lawfare * Alan Rozenshtein on X: @ARozenshtein [https://x.com/ARozenshtein] · Kevin Frazier on X: @KevinTFrazier [https://x.com/KevinTFrazier] * Magnifica Humanitas [https://www.vatican.va/content/leo-xiv/en/encyclicals/documents/20260515-magnifica-humanitas.html] — Pope Leo XIV’s first encyclical, “On Safeguarding the Human Person in the Time of Artificial Intelligence,” in full, straight from the Vatican * “Positive Alignment: Artificial Intelligence for Human Flourishing” [https://arxiv.org/abs/2605.10310] — the DeepMind-led paper (Laukkonen, Krier, et al.) arguing alignment should optimize toward flourishing, not just away from harm * Claude’s Constitution [https://www.anthropic.com/constitution] — Anthropic’s ~20,000-word statement of Claude’s values and character, released under CC0 * “Claude’s Constitution,” with Amanda Askell [https://www.lawfaremedia.org/article/scaling-laws--claude's-constitution--with-amanda-askell] — the Scaling Laws interview with the document’s primary author (the one we keep saying we’re jealous of) * The Moral Machine [https://www.media.mit.edu/projects/moral-machine/overview/] — MIT Media Lab’s crowdsourced trolley-problem experiment: millions of judgments on the grandma-versus-criminals ratio * Meta’s Oversight Board [https://www.oversightboard.com/] — the “Supreme Court of Facebook,” and Kevin’s cautionary tale in institutional design * Andrew B. Hall [https://www.gsb.stanford.edu/faculty-research/faculty/andrew-b-hall] — Stanford political economist on deliberative democracy, platform governance, and what went wrong with the Oversight Board * The Anthropic Economic Index [https://www.anthropic.com/economic-index] — the adoption data behind the “whole countries blacked out” point * Judith Shklar, “The Liberalism of Fear” [https://en.wikipedia.org/wiki/Judith_N._Shklar] — the cruelty-first, anti-utopian liberalism Alan invokes against thick conceptions of the good Timestamps (00:00) Intro — two papers, four hosts (01:47) Paper 1: Pope Leo XIV’s encyclical, Magnifica Humanitas (04:00) Subsidiarity, or “Catholic federalism” (12:26) Does the Pope take AI seriously enough? Mind-body dualism and the ex cathedra problem (15:34) The coming religious schism over AI personhood — and the Butlerian jihad (18:06) Transhumanism and the dignity of human limits (20:59) When is using AI a sin? Best-man speeches and eulogies (25:05) The universal destination of goods — is AI access already universal? (33:37) Is AI a centralizing technology? Dignity vs. efficiency (36:37) Freedom vs. control, the labor market, and make-work (41:10) Chess, the centaur era, and living after we’re no longer the best (47:34) Sponsor: Revelio Labs (48:49) Paper 2: DeepMind’s “Positive Alignment” (49:17) The liberalism of fear and thick vs. thin notions of the good (53:53) Is positive alignment an empirical question? A/B-testing flourishing (56:29) What would a useful positive-alignment paper actually do? (58:09) Constitutional AI as a site for public participation (1:00:47) The Moral Machine and trolley problems at scale (1:01:08) Does the “constitution” metaphor hold? Virtue ethics and self-binding (1:10:02) Running every Supreme Court case through the models (1:10:53) Lessons from Meta’s Oversight Board (1:15:09) Wrap-up Justified Posteriors is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber. You’re also invited to our Discord community at: https://discord.gg/2r3pExumQ Our sponsor This episode is brought to you by Revelio Labs [https://www.reveliolabs.com/], the leading provider of labor-economics data, available to academics on WRDS [https://wrds-www.wharton.upenn.edu/]. Transcript: Seth (00:00:00): [upbeat music] Welcome to the Justified Posteriors podcast, the podcast that updates beliefs about the economics of AI and technology. I'm Seth Benzell, always positive and always aligned, coming to you from the Pocono Mountains of eastern Pennsylvania. Andrey (00:00:23): And I'm Andrey Fradkin, coming to you from San Francisco, California. We're sponsored by Revelio Labs, fine purveyors of data products. And we're very excited to have Alan Rozenshtein and Kevin Frazier from the "Scaling Laws" podcast on the podcast today. Welcome. Alan (00:00:43): Yeah, thanks for having us. Andrey (00:00:45): Just for our listeners, why don't you tell us a little bit about "Scaling Laws?" Kevin (00:00:51): Sure thing. So our main goal here is to provide robust and timely analysis of all AI policy questions. And that's an expansive ambit, and it's one that keeps us really, really busy because if it's not an executive order, then it's some big new policy idea from one of the labs, or it's some new economic report. But really what we try to do is dive into the weeds of policy and legal issues that are emerging in the AI space, given our backgrounds as law professors. But Alan's the one with the brain, so I'll let him fill in the details on earth. Alan (00:01:30): No, that's a perfect description. Yeah. We just think that there's a lot of really interesting stuff happening at the intersection of AI, law, policy, especially around national security, which is the core focus of the publication that "Scaling Laws" is part of, which is Lawfare, and so we're trying to fight the good fight, and it's never a dull moment. Kevin (00:01:47): That out of the way, I think we can dive into our first paper. Although, I think by any podcast standards, our first paper is lengthy to say the least, dealing with the- Andrey (00:02:00): Mm Kevin (00:02:01): ... pope's encyclical at 42,000 words, or for all those listening, about two and a half hours on my stationary bike. I don't know what that says about my biking skill- Andrey (00:02:13): [laughing] Kevin (00:02:14): ... or my reading ability, but it was a very tiring afternoon. But a very extensive, very important read from Pope Leo. And this has been covered by a lot of folks, but I don't think it's ever been covered by two lawyers and two economists at once. Andrey (00:02:34): [laughing] Kevin (00:02:36): My hunch is that this wasn't what Pope Leo was anticipating when he was sitting and putting a... I like to think of him writing with a quill and- Andrey (00:02:45): [laughing] Kevin (00:02:45): ... on some very old paper. But, I don't think he anticipated this podcast duo diving into his encyclical. Seth (00:02:55): Well, hopefully, our analysis will be less a Tower of Babel of technocratic overreach, and more a blessed city of Jerusalem built together by our common efforts. Kevin (00:03:06): Seth did his reading. Seth dove in. All right. Excellent. Good to hear. Well, I think at this point in time, we're talking in early June. By the time folks are listening to this, unless you've been living under- Seth (00:03:18): There may be a new encyclical. [laughing] Kevin (00:03:21): Somebody- Seth (00:03:22): Pope maybe changed his mind. Kevin (00:03:24): Yeah. Ugh. But there's so much to cover in this encyclical. Obviously, we could start with just the pope's analysis of the role of the Church and of social doctrine, which he gets into in extensive detail, and that covers about 20 to 30 pages. I think for the sake of our podcast, that's probably not our main forte in terms of analyzing the evolution of the Church's social doctrine. But I will let anyone intervene there if they're extremely fired up about that posture. Andrey (00:04:00): [chuckles] Kevin (00:04:00): But I do think that the first area for us to really explore, that both economists and lawyers can appreciate, is this idea of subsidiarity, which is really- Andrey (00:04:12): Mm Kevin (00:04:12): ... the notion that we have various institutions operating at various levels of jurisdiction, and that ultimately we want to devolve regulation or governance of an issue to the smallest capable actor. And that has a lot of resonance and a lot of power in the Church's teaching, which is to say you have this centralized entity, the Catholic Church, and yet we have parishes all over the world. And so how do we distinguish between the issues that Pope Leo needs to decide and the issues that parishes and then to, switch to a different context, local governments versus national government versus international government. How do we think about the allocation of responsibility there? So I would love to just hear the initial thoughts. I know Alan will have thoughts. But from an economic perspective, what is the relevance of a sort of subsidiarity principle to the governance of emerging technology? Seth (00:05:15): Oh, well, Catholic federalism. Andrey (00:05:16): Well, there- Seth (00:05:17): I love it. Kevin (00:05:18): Exactly. Andrey (00:05:19): Let me take a little stab at it. I think this hearkens back to the central planning versus markets debate in the sense that we could have AI policy be governed at the national or even a supranational level. But there is a risk that those laws are not going to be well-suited to individuals with heterogeneous preferences and heterogeneous information about their needs and their constraints. And so to the extent that we can allow for governance to happen at a more local level, then that AI, the way in which it's used, is going to be More appropriate, more beneficial for everyone involved. So that's kind of the high-level thought here. But then, of course, with something like AI, you are worried about externalities of various types, right? So, the way in which one group decides to use the AI may affect everyone else, and then that kind of pushes things back up to the top because you need coordinating mechanisms, and that's pretty hard. But also, this is a very abstract discussion, and I always like to think about specific issues, specific AI policies about which we can think about. Kevin (00:06:44): Seth- Seth (00:06:45): Are you going to give us one? [laughing] Andrey (00:06:47): Well, so an example might be by what constitution is the AI trained from. Kevin (00:06:58): Sure. Andrey (00:06:58): So, if every little village had its own Claude with a different constitution, there might be a scenario in which the constitution of one of the Claudes might say, "Help us dominate our neighbors." And that would obviously have [chuckles] a negative repercussion on the neighbors potentially. And I think there's this kind of a separate thing there now that I'm bringing it up, is that it seems at least, given today's technology, pretty implausible to have that many different Claudes. We don't know how to do that yet. The training runs and the post-training are pretty catered to one thing and very expensive, and so maybe we'll get there one day, but at the moment, it doesn't even seem very affordable. Seth (00:07:49): Mm-hmm. Andrey (00:07:49): Yeah. But I'm curious what you think. Seth (00:07:52): Yeah. So I'll jump in here. I'll say that, first of all, we're all kind of maybe libertarian-leading guys. I don't know if that's fair to say for you, too. So as I was reading this, I was thinking, "Oh, those law fair guys are going to really like the subsidiarity. Of these five points, that's going to be the one they like." [chuckles] And I think Andre gave a good analysis of the economic take on why you would want subsidiarity. So, what is the pope adding to that that wouldn't be in Hayek's argument for decentralized planning? I think what he wants to add is there's a kind of an ennoblement. There's a kind of positive good vibes that come around from the local decision-making, even above and beyond the efficiency arguments for decentralized planning, which is I think what Andre was emphasizing. And this is definitely an essay that is sort of a little bit anti-putting efficiency above everything else. The second thing I will say is that it's sort of interesting reading this principle of subsidiarity. Obviously, we know the history of Catholicism and its various schisms, right? Obviously, sometimes- Kevin (00:09:03): Yeah. I don't tend to think of it as a super decentralized religious and spiritual movement. I will say with a name like Rosenstein, I am neither qualified to opine on Catholic thought, nor do I have a ton invested in it. But- Seth (00:09:16): Mm Kevin (00:09:16): ... since we're talking about it- Seth (00:09:18): Let's do it Kevin (00:09:18): ... the Catholic Church did not strike me as the most let a thousand flowers bloom type institution. Seth (00:09:26): It's not clear to me that establishing that the sacrament of communion is literally Jesus in the wafer. It's not clear that that had to be the centralized decision to be made. From the outside, it's not obvious what are the high-up decisions and what are the low-down local decisions. But I guess that's where I'm going with this, right? Which is, alongside this idea that you should decentralize things when possible is this very sort of Catholic teaching-based view on what are the important things to not decentralize. And those are some kind of foundational ideological commitments around prioritizing the poor and one of these principles I'm sure you'll get to soon is the universal destination of all goods, which seems to mean something like communal ownership. So, as we talk about this, one thing I'll be keeping an eye on is to what extent is subsidiarity in tension with this idea of there being a kind of a common good that the pope can tell you about. Kevin (00:10:27): Yeah, and I really appreciated your flagging of the fact that so much of this seems to be grounded in the pope's insistence and hope for people to feel a sense of agency in the AI era and really leaning into the sort of humanity of this all. And so I think subsidiarity is a sort of end round circumvention to that point of saying, how can we find a way for people to feel like they have a mechanism by which to actually assert autonomy in this domain? But to your point, Andre, and something that I think from a technical standpoint is really interesting, is that even feasible in terms of policy development that reflects cultures around the world, even the entirety of the Catholic base, right, which we know spans from South America to Southeast Asia and everywhere in between. As things stand right now, there are whole countries that don't have access to Claude yet, right? And there are whole countries for whom I'm guessing that their performance in their language is probably pretty poor relative to English, for example. And so just from a technical standpoint, that's something that I don't know was as thoroughly addressed about the just technical feasibility of some of the ideals of subsidiarity and something like that. Seth (00:11:49): Yeah. He talks about trying to bring ethics into the research lab. So maybe that's pointing towards what he wants us to be working on. Andrey (00:12:00): Yeah. It's not really clear in some way who the audience ... of this piece is. It's unlikely that typical Catholics would read it, of course. It's a very long and dry document. People at the labs, I guess, could be reading it. I guess that might be just the main audience of this document, but I'm curious what you think. Alan (00:12:26): Yeah. I think there are kind of two different parts of the document. Obviously, there's this whole long thing about Catholic social thought, which is interesting, but I think somewhat orthogonal to the discussions that AI watchers are interested in. On the AI side, it seems like he's making two different sets of arguments. One is a set of social arguments about the effect of AI and the dignity of labor and the need to spread resources equitably, and I think those are perfectly fine arguments. You can agree with them, you can disagree with them, but those seem like perfectly reasonable social and kind of political positions to take. The other side is his engagement with the technology itself, which I did find somewhat disappointing. Now, on the one hand, the fact that the Pope's first major written output is about AI is itself, I think, remarkable, and he should get a lot of credit for that. You can hardly accuse him of ignoring this epochal issue, but we shouldn't grade him on a curve, right? He is- Andrey (00:13:33): [laughing] Alan (00:13:34): He is trying to engage- Andrey (00:13:36): Grade him on curves. Alan (00:13:37): Well, I'm just saying, I'm really impressed that he's engaging on this issue, but he is engaged on the issue. Okay, so how well is he engaged on the issue? And on the actual issue of AI itself, I don't know. I think I'm stealing this from Matt Yglesias, who had this, I think, pretty good post on X, which was something like, "I get why he has to say this because he is, after all, the Pope, but the whole mind-body dualism is not very helpful in discussions of AI." I'm paraphrasing here. And what Yglesias is referring to, and I felt this as well when reading it, is there are these ex cathedra pronouncements in the encyclical- Andrey (00:14:11): Literally ex cathedra. Alan (00:14:13): Yeah, I guess literally, right? I guess literally. You are the Pope, after all. About how AI can never have moral responsibility, AI can never have thoughts, AI can never have feelings, AI can never have this, and AI can never have that. Which, look, I understand that is a highly intuitive view, and yes, I guess if you are literally the Pope, there are certain metaphysical commitments of your religion that perhaps might require you to take this position, and I don't think this is simply a Catholic position either. I suspect if you were to ask the chief rabbi of Israel or some high-level Islamic thinker, they might give you a sort of similar position on the metaphysics of all of this. I'd be very interested in what a Buddhist scholar would say. I suspect actually that might be the most fruitful engagement. A kind of tradition that really takes no self seriously, I think, might have a lot of very interesting things to say about the metaphysics of AI personhood. But I think in some ways, the problem to me with the encyclical is that it doesn't take AI seriously enough. It's not actually as AGI-pilled as I would want it to be because, and here I'm going to steal from friend of the Lawfare pod, Dean Ball, who made a very excellent point. The existential questions about AI are not actually about the distribution of resources in a post-scarcity economy, though those are very important, to be clear. It's about: what is the place of humanity when we are no longer the smartest and most sophisticated entities? That is the fundamental spiritual and existential question, and that is one which a posture of "AI will never have feelings, AI will never have moral responsibility" just tries to kind of drive out of polite conversation. But the problem with that is that, first of all, I think it's wrong. But even if it weren't wrong, it's actually not, I think, going to be responsive. I've had this idea that I think may be crazy, but I think may also be correct. We'll see. I think- Andrey (00:16:20): Those are my favorite ideas. Alan (00:16:21): Yeah, right. That the future religious schisms will not be between Christians and Muslims and Jews and Buddhists and whatever, but it's going to be people who think that AI has potential moral personhood, maybe even divine personhood. Certainly, if you're going to kneel at the altar of the machine god, that's a big deal. But even if you don't think that they're literally divine entities, if you're convinced that the AI that you're interacting with has so thoroughly passed the Turing test, that it has a kind of moral personhood, that's going to have profound religious implications. And then on the other hand, you're going to have a set of religious views, and I think you're going to have a lot of the incumbent religious bodies here. Which is why the Catholics and the Jews and the Muslims and the Hindus, it's the beginning of a joke. We'll all get together on this side that says, no, AI must be. We must have a kind of Butlerian jihad, to cite Frank Herbert here and the Dune series. We must take a kind of Butlerian jihad approach to machines, because otherwise, these machines, which are already so much smarter and more capable of us, if, my God, we allow them moral personhood, then we're no longer the apex dignity-holding entity on this planet. And while I don't expect the Catholic Church to be able to metabolize that terribly well or, frankly, I'm not picking on the Catholics here, any organized religion to be able to metabolize that particularly well, just denying that, not engaging with that, I think is not going to work in the long term. Certainly not by the time Claude 17 comes out. Seth (00:18:06): I guess I would say I don't think that's 100% fair to the pope. I think he does have a big take that is related to the questions that you raise, which is he takes a very strong stance on transhumanism. So these questions that you're raising around will we have a machine God? Will we destroy the AI? Will there be some sort of intermediate result? One very common answer to those questions is, is we'll merge with the machine. We'll become immortal, embodied, Ms on the computer, or we'll become physical cyborgs, or we'll use all sorts of advanced eugenics techniques in order to become more than human. And so that is an answer some people have, and the pope very strongly comes out against that answer. He says it is the fact that we suffer and die and have miserable things happen to us and are limited by our nature is what makes us human, therefore do not do transhumanism. So I think you might not agree with the take, but he's got a take. Andrey (00:19:13): I don't think that those are in contradiction to each other necessarily, in that I agree with Alan that I expect a blossoming of new religions to come about, and the existing religions certainly have a commitment to the primary role of humanity as it is today. Seth (00:19:38): Right. Andrey (00:19:38): And so a lot of this document is spent justifying why today's humans are essentially the relevant moral unit. So- Seth (00:19:52): Made in the image of God and all that Andrey (00:19:53): ... humans die, but that's what makes it good in the light of God. I also don't want to be speaking on behalf of the Catholics here. But, that was kind of my sense from it, and it was just don't go for efficiency. Humans aren't meant to be the smartest or the most efficient or anything like that. Even though they're imperfect, that is as it should be. So there is maybe a sliver there where we might have AIs, but as long as they don't have pretense to being moral beings, if they're designed in a way that tries to make them be less like that, then they could coexist with humans in a way that might be satisfactory to the Catholic Church. Seth (00:20:41): Right. And then the natural follow-up question is, okay, all right, if you're going to throw out transhumanism and efficiency for the sake of efficiency, aren't you going to be outcompeted by the groups or the nations that do go full hog for AI- Andrey (00:20:54): Yes Seth (00:20:54): ... transhuman efficiency? And then he's got an answer on that. Do you want to pick that up? Kevin (00:20:59): I wanted to hit on one key point, though, which is a critique that was in the "New York Times" on the fact that, and I found this pretty persuasive, one of the bigger omissions was the lack of specificity around when AI use may constitute a sin, and- Seth (00:21:18): Mm Kevin (00:21:18): ... when AI use may be something that is inherently and definitively bad. And something that I think stood out to me about that was when I talk to people about AI in a moral context and in a setting where we're trying to identify what are those red lines about how and when you use AI, that's where some of the most difficult conversations come up. I love to pose a very dumb hypothetical, which is imagine your best man wrote his best man speech with AI. Are you happy or mad? You're probably pissed off, in my opinion. Now, we could switch it and make the stakes even higher. We could say something like a eulogy. If you found out your eulogy was delivered by your homie, and they're like, "Yeah, I just used Claude, and it was really good." I'm probably rolling in my grave. What are those instances, though, in a moral context? Can you consult ChatGPT as a proxy for your pastor or your religious leader? Can you use ChatGPT for relationships? What kinds of relationships? How far can you take that relationship? And those are some of the questions I find that people have some of the most difficulty resolving, where the contestation by "The Times" was, hey, if there was a question or some lines to be drawn, we would kind of expect that the religious authority would be the one who draws those lines and says, "Yes, this is bad. Yes, this is good." Now you can go forth and use AI in a way that you feel less moral ambiguity. And I just thought that was a really interesting take because we have struggled, in my opinion, about how to draw red lines about when and why AI is used and when and why AI should not be used. Seth (00:23:13): But isn't that the right answer here? The pope notes that the technology is moving fast. It seems like subsidiarity could help a lot with that question. Maybe one region develops the norms to not use the AI for eulogies, and another region develops another norm. Why is that the centrally planned pope should have an opinion on one? Kevin (00:23:34): Well, I think that if we need to have moral clarity as to how and when to use AI, I'm not sure that subsidiarity is going to magically percolate those use cases in a way that is never hyper-relativistic. It's always going to be context-driven, which from the point of a faith, I think if you have no principle other than you do you based off of context, that kind of isn't a faith. Seth (00:23:59): He's got five principles. We can give the five principles he gives. Which are subsidiarity. You're right, I could list them, but one of them- Kevin (00:24:08): But that's not really about AI use on an individual basis. Subsidiarity doesn't change how I use AI Seth (00:24:16): If you're Dario Amodei, it might change the way that you allow people to customize the AI. Andrey (00:24:26): I do think it's very pragmatic of him not to go into specifics, for the reasons stated. But I do think that traditional Christian morality does bear on some of those questions, and in particular, you're not supposed to lie. So if you're giving your best man speech and the AI wrote it for you, that to me seems like it's a lie, no? Kevin (00:24:56): Is it a lie? Seth (00:24:57): Would depend on the norm. Kevin (00:24:59): I don't know. But- Seth (00:25:00): In some places, it'd be a norm to not disclose. In some places, vice versa. Kevin (00:25:05): Well, let's switch to something even easier, which is whether we should make AI universally accessible to everyone, based off of this idea, as Seth mentioned, the universal destination of goods. So for folks who did not spend their entire Saturday or whatever- Andrey (00:25:24): [chuckles] Kevin (00:25:24): ... stationary bike ride reading the encyclical, here's what we are referring to. So this principle, and I'm quoting now from the encyclical, "Reminds us that the Earth's goods, soil, water, air, and natural resources are given by God to the entire human family to sustain the lives of all, and that every person has an inherent right to use of such goods both now and in the future." And the Pope then clarifies, "Certainly, there is a right to private property which has its own specific meaning and purpose, yet it is always subordinate to the universal destination of goods." According to John Paul II, this subordination is the golden rule of social conduct and the first principle of the whole ethical and social order. So that's a bit of a mic drop, or to go back to my earlier- Andrey (00:26:20): [laughs] Kevin (00:26:20): ... refrain, a quill drop by the Pope to say that this is the first principle of the whole ethical and social order. Coming from the Pope, that's a big statement, which is to say making sure everyone has access to these goods, to this knowledge, is incredibly profound to me because, again, if you go and you look at the research Anthropic's done around its economic index report, for example, you'll see whole countries that are just blacked out because there is no access to Claude yet. There is no sort of universal use of AI. And again, as I mentioned earlier, even if it were, the idea that the AI is culturally sensitive, or as robust or as reliable in a certain language, for example. That clearly hasn't been the case so far. And then just to make sure that listeners don't forget the fact that we still have a digital divide, we still have millions, if not billions, of people who don't have access to high-speed internet. So if we are going to realize this idea of the universal destination of, let's say, AI, we are very far behind in terms of just the infrastructure that would be required to make real on that. So I would love the economists' hot takes on this because I'm getting Lockean vibes are coming up. We can Kosian concerns. We could just start name-dropping tons of economists here. What are y'all thinking? Andrey (00:27:57): Well, the first thing is just, yeah, it's a very socialist notion from the Pope, although he tries to thread it with, "We also respect private property." It's a bit weaselly in my opinion, lacking specifics. But I think, in particular with regards to AI, to me, it's a bit of a funny concern. AI is the fastest diffusing technology in the history of the world. It is almost universally accessible. Yes, Claude is blacked out in some countries. By the way, people in those countries have figured out a way to use Claude if you talk to them. Yes, not all languages are equal, but also, we have the best translation tools in history available. I'm not saying there isn't inequality to AI access, but it's actually you can get pretty good AI almost everywhere. And it'll become even more ubiquitous over the coming year, I'm sure. And in particular, Google is essentially serving AI to everyone. So sometimes I just find these concerns extraordinarily contrived. It's people who haven't actually thought about the specifics of the product diffusion, just making pronouncements from their chair about how everyone should get AI, whatever that is. Seth (00:29:15): It's a really special chair, Andre. Andrey (00:29:17): Yes. Seth (00:29:18): It's not just any chair. Andrey (00:29:18): It is. I know. [laughs] Seth (00:29:21): [laughs] I'll second your comments there, Andre. I think you're exactly right about the speed of diffusion. The thing that I would add here is that the new thing that isn't just run-of-the-mill, let's split up the goods equally, is there's a take that he seems to be an innovation. It's unclear if he is drawing on prior teaching here, where he says that extends to immaterial goods as well. That cultural products, intellectual products, are also part of this common universal wheel. And I don't know, that kind of got my Ayn Rand hackles up thinking about if I have an idea in my own head, is that all of society's idea just because I just had it? I think that there is a little bit of an expansive view on what constitutes the goods in the universal destination of all goods here to include things that usually we wouldn't think of as even in communist states, things you would have to share. Alan (00:30:27): Yeah. So look, I'll say, I think the pope is allowed whatever social teaching the pope- Kevin (00:30:33): [laughing] Alan (00:30:33): No, I'm not trying to be snarky about it. I know, I think that you're allowed to be a socialist, you're allowed to be a hardcore capitalist, you're allowed to be anything in between. I certainly don't feel like I have any particular wisdom or expertise to adjudicate between rival conceptions of the political economic good. I think what I think is interesting is to say, okay, let's take this seriously, and let's say that what we do want is this common good type distribution of resources. Does the world of really powerful AGI change the approach to doing that? And I just don't know the answer to that question. But I'd be curious what the economists say. If you are, in fact, a socialist, does the possibility of profound artificial intelligence change what your existing toolkit is? Kevin (00:31:30): Well- Alan (00:31:30): Or should be? Kevin (00:31:31): That's- Andrey (00:31:31): Yeah Kevin (00:31:32): ... I want to hear from the economists, too, but I do want to make sure, I'm going to push back just on the AI adoption and the AI diffusion narrative. I do agree that if you are already on the internet and tech-savvy, you can find a way to use these tools. But I think across the whole of humanity, in a lot of the global majority, there's just still not the infrastructure even in place to do that reliably, and the idea that we'll have the infrastructure in place in the near future to have an equivalent access to AI as someone living in the Bay, for example, is just not going to be the case for years, if not decades, absent some crazy change. So in terms of whether or not this is an actual policy matter, I do think that if your goal is diffusion across a lot of communities, that is a huge barrier, and billions, if not trillions of dollars would have to be spent pretty quickly to actually accomplish that kind of universal use of AI. Aren't we just talking about Starlink plus cell phones? What other technologies, what other infrastructure do you have in mind? Even with that, the idea that is Elon going to make Starlink available for free to everyone around the world? And then- Andrey (00:32:53): Of course not. Kevin (00:32:54): Yeah. Andrey (00:32:55): Well, Google will, and Facebook will, and Facebook is the internet in many places. And not everyone has a smartphone, but almost everyone has a smartphone, and they have Facebook on it. So I've been skeptical of digital divide narratives for the longest time, I think, and they've led to enormously bad policies like investments in extremely inefficient broadband solutions when Starlink makes it all obsolete. Just central planning gone wild, in my opinion. So- Kevin (00:33:25): Which is the pope would back him up on. Pope's anti. This is a whole anti-technocratic centralization overreach essay. This is a very seeing like a state-pilled essay we read. And I think it's really interesting to kind of think about this question which you posed, which is, is there something about this new technology which is kind of essentially more centralizing? And I know a lot of ink has been spilled about, oh, well, here's various ways that AI will tend to allow us to do things independently or in small groups that you would have needed huge teams for, or maybe it'll make us weirder in ways that'll be more diverging and idiosyncratic, and that'll lead to smaller scale groups. But I got news for you guys. My reading of everything together is that at the end of the day, AI is a technology that tends to make more centralized concentrations more efficient. It can process vast amounts of data in order to make more centralized decisions. That was always the critique of centralized decision-making, is that you couldn't process everything. Well, we're starting to get to the place where you can process a lot more. And now I'm not going to argue for central planning, but it does make me think that this is an age coming up where the economic forces will be towards centralization, and that includes big foundation model companies like OpenAI and Anthropic, and it might include companies that we don't even know about yet that'll grow to immense size. But I think that's why it's so important for the pope to be arguing for subsidiarity, not from the perspective of this Hayekian efficiency argument, but from actually, there's some other reason we want to preserve it, and it has to do with dignity. It's not a neo-Brandicean concern. It's not that big companies are essentially evil. It's just that there's something ennobling about decentralization. I think that's particularly compelling, going back to this idea of having some degree of agency in an age in which you may have just a handful of companies dictating incredibly powerful decisions. Just if I accept your scenario, Andre, for example, where it's just Starlink and Meta and Google offer you internet, Apple gives you a phone, and then you're running OpenAI or Anthropic's AI. And so now you just have seven companies who dictate kind of the entirety of your economic existence, if not your informational existence. How then, to Seth's point, do we maintain some degree of, I have a degree of control over my own future and well-being in a world in which seven companies shape the ins and outs of how you wake up, what you do for work, what you read, so on and so forth. Maybe faith and a connection to humanity and an emphasis on agency is the only thing you can kind of stress to make people feel like they have some meaningful role in shaping their lives. Andrey (00:36:37): Yeah Seth (00:36:38): They also serve what we can like- Andrey (00:36:39): So it's a pretty nihilistic take. I don't feel like I don't have agency over my life just because I use Google and Apple products. Those are products, those are bicycles of the mind. I can choose how I use them. Seth (00:36:54): I'm with... Yeah. Andrey (00:36:55): And I think that's true for AI models. Look, I understand that they have values baked into them and so forth, but in the end, I can do a variety of things with them across any ideological spectrum that I can practically think of. Yes, there are subtle biases and nudges and so on, but I don't feel like I've lost agency due to AI. I feel like I've gained agency due to AI. So I just think that this is a bit of a hypothetical, honestly. Alan (00:37:26): Well, isn't all of this a hypothetical? That way [chuckles] we've- Andrey (00:37:31): No, but it is just like this loss of agency. I found one part of the essay interesting, which is the part about the labor market. So he says: "The labor market is one area in which the risks associated with new technologies more clearly emerge. It is thus necessary to remember that economic freedom is not absolute. It must be measured against the common good and the dignity of every person." Blah, blah, blah, blah. "This is possible when it recognizes the creation of dignified, valuable jobs are an essential part of its proper service to society." Seth (00:38:09): Right. Andrey (00:38:09): So I think maybe more speaking to this point of dignity, the pope is arguing for a make work program, like in the style of the New Deal, to give people jobs just so they feel dignity. Seth (00:38:21): The pope likes free distribution. The pope is not sucking wealthier theorem pilled. Alan (00:38:26): No, and look, just to defend the pope for a second, I think his instinct that work is an immense source of dignity and that once people's material needs are met, as they are increasingly in the not just developed world, but in the developing world, that these questions of dignity and entity become really... They sort of begin to hedonically dominate. They really are where you get a lot of your utility from. The idea that AI will replace all potential, certainly white-collar work, is a huge threat. The problem is, what do you do about that? And that's where I feel like the insufficient AGI pilled-ness is the problem here. Seth (00:39:09): Right. Alan (00:39:09): Because a world in which people who use AI can outcompete by a factor of 10 or 100 or 1,000 to one, the economic productivity of those who don't, is not a world in which make work is going to work. I feel like people get very excited about the three make work jobs created during the New Deal, and then they're like, "Okay, that's an actual thing that can get done." Make work, paying people to dig ditches is not a thing that is sustainable, and people also see through that. So the real question is, I think not even an economic one- Seth (00:39:43): It's not sustainable in a dignified sense, right? The idea is there'd be so much income Alan (00:39:47): It's not sustainable in a dignified sense. Well, also, people just won't do it. Governments just won't tolerate this indefinitely. And so the question is- Seth (00:39:55): Well, this is a real politic. Wait, no. Wait, I want to understand why you don't think it's sustainable. So I can understand why it wouldn't be sustainable in a dignified sense if you like, "Oh, it's actually make work, so I'm not getting dignity from it." But the idea is in this AGI scenario, there'll be so much income that we can support people who aren't actually contributing. Alan (00:40:11): Yeah. I guess that's right. Seth (00:40:13): Are you saying that like we'll- Alan (00:40:14): No, I think no. So I'll take that back, because you see that a lot in petro states where a lot of the economy is propped up by sort of this kind of pointless public sector, this pointless bloated public sector. So fine, let me just going to go back to my dignity point, which is to say, at some point, people start seeing through it, and then the question is the one that I feel like the pope's encyclical keeps pushing off, and we're like, I'm actually more interested in the pope's kind of... I want the pope's sort of Catholic existential analysis more than I want the sort of political economy analysis, which is, just go back to the original question. In a world where we are no longer the most useful, most intelligent beings, how do we deal with that in a way that is productive and in a way that allows us to save face? And there are models- Seth (00:41:10): Yeah Alan (00:41:10): ... that we could look to. Chess, and I think chess is always an interesting example here. For a long time, machines couldn't play chess, and then there was a time when machines could beat humans, famously Deep Blue and Garry Kasparov, and then for about a decade, you had what was called the centaur era, where machines plus humans, these centaur teams, were the best, and then at some point, the humans just started causing problems. And so humans contribute nothing at this point to chess. I'm pretty sure an iPhone can defeat Magnus Carl... Like an old iPhone can defeat Magnus Carlsen at this point. And yet chess has never been more popular, and people watch Magnus Carlsen. Seth (00:41:49): Right. Alan (00:41:49): So there are examples where we have worked through the existential malaise of we're not the best anymore, but we still want to see humans do it. What I'm very curious about and where I think religion could play a really useful role is going domain by domain and helping us get to the other side of that transition. But to do that, you do have to take it seriously. You can't just continue to pretend that we are the best. No, no, we're not the best. That's the whole point. How do you live an existentially meaningful life when you're no longer the best? That's the question that I think is the most interesting one. Seth (00:42:25): Well, and to add on to that, too, because I think that we need to distinguish, going to Andre's point earlier, between freedom and control. I think that there's a vast difference between freedom to Kevin (00:42:37): Use AI to do anything, to look up anything, to pursue boundless knowledge, to, in theory, create anything or do anything, versus actual control over the infrastructure and the decisions and the entities that are shaping the preponderance of governance and the shape of the economy itself. And so you can have freedom, and you can have an increase in freedom, but you can have a decrease in control in terms of your actual ability to shape broader circumstances around you. And I think it's important not to confuse the two, because absent having some degree of control over those meta constraints, over those larger aspects of your life- Andrey (00:43:17): Mm-hmm Kevin (00:43:17): ... I do think it's hard to feel a sense of dignity, right? If you're born and to go to this make work idea, and you're told, "Hey, you have one of three jobs. Hey, great, you have freedom to choose your one of three jobs. Enjoy whichever. You can dig a ditch, or you can dig a well, or you can be in charge of high fives. But those are your three jobs." Andrey (00:43:41): [laughing] Kevin (00:43:43): Freedom, yay, but no real control over the nature of your life or the lives around you. And so I think that's the sort of dignity plus or- Andrey (00:43:52): So I guess- Kevin (00:43:53): ... plus Andrey (00:43:55): ... I guess I have a question for you. So suppose that we use democratic mechanisms to govern AI labs. So people elect representatives, the representatives sit in a House of Representatives at Anthropic. They vote on the latest constitution. Do you think that people will actually feel more in control of their lives that way? Because to me, it's not very obvious, right? Seth (00:44:24): Congress is not glowingly reviewed by US society. Kevin (00:44:29): I think that if you give an American an outlet, they'll feel more control. That's my attempt at a riff of If You Give a Mouse a Cookie. Look at how- Andrey (00:44:40): [chuckles] Kevin (00:44:40): ... folks are making- Andrey (00:44:41): Yeah, those books don't tend to end well. I have two small children, and I can tell you, those books are not optimistic stories. Those books are psychological horror films- Kevin (00:44:50): Oh, Jesus Andrey (00:44:50): ... in miniature. Kevin (00:44:51): We'll save that for our next episode. But I think that if you look at how people have made use of town halls and permitting processes right now- Seth (00:45:03): Right Kevin (00:45:03): ... with respect to data centers, I think those people would feel like they are actively shaping, and they are actively shaping the AI infrastructure build-out. So I do think if there were an avenue for people to feel as though they or their neighbor could participate, that would meaningfully change how they felt about AI. Whereas right now, the absence of- Seth (00:45:26): But it's bad. They're using their power for bad. [laughs] Kevin (00:45:30): That's a different point. On the question of whether they would feel like- Seth (00:45:34): Well, I think if we're doing political... I guess what I would say here is, I agree with the previous argument that this is an essay that's about the political economy and not about the existentialism. And so, yeah, if you're going to talk about the political economy, you should care about the economy part of the political economy, right? You want to get the voting power mixed the right way to create the prosperity that makes doing what you want to do possible. I think the pope would've been in an excellent position to do what you described before and talk about how a monastic life could be a model for thinking about what an AGI age would look like. Or, I think about the Messianic age in Judaism, where it's kind of a post-scarcity society, and everyone's devoting themselves to Torah study and mitzvah or whatever, right? That would've been a really interesting essay. This is a political economy essay. And then I do think you have to take a stance on, well, we should maybe put the power in a place where people are making better decisions. Kevin (00:46:37): Well, I think- Andrey (00:46:39): Well, there is an inevitable efficiency trade-off, right? If we think that the ASI is going to be very smart and is going to be well-aligned, the well-aligned is the questionable part, maybe. But if it's well-aligned, then we know the masses have their issues in terms of making decisions. And so- Seth (00:46:58): Is that a Catholic pun, masses? Andrey (00:47:00): [laughs] So yeah, we could delegate to the ASI to make our decisions. And I think this is, I guess, what it's being warned against, is it doesn't matter if the ASI is more efficient, that just the very fact that humans are in the loop in a real way is- Kevin (00:47:20): Well, at 42,000 words- Seth (00:47:21): That's the essay we got. Yeah. Kevin (00:47:23): We could keep going until eternity, religious pun intended. Andrey (00:47:30): [laughs] Kevin (00:47:30): But assuming that we're not going to, why don't we transition here? Seth (00:47:34): [upbeat music] For those of you playing along at home, now is your chance to think about how this conversation has changed your priors. This chance to contemplate your posteriors is sponsored by Revelio Labs. Revelio Labs is a leading provider of labor economics data and data services for companies, academics, and independent researchers. Andre and I have been working in economics of AI, digitization, and automation for a long time, and we can confirm just how useful Revelio's data is. Revelio's team combines comprehensive micro-level data on employee professional profiles, job postings, and employee sentiment with standardizations, mappings, and enrichments available, all to make that data useful without making your modeling decisions for you. The data can be flexibly aggregated to company, market, or industry, and can be used to study questions ranging from career trajectories to occupational transformation, to the returns to skills, and the impact of AI on labor demand for tasks. Can't imagine anyone who would be interested in that. And Revelio data is available on WRDS. Kevin (00:48:39): So if you're an academic with a good library, go see if you have access to their premier data already. And if you don't, you can reach out to their excellent economics team, and they'll hook you up. Andrey (00:48:49): Ooh, the next piece that we're considering is this piece by DeepMind, various DeepMind authors, Positive Alignment. Kevin (00:48:57): Including the friend of the show. Andrey (00:49:00): Yeah. Seb Krier, who refused to take ownership of this paper. [laughing] But yeah. Alan, Kevin, what'd you think? Kevin (00:49:12): I'll let Alan start. I've been talking for way too long. Alan (00:49:17): Yeah, look, I think it's an interesting point, and the inner social psychologist in me likes it, and thinks that the positive psychology turn should apply to alignment as well. And then the old school, Cold War liberal in me gets very nervous about these kind of positive conceptions of human flourishing, right? So, there's this idea that post-war liberalism became what's sometimes called the liberalism of fear, which is the idea that in the wake of all the totalitarian ideologies of the 20th century, Western liberals retreated to a kind of defensive crouch, where the point of liberalism was just to prevent the worst excesses of totalitarian and authoritarian ideologies. And you're not supposed to look to liberalism, or frankly any political philosophy, for a sort of a positive conception of the good. Kevin (00:50:20): And that's also Rawls, right? Alan (00:50:22): Yes. This is related to Rawls's idea of political liberalism. This has become unfashionable lately because something, something neoliberalism, something, something. But I still think there's a decent amount of wisdom in that. And so, whenever I read these proposals that AI systems should promote human flourishing, which to have bite is always... Which only has bite- Kevin (00:50:53): On the other hand Alan (00:50:54): ... if the systems are not doing what their human wants them to do in that moment. Otherwise, none of this would matter, right? That would just be covered by regular alignment. I get a little nervous. So again- Kevin (00:51:07): I wish the essay even got there Alan (00:51:08): ... at a high level of abstraction, these are my trade-offs. Sorry, what? Kevin (00:51:11): [laughs] I wish the essay even got there. If the essay had made the point we need a thicker notion of the good in order to blah, blah, blah, it would've been something fun to argue about and get antsy in our liberal pants about. But it equivocates so much, and it's so wishy-washy about, "Yeah, we want a thicker notion of the good, but there are so many different notions of the good." So, I don't even think it ended up making me afraid as a liberal. Alan (00:51:37): Yeah. Andrey (00:51:37): I don't know, Kevin, do you have anything to add? Kevin (00:51:39): Yeah. For me, this kind of does tie neatly, at least in part, about what we were discussing with the encyclical, which is we also still need a idea of what it does mean to positively align a model to anything, right? The selection of your social welfare function about whatever your objective is going to be is a really, really hard task, and I don't think that we necessarily know, again, to bring in the prior discussion as well. The idea of what the positive outcome is in any context that you are training a model on or setting a model to might vary wildly from one culture to another, and it may change- Alan (00:52:25): Right Kevin (00:52:25): ... over time. And so I appreciate the idea that we should not only look to avoiding negative outcomes and instead try to train models for some degree of positive behavior. But this brings to mind that folks, when Amanda Askell came on scaling laws and talked about Claude's Constitution, many people were not pleased when she said that her ideal was Claude acting like a good neighbor or a good traveler. Folks did not like that as the ambition for Claude. And I'm not saying that's good or bad or otherwise, I'm just saying it's really, really difficult to try to even encapsulate, envision what should a model do. What should that positive alignment be? What does human flourishing- Alan (00:53:16): Right Kevin (00:53:16): ... even look like? And that, to me, is where having more granular ability to train models will be profoundly important. Or at least to use some system prompt that directs your model quickly to whatever your community or your conception of human flourishing looks like, so that we can have that broader kind of polycentric approach to aligning models. But I don't think there's ever going to be one single approach, which is really difficult. Andrey (00:53:51): Yeah. I totally agree with you. I think one of the things much of this discussion misses or it's swept under the rug is that they want to pose this as some philosophical dispute, but it is an empirical question. As someone who studied digital platforms for my entire career, whether something is good or bad, in many ways evaluated by running an AB test and seeing whether the outcomes that you're measuring are improving. And it's completely not obvious ex ante. And I think with anything in such a constitution, that's also likely to be the case. Now, I understand why you would take this approach before you have the data. You have to take a stand on some of this stuff. But in the end, I think a lot of this is an empirical question in addition to a philosophical question. I think I have a broader take on this work is I have no idea who the audience for this is. And I feel like DeepMind puts out a lot of these papers. They're very general. You can see that they're citing a lot. They're citing everyone, and everyone is, we have RLHF, and we have supervised- Seth (00:55:01): 10 co-authors Andrey (00:55:01): ... fine-tuning. Yeah. And let's put in a lot of people on this paper to say a bunch of vague stuff that means nothing. They don't take a stand on anything, essentially. And then people laud this sort of stuff online, I think because they read the abstract. It's like, "Oh, yeah, vaguely I agree that we should try to have the models help human flourishing." But then when you read this paper, even though grammatically it's correct, it is vacuous. It is such a waste of time for them to write it, or alternatively, I have no idea who the audience of this thing is. And this is not only criticism of DeepMind. So many of these AI policy people write this stuff. And I just found it just to be deeply uninteresting. Yeah. Seth (00:55:58): It's for people who don't own a thesaurus and want lots of synonyms for flourishing in different languages. I have to say, reading this after the pope's encyclical made me more negative on the pope's encyclical. Because in it, he says something like, "No technology is neutral. We need to design technology from the ground up in the labs to have all of these five Catholic principles." And I read this, and it's like, do not let the computer scientists do ethics because they're bad at it. [laughs] Kevin (00:56:29): So what would be your conception of a useful, positive alignment paper? If you were to have a one-on-one with these authors and say, "Y'all, look, A for effort, F in execution," based off of what I've heard. Maybe you all assign slightly different grades. What is your feedback? Speak to us, the AI policy community. How can we now go to our friends and say, "Hey, y'all, we talked with Seth. We talked with Andre. Apparently, we're getting it wrong. We're not doing it well." What is the feedback? What can we do to be better? Andrey (00:57:10): Take a stand. If you think that you want a particular conception of positive alignment, tell me what metrics are indicative of that, and then propose a methodology, even if it's not immediately actionable, for how to measure whether a system is pushing us in that positive direction. Maybe do post-interaction surveys with users to see how satisfied they feel or how happy they are, and then you put them in an A/B test and compare different system prompts or... I'm just spitballing here. But give me something actionable instead of giving me a list of vagaries and then it's not even clear in this paper whether they consider that currently labs are already doing positive alignment or whether that's something that's new that needs to be done. Because they list a bunch of things in there that already sound a lot like positive alignment to me. Seth (00:58:07): Right. Andrey (00:58:08): So, yeah. Seth (00:58:09): You guys talked to Askell, which I'm so jealous of. We read the Claude Constitution, and that's where I want people thinking about. That's the actual principles that we're building AIs on right now. What would it look like to get a different team together to have its own hierarchy of values in the Claude Constitution? Can we think about other ways of adjudicating whether an AI is following its principles? Yeah, because like Andre says, if you read the Claude Constitution, there's plenty of positive goals in there. Kevin (00:58:42): Well, so this goes to my point earlier, where I do think that constitutional AI presents a really interesting nexus for public engagement and public participation that we have be the mechanism by which people do feel like they're in control or have some degree of oversight with respect to AI because even if it were- Andrey (00:59:08): Mm Kevin (00:59:09): ... a citizens assembly of 1,000 people across the US who are engaging, let's say, with Claude in shaping Claude's constitution with one another, so on and so forth. Well, now if I'm like, "Well, hey, I know my buddy Alan was a part of the latest constitutional convention for Claude," and I told him, "Hey, bro, you better make sure that Claude's a little bit more of a fan of the Texas Longhorns or whatever." [laughing] Seth (00:59:36): Principle five. Kevin (00:59:37): Maybe now I'm like, "Hey, at least I had some chance of influence," or I know somebody who influenced that person. Whereas right now, I don't think most folks even know anyone who lives in San Francisco because no one can afford it, with the exception of you, Andre, which I'm pleased to hear you're in town. [laughing] But right now- Andrey (00:59:58): What are your secrets, man? Kevin (00:59:59): Yeah. Andrey (01:00:00): I'm on leave at Amazon. I think it's not a secret. Kevin (01:00:02): Yeah. That helps. [laughing] But so that, to me, is a promising vehicle by which we can use that mechanism of what does positive alignment mean in practice. I like your point, Andre, of like, "Hey, go present some scenarios. Go do that A/B testing of saying, how do you want Claude to respond to this very difficult, let's say, even democratic context. Should the president or should the president not invest in this stake in Intel?" Big, huge question. Let's see what people say. Let's see what values we can then deduce from their answers, right, and have that sort of inverse constitutionalism, which could be really interesting. But- Seth (01:00:47): Yeah, there was the MIT Moral Machine Project where they were doing... Did you ever see this? They did millions of trolley problems with people decentralized to see, would you rather run over one grandma or two criminals? Have you seen this? Kevin (01:00:59): No, I'm going to have to check it out. Seth (01:01:01): All right. You have to look that up. Kevin (01:01:02): How many criminals guys versus grandmas? Seth (01:01:05): Exactly. Well, it's the ratio. Can I actually ask you guys a question? This is kind of my big law question I was hoping to get an insight on, which is to us, this constitutional approach seems really promising, but to what extent do you think the constitutional part of constitutional AI, does that metaphor really hold versus where does that metaphor work versus where does it break down compared to something like the US Constitution? Kevin (01:01:34): Well, I'll start briefly by saying Anthropic has been clear, in defense of Anthropic, to say they do not intend this to be a constitution qua the US Constitution, something that evokes the same legal understanding. So it's important to note that the labs have tried to distance the exact mapping of, let's say, the frontier safety framework or the model spec or the Constitution to a legally binding document. Now, with that said, I think that the idea that we're going to have high-level principles and values instilled within something that's going to have an enduring and stable influence on something makes a heck of a lot of sense to me in terms of a constitutional analogy there, which is to say, hey, if you actually go read the US Constitution, there are a lot of huge open questions, which is the reason Alan and I have jobs- Alan (01:02:33): [laughing] K

Seb Krier on AGI, the Coasean Singularity, and EDM

Seb Krier on AGI, Scaffolding, and Coasean Bargaining at Scale In this episode of Justified Posteriors, we welcome Seb Krier [https://x.com/sebkrier] — policy lead for AGI at Google DeepMind and excellent Twitter poster. Speaking in his personal capacity, Seb walks us through his understanding of AGI, why AI alignment has gone better than expected, the potential and limitations of a world where agents constantly barter on our behalf, and — of course — electronic music. We also cover AI in London vs. New York, how Seb went from reading Marginal Revolution for 15 years to becoming a recurring character on it, and Seb’s side-splitting humor on mediocre AI conferences. Related Links * Seb Krier on X: @sebkrier [https://x.com/sebkrier] * Seb’s Substack, Technologik [https://technologik.substack.com/] * “Coasean Bargaining at Scale” [https://blog.cosmos-institute.org/p/coasean-bargaining-at-scale] — Seb’s essay at the Cosmos Institute (also republished here [https://www.aipolicyperspectives.com/p/coasean-bargaining-at-scale]) * “Musings on Recursive Self-Improvement” [https://technologik.substack.com/p/musings-on-recursive-self-improvement] — Seb’s essay separating model-side RSI from societal-side * “The Cyborg Era: What AI Means for Jobs” [https://aleximas.substack.com/p/the-cyborg-era-what-ai-means-for] — Seb’s guest essay on Alex Imas’s Substack, defending the scaffolding view * Anthropic’s Project Deal [https://www.anthropic.com/features/project-deal] — the agent-bargaining experiment among Anthropic employees * Fradkin & Krishnan, “MarketBench” [https://andreyfradkin.com/assets/marketbench.pdf] — Andrey and Rohit experiment of LLMs bidding in procurement auctions as an investigation of the future of AI marketplaces and the companion writeup: Rohit Krishnan, “Agent, Know Thyself! (and bid accordingly)” [https://www.strangeloopcanon.com/p/agent-know-thyself-and-bid-accordingly] * Edge Esmeralda [https://www.edgeesmeralda.com/] — Devon Zuegel’s pop-up village in Healdsburg, CA * MATS [https://www.matsprogram.org/] — for junior economists looking to skill up on AI safety/governance * Cosmos Institute [https://cosmos-institute.org/] and FIRE [https://www.thefire.org/] * bianjie.systems [https://bianjie.systems/] — the art platform Seb is co-organizing a dinner with in NY (Seb’s announcement [https://x.com/sebkrier/status/2054941198406602861]) * Drexciya [https://en.wikipedia.org/wiki/Drexciya] — James Stinson, Gerald Donald, and the Detroit electro-afrofuturism canon Timestamps (00:00) Intro (01:16) What is AGI? (07:30) In defense of scaffolding — Hayek, division of labor, and why one giant model won’t do it (13:00) Markets for cognition: will agents bid in procurement auctions? (18:40) Recursive self-improvement — separating the model side from the societal side (24:44) Alignment has gone better than 2017-Seb expected; prefer “intent following” (31:14) What economists should actually work on to inform AI labs(33:32) What does a DeepMind policy lead’s day look like? (38:20) AI Conferences(41:52) Coasean bargaining at scale — the positive vision(55:00) Inequality, property rights, and who gets the initial allocation (01:03:00) The Helldivers 2 “Managed Democracy” dystopia as Coasean bargaining gone wrong (01:09:00) Sponsor: Revelio Labs (01:09:30) Lightning round Justified Posteriors is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber. You’re also invited to our discord community at: https://discord.gg/b8VpPbBUt Transcript 00:00:00,100 --> 00:00:20,480 [Seth] [upbeat music] Welcome to the Justified Posterior’s podcast, the podcast that updates beliefs about the economics of AI and technology. I’m Seth Benzell, the number two biggest fan, after Tyler Cowen, in the Seb Krier fan club. 00:00:20,480 --> 00:00:20,740 [Andrey] [laughs] 00:00:20,740 --> 00:00:24,660 [Seth] Coming to you from Chapman University in sunny southern California. 00:00:24,660 --> 00:00:34,120 [Andrey] And I’m Andrey Fradkin, coming to you from San Francisco, California. And Justified Posterior’s is sponsored by the fine folks at Revelio Labs. 00:00:35,560 --> 00:00:45,600 [Andrey] We’re very excited to have Seb Krier here with us today. He is the policy lead for AGI at Google DeepMind, and is, 00:00:46,840 --> 00:00:52,400 [Andrey] dare I say, a thought leader in this space. Welcome to the show, Seb. 00:00:52,400 --> 00:00:54,200 [Seb Krier] Thank you very much. It’s great to be here. 00:00:55,380 --> 00:00:58,160 [Seb Krier] Yeah, I’m Seb, calling in from New York. 00:00:58,160 --> 00:01:00,320 [Andrey] And we should remind our listeners that 00:01:01,340 --> 00:01:08,410 [Andrey] Seb is, during this podcast, expressing his personal opinions, and is not speaking on behalf of DeepMind. All right. 00:01:08,410 --> 00:01:09,740 [Seb Krier] Indeed. [laughs] 00:01:09,740 --> 00:01:11,060 [Andrey] [laughs] 00:01:12,780 --> 00:01:13,900 [Andrey] The usual caveat. 00:01:15,260 --> 00:01:16,760 [Andrey] Seb, what is AGI? 00:01:18,080 --> 00:01:19,450 [Seb Krier] What is AGI? [laughs] 00:01:19,450 --> 00:01:19,570 [Andrey] [laughs] 00:01:19,570 --> 00:01:19,580 [Seth] [laughs] 00:01:19,580 --> 00:01:19,780 [Seb Krier] Great question. 00:01:19,780 --> 00:01:21,900 [Andrey] We’re going to start with the big questions. 00:01:21,900 --> 00:01:22,880 [Seb Krier] Yeah, might as well. 00:01:24,259 --> 00:01:54,840 [Seb Krier] [sighs] I think there’s so many definitions out there of what AGI is, and I think most of them are kind of unsatisfactory in one way or another. I’ve seen stuff like many definitions are indexed on the societal transformations or economic impacts of the technology, which I don’t really like very much because it makes it very dependent on external factors whether or not we have AGI. If it’s banned, we don’t have AGI, and if it’s not banned, we have AGI. Is it? 00:01:54,840 --> 00:01:55,480 [Andrey] [laughs] 00:01:55,480 --> 00:02:04,670 [Seb Krier] And there are other tests, like if an AI makes $1 million or something, which I find is very weird because most humans do not make $1 million in the first place. 00:02:04,670 --> 00:02:05,080 [Andrey] [laughs] 00:02:05,080 --> 00:02:11,359 [Seb Krier] So the one I kind of like is actually Shane Legg’s definition- 00:02:11,360 --> 00:02:11,620 [Andrey] Mm 00:02:11,620 --> 00:02:12,420 [Seb Krier] ... who’s at Deep Mind, who is 00:02:13,640 --> 00:02:16,980 [Seb Krier] more of a capability-based definition, which is something along the lines of 00:02:18,420 --> 00:02:20,960 [Seb Krier] an AI or a system that does most 00:02:22,380 --> 00:02:30,360 [Seb Krier] standard cognitive tasks that people typically do. [lips smack] So it’s kind of the bar isn’t too low, and it’s also not too high either. 00:02:32,220 --> 00:02:35,480 [Seb Krier] And so I think he’s got this definition of a minimal AGI, 00:02:36,580 --> 00:02:43,020 [Seb Krier] and I think that we’re not exactly there yet. I would disagree with people saying that we have AGI today because I think 00:02:44,220 --> 00:02:48,900 [Seb Krier] a lot of the systems we have, there’s many things that a human can do that they don’t really do very well. 00:02:48,900 --> 00:02:50,360 [Seth] What’s the biggest gap that we’re missing? 00:02:52,020 --> 00:03:47,740 [Seb Krier] I’d say there’s a few. One of them might be continual learning, or at least the ability to adapt and learn over time, and in different contexts and situations, just kind of update your own world model or whatever. If I think of a new joiner in a company, they’re not super useful the first day, but their value goes up over time because they learn all sorts of things. And so [lips smack] that might be one of them. A lot of the systems we have today, I think, are not very good at software, and you’re using graphical user interfaces and software and whatnot. If I ask an agent right now to go and use a music production software and make a track, I think they’d generally struggle. That doesn’t mean it’s impossible to solve or anything like that, but I think, in many respects, they’re not as general as you’d want them to be. And then the other bit also is, [lips smack] and of course they still make some silly mistakes here and there, but I think that’s getting it fixed. But the creativity point is one that I’m really interested in as well, in that I think they’re really good at kind of 00:03:48,780 --> 00:04:02,700 [Seb Krier] exploiting maybe an existing paradigm or an existing knowledge and so on, and recombining knowledge and whatnot. But I think really coming up with new concepts and abstractions entirely is something I think humans can do, but I don’t see our current systems really doing either. 00:04:02,700 --> 00:04:10,060 [Andrey] How do you measure whether humans can do creative tasks? One of the things that 00:04:11,200 --> 00:04:15,940 [Andrey] strikes me as a bit of an unfair test in that, 00:04:17,060 --> 00:04:23,290 [Andrey] let’s say you ask an LLM to write a poem or to write a story. It’s very- 00:04:23,290 --> 00:04:23,290 [Seth] [laughs] 00:04:23,290 --> 00:04:32,050 [Andrey] ... times more entertaining than what a random human would write. So, do you have a benchmark for creativity? 00:04:32,050 --> 00:04:35,390 [Seth] This is the meme where the robot asks Will Smith if he can compose an opera. 00:04:35,390 --> 00:05:14,700 [Seb Krier] [laughs] Can you? Yeah, exactly. It depends, and you’re right. Obviously, most people aren’t creating new abstraction and concepts on a day-to-day level. But I imagine there’s still something qualitative about that kind of creativity that I think does get applied in everyone’s day-to-day life in various kind of ways. Maybe they’re not as big or significant as creating a symphony. But I don’t really have a strong test. There’s actually an interesting podcast that had Ben Goertzel and Yoshua, I think a few years ago, where they were saying something like, if you had a model that was trained knowing only classical music and West African drumming, could it come up with jazz in the first place, or recreate jazz? 00:05:16,460 --> 00:05:27,880 [Seb Krier] And I quite like that test. And in principle, I can imagine it being possible. You could kind of decompose all sorts of different kind of elements and variables here and just get something jazz-like. But it still feels a bit... 00:05:29,580 --> 00:05:40,580 [Seb Krier] It’s not the same as just coming up with the idea of jazz in the first place and saying, oh, I’m going to try these things out. And for whatever reason, I’m going to stick to that. And I don’t know. It’s- 00:05:40,580 --> 00:05:53,190 [Seth] Recombination versus paradigm shifting. I’ve also heard one test people would want for AGI is, can you train the model on the 1900s corpus and it comes up with Einsteinian physics? 00:05:53,190 --> 00:05:53,200 [Seb Krier] Yeah. 00:05:53,200 --> 00:05:54,720 [Seth] That would be really impressive. 00:05:54,720 --> 00:06:36,151 [Seb Krier] Yeah, I think actually Demis uses that test sometimes, or I think Pele Gritzer as well mentioned it before. And there are some people, I think David Duvenour and Nick Levine, I think, had this recent kind of language model talky that was trained up in, I think, the 1930s or something. And I tried to play around with it a lot. It was like, let’s try to get it to create something new, and it’s pretty tricky. Although they have apparently recently, some people kind of fine-tuned it on a very few examples of coding and gotten it to be good at coding. But for some reason, that doesn’t impress me maybe as much as other things I would’ve expected. It’s like [laughs] there’s the-I agree that the goalposts also kind of move a little bit over time, and it’s also maybe unfair of me. It’s like, oh, well, can it create a new programming language from scratch or something? 00:06:37,272 --> 00:06:43,052 [Seb Krier] So it’s a tricky one to kind of square off, but it does still feel like there’s a lack of that kind of true creativity, at least in my 00:06:44,212 --> 00:06:45,072 [Seb Krier] interactions with them. 00:06:46,392 --> 00:06:57,342 [Andrey] I am really worried that it is a goalpost moving exercise here. We don’t have a benchmark for creativity and therefore, 00:06:58,432 --> 00:07:03,211 [Andrey] all these claims are not quantitative in a way that I’d like. And let- 00:07:03,212 --> 00:07:10,612 [Seth] Right. What about all those IS papers we see where one of the axes is creativity and we instrument for something? [laughs] 00:07:10,612 --> 00:07:11,032 [Andrey] Yes. 00:07:13,132 --> 00:07:13,592 [Seth] There’s a lot of bad measures of creativity. 00:07:13,592 --> 00:07:19,762 [Andrey] Those are not creative, to be clear. I’m sure I’ve offended a ton of people. Sorry. 00:07:19,762 --> 00:07:20,992 [Seth] It’s okay. 00:07:20,992 --> 00:07:56,432 [Seb Krier] I think it’s fair. I agree that it’s a bit like... But I still feel like there’s, at least if part of the reason you’re going to create these systems is to come up with kind of also new sorts of theories and so on. And I think you can probably get that through good search and a lot of inference compute and trying out lots of different things. And I think there are many low-hanging fruits there, to be clear. So it’s not like I think, oh, we’ve hit some sort of wall or something. And I think there’s a lot that you can kind of get in terms of new knowledge and new creative knowledge from that. But I feel like there’s maybe something more needed. It’s maybe not that kind of magical or anything, right? Maybe you just need better scaffolding or better multi-agent systems. But 00:07:58,992 --> 00:08:02,072 [Seb Krier] yeah, at least so far, I would say that I see a bit more creativity, say, in 00:08:03,652 --> 00:08:11,612 [Seb Krier] humans so far as a collective. And maybe that’s, again, an unfair comparison. You don’t have a culture of AIs and AGIs to compare that against. So- 00:08:11,612 --> 00:08:11,682 [Andrey] Yeah 00:08:11,682 --> 00:08:15,092 [Seb Krier] ... the right comparison is also a hard one to do. 00:08:15,092 --> 00:08:52,772 [Andrey] So, you mentioned scaffolding, and I guess a question, you recently wrote about a defense of scaffolding, and I think just to frame things, some people you talk with, especially very AGI-pilled people, are like, “Scaffolding, it’s an epiphenomenon. It doesn’t matter. In the end, we are going to train a smarter model with more parameters and more training data, and it’s just going to do it out of the box. And so all these scaffolding hacks are just very temporary.” And then other people like yourself, I guess, argue the opposite. So what do you think about scaffolding? 00:08:54,832 --> 00:08:55,052 [Seb Krier] Yeah. 00:08:56,572 --> 00:08:59,372 [Seb Krier] The first thing is I’m definitely not sure. This is kind of 00:09:00,532 --> 00:09:39,672 [Seb Krier] one of many hot takes, but I think, I guess there are a few reasons why I see it as, I think it’s going to stay over time. The first is that I think it’s plausible that as, I think scaling laws continue, I think you scale models and they get better over time and so on, but I think the inputs are expensive and grow over time. And I also think that it’s plausible that you might get more and more diminishing returns over time. And if that’s the case, I see the kind of utility of the scaffolding side and the harnesses as going up because you’re going to want to make more, you’ll want more bang for your buck kind of thing. You’re going to want to extract this intelligence and use this resource as efficiently as possible. 00:09:40,772 --> 00:09:51,532 [Seb Krier] So that’s maybe one reason. The other one is a bit more, I guess, Hayekian in nature or something, in that I see a lot of, I think there’s a lot of local knowledge, a lot of 00:09:53,212 --> 00:10:18,592 [Seb Krier] stuff that isn’t necessarily kind of codified. And I don’t really see one big giant AGI model now kind of perfectly guessing everything forever at infinite scales. And in a way, I see this as a little bit like a division of labor in that I think it’s actually more efficient to have this kind of integration layer that is closer to the local information or to the ground or to demand side that can better integrate this kind of cognitive resource 00:10:19,812 --> 00:10:23,632 [Seb Krier] to satisfy and create value and satisfy whatever consumers and businesses want. 00:10:25,552 --> 00:10:31,352 [Seb Krier] So to help with all the sorts of constraints and the context they’re dealing with, I think it’s very useful to have that. 00:10:33,712 --> 00:10:39,112 [Seb Krier] Of course, I don’t think this necessarily also implies or means that you’re going to get complete, full decentralization or something. 00:10:40,772 --> 00:10:42,212 [Seb Krier] Walmart gets huge 00:10:43,872 --> 00:10:48,872 [Seb Krier] returns from the scale that they have, and you don’t have loads of businesses downstream kind of reselling their stuff. 00:10:51,252 --> 00:10:53,932 [Seb Krier] But there’s two things. The first is that- 00:10:53,932 --> 00:10:56,812 [Seth] We have bodegas reselling stuff from Walmart on the corner. 00:10:56,812 --> 00:11:18,992 [Seb Krier] Actually, that’s a good point, yeah. And also, there are all sorts of other businesses kind of selling different things, right? If the task is generic and the demand is homogenous, then sure, maybe you can do more of that. But also, even Walmart relies on all sorts of kind of suppliers, local labor, compliance system, inventory systems, third parties, and whatnot, that help with this kind of integration and the delivery of these services. 00:11:18,992 --> 00:11:25,862 [Seth] So if I may summarize your answer, you’re very Hayek-pilled, but maybe not as Bitterlesson-pilled as most. 00:11:25,862 --> 00:11:25,972 [Seb Krier] Well, 00:11:27,212 --> 00:11:31,052 [Seb Krier] I think I’m definitely Bitterlesson-pilled in the sense that I don’t think you should 00:11:33,652 --> 00:11:48,992 [Seb Krier] try to kind of cement some sort of rules-based system you either devise or something and kind of hope that this just takes forever. If anything, I think the scaffold needs to be a lot more adaptive and evolve over time. In the same way as if you have a small startup and they have all sorts of kind of rules and, 00:11:50,332 --> 00:12:02,772 [Seb Krier] sorry, not rules, different functions. When the startup grows and gets more capabilities, they also kind of change from the inside. So I think that, of course, if you have some sort of light GPT-type wrapper that kind of makes your system a little bit better, whatever, yeah, that was not going to 00:12:03,812 --> 00:12:23,652 [Seb Krier] work out over time. But I think there are kind of scaffolds that help better integrate the wider environment, private data, deals with permissions or liability regimes or user preferences and whatnot. And also, at a somewhat higher level, kind of more coordination-type scaffolds maybe in terms of market interfaces, like clearing house equivalents or something. 00:12:24,516 --> 00:12:33,536 [Seth] The third example you gave is maybe it’s not the super frontier model that are going to these scaffolds, but simpler models that are still very useful and cheaper to run with a scaffold. 00:12:33,536 --> 00:12:46,176 [Seb Krier] Yeah, totally. Because I think you’re not going to need the enormous, super expensive brain for every single random task. And so it’ll make, for most kind of basic queries, people aren’t using Opus’s latent space or something as- 00:12:46,176 --> 00:12:46,186 [Seth] [laughing] 00:12:46,186 --> 00:12:48,236 [Seb Krier] ... it’s a big waste in some sense. 00:12:48,236 --> 00:12:50,036 [Seth] What toothbrush should I buy? [chuckles] 00:12:50,036 --> 00:12:51,196 [Seb Krier] Yeah. Exactly. 00:12:51,196 --> 00:12:53,896 [Andrey] Wait. That is an important question, Seth. 00:12:53,896 --> 00:12:54,516 [Seb Krier] I mean- 00:12:54,516 --> 00:12:56,536 [Andrey] I would definitely use Opus for that. 00:12:56,536 --> 00:12:57,385 [Seb Krier] It’s funny because I’ve actually- 00:12:57,385 --> 00:12:59,696 [Seth] Use all the collective intelligence of reality. [chuckles] 00:12:59,696 --> 00:13:02,266 [Seb Krier] I have actually used Opus for that exact question not long ago- 00:13:02,266 --> 00:13:02,626 [Seth] [laughing] 00:13:02,626 --> 00:13:06,256 [Seb Krier] ... in trying out this new electric toothbrush that I found out as a result. But, 00:13:07,636 --> 00:13:22,076 [Seb Krier] so yeah, I agree there’s that and also there’s all sorts of ways in which actually kind of using tools or specialized kind of tools is just more effective and more efficient. Why would you expect a large model or something to kind of calculate things innately or something when you can just access a calculator? It’s a much better use of tokens. 00:13:22,076 --> 00:13:36,856 [Andrey] But it should kind of know that the calculator is available and then use it when it’s there. So that’s the argument against scaffolding, or you’re giving it a general environment, but you’re not scaffolding it much. I think a curious thing is just, 00:13:38,376 --> 00:13:40,356 [Andrey] it seems like most people who are using 00:13:41,416 --> 00:13:49,156 [Andrey] scaffolded agents today are using them with essentially one of two scaffolds, with Cloud Code or Codex. And 00:13:50,236 --> 00:14:00,475 [Andrey] those seem to be good enough maybe. I guess, do we see a lot of people customizing, a lot of people, whatever, companies customizing their scaffolds? 00:14:00,476 --> 00:14:03,856 [Seth] CladBot, do the CladBots count as that, I guess? 00:14:03,856 --> 00:14:04,236 [Andrey] Yeah. 00:14:05,396 --> 00:14:39,676 [Seb Krier] They are a form of it. I don’t know. I think a lot of power users and people in our immediate communities use a lot of Cloud Code and Codex, and particularly software engineers. But I don’t think most legal departments and most kind of firms out there are necessarily using Cloud Code either. And it’s not clear to me that this is necessarily the optimal interface or, there may be better systems that are Cloud Code-like, or CLI-like perhaps in some way. But, so I don’t know, maybe they’re sufficient, but even these tools end up kind of calling on loads of other external APIs and tools and so on in how they 00:14:40,836 --> 00:14:57,576 [Seb Krier] function. So if anything, these are actually scaffolds. You’re not kind of calling the model directly. There’s all sorts of different sub-agents behind the scenes. It’s not just a one-shot call. There’s quite a lot going on, which is in fact this more, I don’t know, dynamic scaffolding thing I was mentioning earlier, I guess. 00:14:58,976 --> 00:15:06,736 [Andrey] Okay. The natural question here is, what is going to be the role of the market in coordinating- 00:15:06,736 --> 00:15:07,375 [Seb Krier] Mm 00:15:07,375 --> 00:15:11,276 [Andrey] ... AI here? And I’ll just very shamelessly plug- 00:15:11,276 --> 00:15:11,285 [Seb Krier] [chuckles] 00:15:11,285 --> 00:15:24,796 [Andrey] ... some recent work with Rohit Krishnan, where we’re kind of playing around with the idea of LLMs bidding in a procurement auction and seeing whether that results in more efficient use of AI. 00:15:26,696 --> 00:15:29,655 [Seb Krier] Well, first of all, I need to properly read that again. But the- 00:15:29,655 --> 00:15:30,476 [Andrey] [laughing] 00:15:30,476 --> 00:15:31,016 [Seb Krier] In terms of, 00:15:32,496 --> 00:15:32,916 [Seb Krier] I guess, 00:15:34,556 --> 00:15:46,396 [Seb Krier] at a very high level, markets are good at just coordinating in general, including AI. And so, assuming they function as intended in it, you’ve got the pricing mechanism to get... 00:15:47,556 --> 00:15:49,396 [Seb Krier] I don’t know. I expect that to kind of work as well with 00:15:50,476 --> 00:15:52,616 [Seb Krier] matching, I guess, supply and demand or something. 00:15:54,016 --> 00:15:55,196 [Seb Krier] The supply of this 00:15:56,216 --> 00:16:00,036 [Seb Krier] raw resource of cognition or something, and the demand of all sorts of different businesses and users. 00:16:01,696 --> 00:16:05,516 [Seb Krier] So maybe, at a very high level, I don’t know. What exactly do you mean by the role of the market or something here? 00:16:09,076 --> 00:16:21,356 [Andrey] Obviously the market is involved in many parts of the AI vertical supply chain, right? From competition in chips. There’s competition between models. There might be also competition between 00:16:22,516 --> 00:16:28,576 [Andrey] scaffolds, bundles of environments, scaffolds, and LLMs. 00:16:28,576 --> 00:17:06,496 [Seth] I guess maybe it would be useful to juxtapose this versus, so what Andrey, one of the things he’s imagining is, I have a job. I post it to some sort of Upwork-like future platform. Different companies that host different AI models bid to do that job. “Oh, I think I can do that job with $1 of electricity and tokens,” versus another model, and then we get efficient allocation of intellectual tasks to models, right? So do we think that that’s going to be important, or is it going to be more like I ask the super model what the best model is, and I just get allocated in a non-market way? Might be one version of this question. 00:17:08,156 --> 00:17:18,836 [Seb Krier] I guess intuitively, my mind goes to the former question. But, or there’s a little bit of both in some sense, because even in the former one, you’re going to be using the large model for some sort of 00:17:20,436 --> 00:17:26,686 [Seb Krier] cognitively demanding task or something. It kind of depends what kind of quality of output you also need and want. 00:17:26,686 --> 00:17:26,706 [Seth] [chuckles] 00:17:26,706 --> 00:17:27,056 [Seb Krier] But then 00:17:28,376 --> 00:17:49,636 [Seb Krier] you’re still going to be constrained by your own resources or something, and depending on what you have to spend, if you can get the output for cheaper by kind of relying on this kind of competitive marketplace of smaller models or something, not even smaller models, they might just be all be big and kind of just scaffolding different, you’re offering a slightly different thing. Why wouldn’t you go for that, and why wouldn’t that exist in the first place? Unless the very first- 00:17:49,636 --> 00:17:52,216 [Andrey] Doesn’t exist yet, just to be clear. 00:17:52,216 --> 00:17:52,716 [Seb Krier] Um- 00:17:52,716 --> 00:17:58,416 [Seth] A, it doesn’t exist yet, and as Andrey proves, at least current models are bad at understanding their own capabilities. 00:17:58,416 --> 00:17:58,666 [Andrey] Oh, yeah. 00:17:58,666 --> 00:18:00,496 [Seth] Now maybe that’s going to be fixed. 00:18:00,496 --> 00:18:08,096 [Seb Krier] Yeah. Oh, no, I agree. I think that we’re not there yet, right? I think, again, and that goes back to the earlier AGI question, is there’s all sorts of, then again, what’s the right comparator? But, 00:18:09,476 --> 00:18:21,316 [Seb Krier] yeah, I don’t think we’re exactly there. Yeah, I think a lot of this will have to be built as well. The kind of an ability for a model to just better kind of operate in a more multi-agent environment, kind of have a better sense of 00:18:22,596 --> 00:18:32,556 [Seb Krier] delegation. I think the kind of, yeah, industrial intelligence or something seems to be maybe more neglected, as opposed to just single-agent intelligence or something, if that makes sense. 00:18:32,556 --> 00:18:34,776 [Seth] Do we need to bring the word cybernetics back? 00:18:34,776 --> 00:18:35,496 [Seb Krier] Yeah. 00:18:35,496 --> 00:18:36,116 [Andrey] [laughs] 00:18:36,116 --> 00:18:38,816 [Seb Krier] Somewhat. [laughs] 00:18:40,756 --> 00:18:51,256 [Andrey] All right. A little change in subject, but I know this has been in the discourse, the topic of recursive self-improvement, RSI. 00:18:51,256 --> 00:18:52,956 [Seth] Ooh, very scary. 00:18:52,956 --> 00:18:54,896 [Andrey] Jack Clark recently had an essay about it. 00:18:56,376 --> 00:18:58,876 [Andrey] Seb, what is your take? 00:18:58,876 --> 00:18:59,206 [Seb Krier] [chuckles] 00:19:00,316 --> 00:19:07,896 [Seb Krier] What is my take? I don’t know. I think it depends what exactly we mean by recursive self-improvement. 00:19:09,096 --> 00:19:50,336 [Seb Krier] I had a blog post not long ago, I guess, when trying to disentangle a little bit what I have in mind when I think about this. On the one hand, there’s the model getting recursively better through the usage of more AI and whatnot. And on the other hand, there’s the more kind of societal side of things, the transformation side, which I think very often, these two worlds are a little bit blurred in the discourse. It’s like, oh, you get RSI, and then X, Y, Z about the world or something. Things go really fast or they don’t go fast. And, I think these should be separated very neatly because on the model side, of course, I expect, already there’s a lot of AI being used everywhere to kind of create models. And I expect that to continue. 00:19:52,536 --> 00:19:55,976 [Seb Krier] But it’s not clear to me that this necessarily now leads to a dynamic by which 00:19:57,156 --> 00:20:16,596 [Seb Krier] the model now gets extremely or exponentially intelligent in a very short amount of time. It’s still kind of bottlenecked by all sorts of resources. And as I was saying earlier, I still see them as better at kind of paradigm exploitation than kind of exploration, which I think is the thing you might need to get to the next step. But, first of all, what do I know? But secondly, 00:20:17,616 --> 00:20:19,986 [Seb Krier] the other thing is, yeah, on the societal side of things, 00:20:20,996 --> 00:20:29,756 [Seb Krier] people sometimes talk about foom or hard takeoffs and whatnot, and these have very clear kind of real-life implications. It’s not just kind of a model of getting better in a 00:20:31,216 --> 00:20:34,576 [Seb Krier] data center somewhere. And that side, I think, is where you have to think about 00:20:36,116 --> 00:21:27,056 [Seb Krier] [lip smack] all the kind of usual bottlenecks, adoption, deployment, diffusion, the kind of productive integration of all these systems at scale, both in terms of manufacturing and so on and so forth. And, I guess it’s not clear to me that the shift from GPT-2 to GPT-3 or coming up with kind of, we’re just very classic kind of software engineering, meat and potatoes type tasks that you can just easily just automate away. It’s maybe one of these things that’s maybe easy to say ex post, but, I’m not sure. And certainly, my expectation is you’re going to get loads of gains in the coming years of kind of automating part of that pipeline. But that seems good. You just get better models, and that’s just overall helpful for all sorts of other things, even if you’re doing safety work and kind of governance work and whatnot, we benefit a lot from that cognitive resource, I guess. 00:21:27,056 --> 00:21:40,696 [Andrey] What would happen in the world for you to change your mind? Is there any, let’s say that recursive self-improvement is actually kind of this much more profound change than you’re painting. 00:21:41,816 --> 00:21:42,036 [Andrey] What 00:21:44,136 --> 00:21:45,696 [Andrey] signs would there be, I guess? Yeah. 00:21:45,696 --> 00:21:51,656 [Seb Krier] But to be clear, I’m not claiming it’s just business as usual, nothing to see here or whatever, right? I’m 00:21:52,796 --> 00:22:14,936 [Seb Krier] kind of just claiming that some of the stronger versions of the claim aren’t kind of self-evident. And so I see a lot of this happening in some sense. Certainly, in 10 years, I expect to have larger kind of more, again, acceleration of economic growth and whatnot and kind of faster diffusion across the board. I certainly don’t expect diffusion to take the same amount of time as, say, electricity or these other technologies. 00:22:16,576 --> 00:22:23,236 [Seb Krier] So it depends what exactly you mean, because what specifically am I looking to change my mind on? 00:22:23,296 --> 00:22:30,656 [Andrey] Well, let’s say the scenarios of AI 2027, right? Presumably, 00:22:31,996 --> 00:22:45,176 [Andrey] in 2027, you’ll see something that’s like, “Oh, wow, I was wrong. This is not going to be so gradual. This is going to be this sudden foom,” that you’re criticizing. Yeah. 00:22:45,176 --> 00:22:52,236 [Seb Krier] The original foom or hard takeoff definition literally talks about this change happening within hours or days. 00:22:52,236 --> 00:22:53,236 [Andrey] [chuckles] 00:22:53,236 --> 00:22:56,056 [Seb Krier] Which is not even, it’s not what the 2027 scenario, I think, predicts. 00:22:56,056 --> 00:22:56,296 [Andrey] Yes. 00:22:57,556 --> 00:23:00,446 [Seb Krier] But the 2027 scenario, from what I remember, again, it’s been a bit of time now. 00:23:01,796 --> 00:23:08,816 [Seb Krier] One thing with the scenarios there is that there’s the kind of misalignment assumption, and which I’m kind of uncertain about. 00:23:08,816 --> 00:23:09,255 [Andrey] Mm. 00:23:09,256 --> 00:23:17,296 [Seb Krier] And it also talks about a lot of progress in robotics, which I think is a bit further away. I think it’s close. We’re getting there, too. 00:23:19,116 --> 00:23:19,476 [Seb Krier] But 00:23:21,156 --> 00:23:25,916 [Seb Krier] I don’t know. Probably kind of AI, if in 2030, we start seeing AI is making all sorts of crazy 00:23:26,956 --> 00:24:06,196 [Seb Krier] inventions, innovations in fields other than just kind of perhaps math and coding across the boards, and I’m like, okay, this is clearly-- And you get extremely fast adoption, too, right? You have entire businesses doing completely, it’s not business as usual, clearly, in the economy or something and wide adoption. But it’s hard to say because I expect all that to some degree, right? It’s not that I’m saying, “Oh, this is never going to happen.” I just think of it as a little bit more elongated and the implications of that being maybe not as like, we have Dyson spheres in five years or something like that, so. It’s more of a disagreement maybe on the extremes or the margins or something, but not so much at the core of the claim that yes, models are going to make models better and... 00:24:07,276 --> 00:24:27,536 [Seb Krier] But, again, even having-- In fact, actually, here would be a thing. If Anthropic or DeepMind or something in 2037 have fewer and fewer employees, fewer people kind of just doing AI research, engineers and so on, you’re clearly seeing kind of that profession. Because of course, I can imagine these jobs to change, right? Maybe you’re kind of managing more agents or something. That 00:24:28,616 --> 00:24:35,966 [Seb Krier] I expect. But the fact that you just need far fewer people to kind of do not only these large training runs, but the kind of 00:24:36,976 --> 00:24:43,476 [Seb Krier] large training runs that give you just much, much better systems, then I think I’d be like, okay, this is going a little bit faster than maybe expected or something. 00:24:44,656 --> 00:24:51,676 [Andrey] Okay. One thing you mentioned in that kind of hints at another hot take you have, which is about alignment. 00:24:51,676 --> 00:24:52,026 [Seb Krier] Uh-huh. 00:24:54,596 --> 00:24:55,926 [Andrey] What’s the deal with alignment? 00:24:57,196 --> 00:24:58,086 [Andrey] [laughs] 00:24:58,086 --> 00:24:58,136 [Seb Krier] [laughs] 00:24:58,136 --> 00:25:02,136 [Seth] Is it hard? Is it easy? Is it different than we would’ve expected going in? 00:25:02,136 --> 00:25:19,646 [Seb Krier] Yeah. It’s perhaps that. I think my take about alignment is something-- Well, first of all, I just don’t like the word. I think it’s a bit of an annoying word because it’s being used for all sorts of things. The AI says something that we just kind of don’t like, or you say, “Oh, it’s misaligned.” No one pre-registers what they expect the aligned behavior to be, and then just kind of tests. 00:25:19,646 --> 00:25:20,116 [Andrey] [laughs] 00:25:20,116 --> 00:25:35,626 [Seb Krier] But I think my general claim is maybe the fact that it’s been easier than we would’ve predicted a decade ago or so. Then when I first got into AI in 2017, that was partly as a result of reading things like “Superintelligence” by Bostrom. 00:25:35,626 --> 00:25:36,236 [Andrey] Mm-hmm. 00:25:36,236 --> 00:25:48,496 [Seb Krier] And you’d read these books, like Stuart Russell’s “Human Compatible” and others, that kind of had all these analogies like King Midas and you ask a system to optimize for goal X, and in pursuit of that goal, it does all sorts of other things that you don’t want it to do. 00:25:48,496 --> 00:25:51,916 [Seth] Right. The paperclip maximizer, and we seem to not have those. 00:25:51,916 --> 00:25:57,476 [Seb Krier] Yeah. It’s like one version of it or one variant of it. And certainly at the time you didn’t really have language models. A lot of these intuitions were kind of based off 00:25:58,596 --> 00:26:48,236 [Seb Krier] reinforcement learning systems in very basic kind of game scenarios where they were actually given a single goal to optimize for. And this is not actually what we do, I think, with models. And you had these kind of examples, even the value loading problem was something discussed at the time where actually specifying these complicated nuanced human values in mathematical terms would be extremely hard. So even if you managed to tell a robot to clean the room, it would then just pick up a baby and put it in the trash or something. And I think it turns out a lot of this stuff is actually much easier. You have problems. You’ve got things like reward hacking. You’ve got AIs behaving in weird ways that we were not always kind of anticipating because of the ways they were post-trained. So my claim is not like, oh, again, it’s all fine, and safety is a scam or whatever. It’s more that it’s certainly much easier than, or at least we’re in a much better track than I would’ve at least guessed perhaps a decade ago. And secondly, I think it 00:26:49,916 --> 00:26:54,816 [Seb Krier] just seems tractable. There’s a lot of progress in terms of chain-of-thought monitoring and all these other things. And 00:26:56,696 --> 00:26:57,796 [Seb Krier] I also think that the 00:26:59,016 --> 00:27:05,825 [Seb Krier] hard part is maybe more the kind of normative question of whose values and when, and what and everything. That’s the kind of thing that we’re looking into more. But 00:27:07,096 --> 00:27:13,696 [Seb Krier] yeah, I prefer the word actually instruction following or intent following or something instead of alignment. And I think by and large, they’re actually pretty good at that. 00:27:14,796 --> 00:27:31,636 [Seb Krier] So again, that doesn’t mean you have to dismiss all sorts of theories and all the kind of power optimization stuff. But I guess my immediate outcome is this goes rather well. Or if I am more concerned by other things like misuse, if you’d like, than kind of the AI’s being innately, inherently kind of internally misaligned. 00:27:31,636 --> 00:28:03,676 [Seth] This really seems related to your take that intelligence is not at odds with being a tool, right? So a lot of people have this intuition where if you had a super-duper intelligent genie or oracle, it would develop even implicitly some sort of value or goal that orthogonality thesis might have nothing to do with what we want. But you’re more optimistic about the idea that the LLM doesn’t want anything. It’s incorrect to take the intentional stance towards an LLM. 00:28:03,676 --> 00:28:09,236 [Seb Krier] Not incorrect. It’s actually kind of descriptively useful, even functionally sometimes to use that language. 00:28:10,796 --> 00:28:18,836 [Seb Krier] But that’s the thing, right? I think we kind of lack the language to properly delineate and differentiate when it’s useful to use that or appropriately descriptive and when it’s not. 00:28:20,076 --> 00:28:41,496 [Seb Krier] And so I agree that, of course, I think the take I had on this was something like, and I can imagine a tool being an agent and an agent being a tool. Or in principle, I can imagine something being hyper-capable and still being broadly instruction following rather than at a certain level of capability, aha, that’s when the goals change and things get... And it kind of depends on the type of system as well. I imagine not all 00:28:42,656 --> 00:28:45,116 [Seb Krier] paths lead to the same kind of outcome. But, 00:28:46,256 --> 00:29:13,596 [Seb Krier] so again, I can see plausible versions of the world where homo hundrio drives or something are a more salient feature of the way we kind of train models. Right now, it doesn’t seem to me very likely that this is a core feature that they have. But of course, it’s hard to kind of either prove or disprove, right? Because someone might just say, well, that’s because they’re very good at hiding this or something, or once they’re capable enough or whatever. So there’s always a bit of this kind of gotcha thing. It’s like deception. But 00:29:14,936 --> 00:29:39,896 [Seb Krier] yeah. So in principle, I guess I can totally conceive of at least a superintelligence that is controllable, that is benign, that is at least subservient to the goals of humanity or a user or principle or whatever. That could still be used to cause enormous harm, but it’s just I don’t necessarily think the analogies of, oh, I think Tegmark was thinking, look at the zoo where the monkey’s going. I think these are just not really 00:29:41,736 --> 00:29:43,136 [Seb Krier] helpful kind of analogies. 00:29:44,276 --> 00:30:02,396 [Seth] Monkey at the zoo, but you’ve also got the monkey’s paw, right? Maybe the reason some prefer alignment to instruction following is we all know the story of, be careful what you wish for. You wish for something, and it’s under-specified, and you get the bad version of it because the AI doesn’t understand the context. 00:30:02,396 --> 00:30:08,336 [Seb Krier] I think that’s why, yeah, I think maybe instruction following is maybe too... Intent following or something gets to it more. 00:30:09,936 --> 00:30:18,316 [Seb Krier] But of course, that problem doesn’t go, even if it follows intent or something, you could still have all the problems because your intent is nefarious or whatever. So 00:30:19,436 --> 00:30:19,816 [Seb Krier] I think the 00:30:21,356 --> 00:31:06,756 [Seb Krier] way you deal with that is all sorts of, I don’t know how to conceptualize it, but in fact scaffolds. It’s a bit more this outside of the model or something. I’m kind of almost indexing on a world that will indeed have agents that are trained to be bad or whatever, or someone going to be instructed to do bad things. But just like with humans, you come up with all sorts of kind of systems, rules, laws, norms, kind of protocols that either discourage the kind of bad behavior, or punishes it, or makes it just not worthwhile or something. But I’m not going to put all my bets on the, oh, it has to be pure-hearted, and that will be sufficient. And then you just scale it forever, and it’s going to be an amazing goal. I just think that the way of seeing or thinking about AI is that I just find kind of a bit 00:31:08,096 --> 00:31:12,656 [Seb Krier] too narrow, I guess. I think it’s important, it’s just insufficient, and it’s certainly not my main kind of a-- yeah. 00:31:14,946 --> 00:31:15,206 [Andrey] Okay. 00:31:16,666 --> 00:31:20,086 [Andrey] Our audience is very much composed of economists. 00:31:22,586 --> 00:31:30,506 [Andrey] If you’re an economist and you’re very interested in AI, what sort of work would you be trying to do? 00:31:30,506 --> 00:31:32,146 [Seth] Maybe to be useful to AI people- 00:31:32,146 --> 00:31:32,216 [Andrey] Yes 00:31:32,216 --> 00:31:37,466 [Seth] ... in particular. What would you want, what did the DeepMind team want to read from economists? 00:31:37,466 --> 00:32:20,766 [Seb Krier] I think kind of engaging with their assumptions or something, right? If you assume, let’s say, an AG-- and I think some do, to be fair. I actually think there’s a lot more, I think, discourse now going on between economists and AI people, whatever. But assuming that you do have AI systems that are interchangeable or almost quasi-fully substitutable with humans, that come up with good ideas, that are parallelizable and whatnot, what does that change to your kind of growth function and so on? So, maybe that’s useful. Right now, in the short term, at least, there’s all sorts of questions around labor, there’s questions around productivity or adoption. Clearly, there’s useful work to be done there. But I think in terms of AGI specifically, given that a lot of the field just thinks you’re going to get to AGI in the next five to 10 years, 00:32:22,746 --> 00:32:26,806 [Seb Krier] what are the implications for taxation? What are the implications for 00:32:28,626 --> 00:32:37,786 [Seb Krier] how that’ll affect different states across the world? I think I’m probably more worried about a call center in Hyderabad than I am about the white-collar worker in North America or something. So, 00:32:39,066 --> 00:32:57,306 [Seb Krier] yeah. I think all these kind of questions, but just indexing more and making fewer, I guess, assumptions around the limits of capabilities. Because sometimes you see them kind of being implicitly snuck in somewhere or something of like, well, because AIs can’t do XYZ, therefore... And yeah, fine, but maybe they will do XYZ. And then what? How does that change your thinking? Yeah. 00:32:57,306 --> 00:32:59,506 [Seth] Maybe more scenario planning than, 00:33:00,526 --> 00:33:04,746 [Seth] here’s my median projection, or here is one projection I think is plausible. 00:33:04,746 --> 00:33:22,846 [Seb Krier] Yeah. And embedding the kind of thoughtful models and thinking that economists have within these scenarios and making them more salient to the kind of computer scientists, right? Even when I brought up competitive advantage, people will be like, “Oh, but what if the AI is cheaper and better?” It’s like, well, that’s not the point. The opportunity cost point of competitive advantage, there’s a difference. 00:33:22,846 --> 00:33:23,286 [Andrey] [laughs] 00:33:23,286 --> 00:33:31,786 [Seb Krier] And again, there are answers to that as well, but I think just kind of better translating, I think, some of these insights to the AI tribe, the thing is useful. 00:33:32,846 --> 00:33:40,526 [Andrey] So that’s very naturally leading us to this question about yourself. And you do lots of different things. 00:33:41,946 --> 00:33:50,426 [Andrey] You’re prolific on Twitter, for sure. But also, you’re doing internal work for DeepMind. How do you allocate your time? 00:33:52,066 --> 00:33:52,166 [Seb Krier] I don’t know. 00:33:52,166 --> 00:33:53,266 [Seth] What percentage is Twitter? 00:33:53,266 --> 00:33:54,646 [Andrey] Yeah. [laughs] 00:33:54,646 --> 00:34:04,686 [Seb Krier] Twitter is actually not that much today. It must be an hour max or something, an hour and a half, two hours, maybe, something. But that is maybe much by others’ standards. But the- 00:34:04,686 --> 00:34:06,476 [Andrey] [laughs] What is the optimal amount of Twitter? [laughs] 00:34:06,476 --> 00:34:29,866 [Seb Krier] [laughs] Yeah. It’s the Pareto optimal. I guess, in my day-to-day work, it’s a mixture of proactive and reactive. Proactive in the sense that I think, oh, these questions of agents and cybersecurity and liability and whatnot, and biosecurity are kind of important things to look into, and therefore, there’s a lot of research that I do and colleagues do, and a lot of coordination across the org. 00:34:31,026 --> 00:34:39,486 [Seb Krier] But there’s also more reactive stuff because we’re a policy team, and so there’s things happening in the external world like CA 53, the preemption debates. 00:34:40,546 --> 00:34:48,386 [Seb Krier] So it’s a bit of a mix of that. And of course, all sorts of internal dynamics. But, yeah. I guess I’m curious about all sorts of other things, and so when I do have time, and I’ve kind of 00:34:50,006 --> 00:34:58,106 [Seb Krier] completed the main quests, I try to keep some time for other stuff I’m interested in. I work with some research teams and kind of look into what they’re into. I’ll 00:34:59,266 --> 00:35:09,826 [Seb Krier] find topics or themes that I think are maybe kind of neglected or underrated or I just don’t see out there as much, and like, “Oh, cool. We’re going to try to find out about this more.” But I think it’s just very kind of curiosity driven, and the allocation of time is 00:35:11,566 --> 00:35:16,705 [Seb Krier] not super thought out. It’s more like, oh, I think these things are interesting, and I’m going to get into that for a bit. [laughs] 00:35:16,706 --> 00:35:22,306 [Andrey] So it wasn’t a deliberate strategy of getting Tyler’s attention and adoration. [laughs] 00:35:22,306 --> 00:35:25,126 [Seb Krier] No, not at all. Not at all. But I’m very- 00:35:25,126 --> 00:35:25,746 [Seth] The long play 00:35:25,746 --> 00:35:30,565 [Seb Krier] ... very grateful for his... [laughs] For the meme. But- 00:35:30,566 --> 00:35:41,766 [Seth] What kind of, but I know you can’t be specific, but for your sort of internal work, what does a work product look like? Are you participating in a meeting and giving hot takes? Are you writing internal memos? What is- 00:35:41,766 --> 00:35:42,026 [Seb Krier] Yeah 00:35:42,026 --> 00:35:42,276 [Seth] ... in- 00:35:42,276 --> 00:35:56,406 [Seb Krier] It’s a mixture. Obviously, meetings. Any large bureaucracy will have meetings. But I think a lot of analysis, memos to execs sometimes. Just research, managing researchers sometimes, depending on the project. 00:35:57,626 --> 00:36:04,106 [Seb Krier] We’ll have a lot of coordination. Actually, I’m realizing through a lot of these kind of meetings, a lot of it is just kind of coordination and information transfer, right? 00:36:04,106 --> 00:36:04,146 [Andrey] [laughs] 00:36:04,146 --> 00:36:07,006 [Seb Krier] It’s maybe why I’m so obsessed with the Coasean bargaining thing. Just let- 00:36:07,006 --> 00:36:07,326 [Seth] Ah 00:36:07,326 --> 00:36:08,546 [Seb Krier] ... the agents do it. But, 00:36:09,806 --> 00:36:34,116 [Seb Krier] yeah. I think the day-to-day work is a lot of reading, a lot of meetings, a lot of writing, and distilling and translating information, I think, across different tribes also. So if I’m talking to legal people, like lawyers, about what’s going on in, say, the more technical side of the org, or if I’m speaking to the researchers about something that’s more... But yeah, there’s a lot of translating of concepts across different stakeholders, I guess. 00:36:34,116 --> 00:36:45,726 [Andrey] So how does that work in an org like Google? Because I think in a lot of orgs, they’re really obsessed with KPIs and output metrics. 00:36:45,726 --> 00:36:46,156 [Seb Krier] Mm-hmm. 00:36:46,156 --> 00:36:48,746 [Andrey] And what you’re describing sounds very- 00:36:48,746 --> 00:36:49,706 [Seth] Hot takes per meeting. [laughs] 00:36:49,706 --> 00:36:54,926 [Andrey] Yeah. Very much amorphous, very hard to measure. 00:36:56,066 --> 00:36:56,196 [Seb Krier] Yeah. 00:36:56,196 --> 00:37:00,606 [Andrey] Obviously, you have a lot of external visibility, but is that 00:37:02,786 --> 00:37:07,846 [Andrey] a problem? Or is that just it’s understood that that’s how this goes? Yeah. 00:37:07,846 --> 00:37:13,846 [Seb Krier] I think the external stuff is kind of almost just very separate from the kind of day-to-day work side of things. 00:37:14,986 --> 00:37:23,366 [Seb Krier] And yeah, internally, we do have KPIs or equivalents or whatever. I think they may be less numerical in nature. But you might still have some, develop a consistent position on 00:37:24,506 --> 00:37:30,819 [Seb Krier] X issue or something in the next two, three months.And that requires a lot of research work, coordinating. 00:37:30,819 --> 00:37:32,929 [Seth] Have 10 opinions. [laughs] 00:37:32,930 --> 00:37:38,100 [Seb Krier] No, ideally they just want one. I think 10 opinions, that’s the issue. There are a lot of opinions out there. You’ve got to find the good ones. 00:37:38,100 --> 00:37:39,530 [Seth] That’s the main problem with economists. 00:37:39,530 --> 00:37:42,350 [Seb Krier] But [laughs] yeah. Exactly. Who was that quote? 00:37:43,830 --> 00:37:44,290 [Seth] Truman. 00:37:44,290 --> 00:37:44,330 [Seb Krier] Yeah. 00:37:44,330 --> 00:37:46,210 [Seth] Truman begged for the one-handed economist. 00:37:46,270 --> 00:38:20,990 [Seb Krier] Yeah, exactly. But, so I think, yeah, I think internally it’s just a kind of analysis or something. Say you’re thinking about, oh, agents and legal liability. How do these things work? What does the existing legal environment say and prescribe? What happens if something goes wrong? What are relevant factors? There’s a lot of that kind of thing. And I guess particularly within the DeepMind side, because when we’re on the frontier side, we’re thinking about the next five years as opposed to what’s going on right now. But yeah, the other side stuff is really just kind of out of personal interest and just me writing stuff, and they seem fine with it so far. [chuckles] 00:38:20,990 --> 00:38:26,510 [Andrey] What about... So we’ll be at a conference together, the Post-AGI conference- 00:38:26,510 --> 00:38:26,830 [Seb Krier] Ooh 00:38:26,830 --> 00:38:28,370 [Andrey] ... at Lighthaven, Berkeley. 00:38:28,370 --> 00:38:30,110 [Seth] Ooh. Prestigious. 00:38:31,130 --> 00:38:32,990 [Andrey] I don’t know if it’s prestigious. 00:38:34,550 --> 00:38:34,629 [Seth] [laughs] 00:38:34,630 --> 00:38:45,730 [Andrey] But you’ve gone to a few of these conferences, like the Curve is another fairly well-known one. What’s your take on these? 00:38:45,730 --> 00:38:54,750 [Seb Krier] I think some are useful. The majority of conferences I go to, I don’t exactly find that life-transforming, I guess. 00:38:54,750 --> 00:38:57,610 [Andrey] [laughs] You’re going to the wrong conference. [laughs] 00:38:57,610 --> 00:39:09,290 [Seb Krier] I know. Can someone show me the... But I think, yeah, they obviously perform a social function to some degree, right? There’s a lot of meeting people, some networking or something, some kind of finding out new ideas. But 00:39:10,390 --> 00:39:20,310 [Seb Krier] my issue with conferences, very often they’re just very tame. They’re very risk-averse. They’re very the same ideas you’ve-- Already if you can read it online or something, it depends on the conference. But, 00:39:21,510 --> 00:39:24,190 [Seb Krier] although I have been to really good ones, too. There was this 00:39:25,570 --> 00:39:43,529 [Seb Krier] IMF conference with Econ Ty, with I think Anton Korinek and others had organized. And that was great because that was a nice one where you had both the technologists and a lot of economists and loads of presentations, and you got to learn lots of new things. But, in general, I don’t see a huge... Beyond maybe showing, again, some hot takes here and there. 00:39:45,370 --> 00:39:49,990 [Seb Krier] Yeah, some I assume are good conferences. [chuckles] 00:39:49,990 --> 00:40:00,670 [Seth] I’m just the exception, but you had a great joke on your Twitter the other day about this, which is, Caveman panelist one, “Fire is bad.” Caveman panelist two, “Fire is good.” 00:40:00,670 --> 00:40:00,770 [Seb Krier] Yeah. 00:40:00,770 --> 00:40:02,100 [Seth] Caveman panelist three, 00:40:03,450 --> 00:40:07,120 [Seth] “We need to balance the upsides and downsides of fire and use it wisely.” 00:40:07,120 --> 00:40:07,320 [Seb Krier] Absolutely. 00:40:07,320 --> 00:40:09,620 [Seth] Wild applause. [laughs] 00:40:09,620 --> 00:40:09,650 [Andrey] [laughs] 00:40:09,650 --> 00:40:14,850 [Seb Krier] Exactly. There’s a lot of that. That’s the energy that I’m getting very tired of because it’s- 00:40:14,850 --> 00:40:15,050 [Seth] [laughs] 00:40:15,050 --> 00:40:21,700 [Seb Krier] And I like playing the role of the wise centrist opinion, whatever. But it does get very- 00:40:21,700 --> 00:40:23,150 [Seth] You do get wild applause. 00:40:23,150 --> 00:40:24,470 [Seb Krier] Yeah. All the time. [chuckles] 00:40:26,490 --> 00:40:29,770 [Seb Krier] But yeah, I think there’s a lot of that. I wish there were more 00:40:30,810 --> 00:40:35,090 [Seb Krier] almost private Chatham House-y conferences, where you had people who highly disagreed with each other- 00:40:35,090 --> 00:40:35,210 [Andrey] Mm 00:40:35,210 --> 00:40:36,770 [Seb Krier] ... but were polite and didn’t get at 00:40:37,950 --> 00:40:49,370 [Seb Krier] each other’s throats. And you had more setups that actually allowed ideas to clash a bit more, in a civilized way, of course. But that would be a bit hard, but also much more interesting, I think, than 00:40:51,490 --> 00:40:55,390 [Seb Krier] everyone broadly agreeing that it’s good to be good and it’s bad to be bad, and yeah. [chuckles] 00:40:55,390 --> 00:41:03,710 [Andrey] I do feel like the Lighthaven conferences are quite good for this, in that there’s an enormous amount of free time and- 00:41:03,710 --> 00:41:04,130 [Seb Krier] Mm-hmm 00:41:04,130 --> 00:41:07,770 [Andrey] ... free space that’s not where the talk is happening. 00:41:07,770 --> 00:41:07,940 [Seb Krier] Yeah. 00:41:07,940 --> 00:41:10,630 [Andrey] And so you do get a lot of this. 00:41:10,630 --> 00:41:11,040 [Seb Krier] Well, yeah, I agree. 00:41:11,040 --> 00:41:21,090 [Andrey] But I agree that many conferences are not like that, where you’re just packed. You have a conference hall, and you don’t have anywhere else to go, and it’s packed with talks. Yeah. 00:41:21,090 --> 00:41:21,710 [Seb Krier] Yeah. No, totally. 00:41:21,710 --> 00:41:23,550 [Seth] NBER Summer Institute. [laughs] 00:41:24,750 --> 00:41:28,330 [Andrey] Seth, there is disagreement. Say what you will. At NBER- 00:41:28,330 --> 00:41:28,540 [Seth] There is fire 00:41:28,540 --> 00:41:29,430 [Andrey] ... people throw down. 00:41:30,450 --> 00:41:31,430 [Andrey] [laughs] 00:41:31,430 --> 00:41:37,720 [Seth] [laughs] I’ve never seen a meaner comment than I have seen from a discussant at NBER Summer Institute. [laughs] 00:41:37,720 --> 00:41:52,570 [Seb Krier] [laughs] The Progress Conference, for example, last year, was one that I thought was really good. That was at Lighthaven, in fact. I think the setup and the kind of people and the curation and so just made it something that I found quite engaging. [upbeat music] 00:41:52,570 --> 00:41:56,490 [Seth] So you brought up this idea, as we were talking, about you 00:41:58,330 --> 00:42:21,049 [Seth] think there are so many meetings in your organization because it’s so hard, yet so critical to transfer information. And there’s this Coasean idea that so much of why the economy works the way it does is just the idea of transaction costs, right? In addition to kind of this Hayekian idea of local information that’s hard to share. 00:42:21,050 --> 00:42:21,810 [Seb Krier] Mm-hmm. 00:42:21,810 --> 00:42:23,960 [Seth] You have a very influential essay 00:42:25,130 --> 00:42:30,230 [Seth] that kind of maybe stole some of Andrey’s thunder, but is still an excellent essay- 00:42:30,230 --> 00:42:31,040 [Seb Krier] [laughs] 00:42:31,040 --> 00:42:46,210 [Seth] ... about this idea of, well, what happens when AIs go out there and can micro-bargain costlessly with each other at high frequency over very, what might seem to us, small issues. 00:42:47,570 --> 00:42:57,440 [Seth] Tell us maybe in a few sentences, what’s that vision and what’s the positive vision for why that would be good for society, for us to have AI agents constantly bargaining for us over stuff? 00:42:59,130 --> 00:43:01,810 [Seb Krier] Yeah. I guess the idea is, as you mentioned, there’s all sorts of 00:43:03,990 --> 00:43:26,350 [Seb Krier] transaction costs that mean that we don’t get to bargain on things that we would otherwise bargain for. And instead, you get these blunt rules and these solutions that kind of work, but come with all sorts of externalities or aren’t super efficient. And so the idea is, if you can actually do this kind of negotiation at scale for very little, and that’s a big assumption. That’s not a given either, 00:43:27,850 --> 00:43:35,586 [Seb Krier] then you could solve all sorts of things thatAnd also just kind of problems that would otherwise not be even conceivable in the first place. 00:43:36,726 --> 00:43:41,186 [Seth] One example you give, just so we can be a little bit more specific, is noise standards, right? 00:43:41,186 --> 00:43:41,456 [Seb Krier] Right. 00:43:41,456 --> 00:43:57,226 [Seth] So you can’t throw a loud party after 10:00 PM in such and such a place. But you think that maybe AI agents could come to a less coarse rule that is, get us more to the grand coalition of allocative efficiency than a coarse rule like that. 00:43:57,226 --> 00:44:01,166 [Seb Krier] Yeah. To be fair, that’s probably a problem that no one really cares about except me because of like- [chuckles] 00:44:01,166 --> 00:44:02,086 [Seth] No. Dude. 00:44:02,086 --> 00:44:03,645 [Andrey] I care about it so much. 00:44:03,645 --> 00:44:04,626 [Seb Krier] Oh, really? Okay, cool. 00:44:04,626 --> 00:44:04,746 [Andrey] Yes. 00:44:04,746 --> 00:44:07,816 [Seb Krier] Maybe that’s a good example then. But yeah, the idea here is, 00:44:09,146 --> 00:44:17,006 [Seb Krier] my neighbor is throwing a party, and instead of there being some sort of rule that says you’re not allowed to throw parties after 11:00, he could maybe just compensate me for the noise or something. 00:44:18,326 --> 00:44:21,686 [Seb Krier] Or in fact, that’s one of the key crux of the whole Coasean thing is maybe 00:44:24,186 --> 00:44:36,085 [Seb Krier] I have to compensate him to stop his parties. And it kind of depends where the initial right is. But broadly, you could have these kind of, my whole neighborhood doesn’t want me to party, and they’re just giving me a small payment or the reverse, depending on where the initial allocation is. 00:44:37,226 --> 00:44:44,446 [Seb Krier] But I think you could have all sorts of micro ways in which these transaction costs at scale help you get much better beneficial outcomes. 00:44:45,486 --> 00:44:48,486 [Seb Krier] And so that would be the noise one would be like, okay. 00:44:50,406 --> 00:45:18,666 [Seb Krier] And it’ll probably just also let people kind of regroup into the party people just going into the neighborhood where that’s just generally more party tolerant or something, and the kind of peace and quiet preferring people just... Because I think one of the points with the piece was that AI also helps you coordinate better. You can use this stuff to find people who have the same interests and preferences as you or something, and just then bargain or negotiate or whatnot in that way as well. 00:45:20,626 --> 00:45:27,386 [Seth] So it’s not just bargaining over externalities that are negative, it’s maybe coordinating over positive externalities, right? 00:45:27,386 --> 00:45:27,526 [Seb Krier] Yeah. 00:45:28,766 --> 00:45:51,746 [Seth] What pieces do we need in the economy to make this a reality, and what time horizon are you thinking about? So obviously this is an idea that you could have a small version of, and then like the sci-fi, this is constantly, I’m allowed to speed in my car today because I really need to get to work because I’m late, and it’s bargaining with all the cars on the highway at ultra-high frequency. So what are the time horizons you have in mind, and what pieces do we need? 00:45:51,746 --> 00:46:21,786 [Seb Krier] Honestly, I haven’t even thought about the timelines really. [laughing] For me, this was mostly kind of an aspirational thing of like, well, it looks like we could unlock some cool things, and because there’s all these-- It’d be nice to have a positive vision of how things might pan out. It certainly doesn’t mean that everything has to be negotiated and bargained over. But I could see a large proportion of things, certainly in everyday life, like I could just tell my aunt, “You don’t have to worry about your parking issues anymore. It’s just sorted now,” whatever. The agents are taking care of that. And so it kind of depends on what scale you’re talking about. Certainly having democracy at scale and 00:46:23,626 --> 00:46:29,086 [Seb Krier] half automated and half made more efficient through these systems or something is something that I think is going to take a long time. 00:46:30,426 --> 00:46:47,986 [Seb

19 mei 20261 h 23 min

Avi Goldfarb on Prediction Machines, O-Ring Tasks, and How AI is Reshaping Economics

This week, we’re joined by Avi Goldfarb, one of the leading economists of artificial intelligence and co-author of Prediction Machines [https://www.google.com/search?sca_esv=bc87673d3ad1280f&rlz=1C1GCEA_enUS1209US1209&sxsrf=ANbL-n4AnrHPqrHiXM4Cb3oXCBXAennzbw:1777914708243&q=Prediction+Machines:+The+Simple+Economics+of+Artificial+Intelligence&stick=H4sIAAAAAAAAAONgFuLVT9c3NEwzqCw0q8wrU4Jw003S0pMLsnK1pLKTrfST8vOz9RNLSzLyi6xA7GKF_LycykWsLgFFqSmZySWZ-XkKvonJGZl5qcVWCiEZqQrBmbkFOakKrsn5efm5mclADWkKjkUlmWmZyZmJOQqeeSWpOTmZ6al5yakAebQ6E4MAAAA&sa=X&ved=2ahUKEwjFtIC1kKCUAxWiJkQIHRQiDEoQ9OUBegQIDRAD&biw=2183&bih=1080&dpr=1.75]. Avi has been thinking seriously about AI economics long before the ChatGPT shock, so we asked him what he thinks the earlier framework got right, what it missed, and how economists should update their beliefs now. The conversation starts with Avi’s seminal book, Prediction Machines, and the idea that AI is best understood as a drop in the cost of prediction, which is a complement to judgement. We ask what that book got right and what it got wrong. From there, we interrogate Avi on the murky boundary between prediction and judgment. We had investigated the idea that maybe judgment and prediction were not as separable as economists like to believe in our episode with Alex Imas [https://empiricrafting.substack.com/p/alex-imas-demand-collapse-bargaining]. We also ask whether, if AI gets better at predicting human judgment, whether judgment disappears, or do humans simply “move up the stack”? And what is taste exactly? Avi says that sometimes judgment becomes predictable, but humans still matter because goals, values, organizational politics, and “what matters” are often implicit, unstable, and hard to codify. Avi shoots down Seth’s galaxy-brain suggestion that correct ontology choice — i.e., deciding what sort of natural kind [https://en.wikipedia.org/wiki/Natural_kind] a thing is, or understanding when a problem is out of context [https://theculture.fandom.com/wiki/Outside_Context_Problem] — is a uniquely separate skill (taste?), calling it just another prediction error. But he does concede that deciding how much to prepare for ‘Black Swan’ events may be an enduring role for judgment. We then revisit the O-ring theory of production and what it means for automation. We had covered Kremer’s article in a recent episode (see here [https://empiricrafting.substack.com/p/weak-links-strong-predictions-kremers]) and asked Avi about his new paper, riffing on the idea at the worker level [https://www.nber.org/papers/w34639]. Avi says that if tasks inside jobs are complements rather than substitutes, then automating one task may make the remaining human tasks more valuable, not less. Avi explains why workers may reallocate attention toward the tasks machines cannot yet perform (shooting down Seth’s suggestion that this is actually difficult in most jobs). The discussion also covers whether AI will augment or replace workers, whether governments should try to steer AI toward human-complementing technologies, and why that distinction may be much harder to define in practice than it sounds. Avi agrees with Andrey and Seth’s pushback on “augmentation good, automation bad” framings (e.g. friend of the show Erik Brynjolfsson’s “Turing Trap [https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/]”). Then we get into forecasts: how fast AI capabilities might advance by 2030, what that means for GDP growth by 2050, whether GDP is still the right thing to forecast, and why even very powerful AI may run into bottlenecks in the real economy. We use the paper Forecasting the Economic Effects of AI [http://Forecasting the Economic Effects of AI] to ground the discussion. We close with lightning-round topics including AI’s impact on centralization, privacy/de-anonymization, peer review, and whether academic journals still serve the function they once did. Papers, books, and ideas mentioned * Avi Goldfarb’s seminal book with Ajay Agrawal, and Joshua Gans — Prediction Machines [https://www.google.com/search?sca_esv=bc87673d3ad1280f&rlz=1C1GCEA_enUS1209US1209&sxsrf=ANbL-n4AnrHPqrHiXM4Cb3oXCBXAennzbw:1777914708243&q=Prediction+Machines:+The+Simple+Economics+of+Artificial+Intelligence&stick=H4sIAAAAAAAAAONgFuLVT9c3NEwzqCw0q8wrU4Jw003S0pMLsnK1pLKTrfST8vOz9RNLSzLyi6xA7GKF_LycykWsLgFFqSmZySWZ-XkKvonJGZl5qcVWCiEZqQrBmbkFOakKrsn5efm5mclADWkKjkUlmWmZyZmJOQqeeSWpOTmZ6al5yakAebQ6E4MAAAA&sa=X&ved=2ahUKEwjFtIC1kKCUAxWiJkQIHRQiDEoQ9OUBegQIDRAD&biw=2183&bih=1080&dpr=1.75#] * A black swan is the occurrence of a wildly unpredictable event, which Nassim Taleb argues, in his book by the same name [https://en.wikipedia.org/wiki/The_Black_Swan:_The_Impact_of_the_Highly_Improbable], is more common than we like to think * A New Riddle of Induction [https://en.wikipedia.org/wiki/New_riddle_of_induction] — by Nelson Goodman — is the source of Seth’s thought experiment about “bleen”, a color which is green until 2029 and blue after, and green * Michael Kremer — “The O-Ring Theory of Economic Development”, covered in this episode of the pod: * Daron Acemoglu and Pascual Restrepo’s task-based models of automation, especially “The Race Between Man and Machine [https://www.aeaweb.org/articles?id=10.1257/aer.20160696].” * Avi mentions David Autor and Ben Thompson on automation and skill scarcity when Seth comments that you may not be able to reallocate effort between tasks as a worker, including their paper “Expertise [https://www.nber.org/papers/w33941]” * Erik Brynjolfsson in the “Turing Trap [https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/]” argues that automation technologies are less good than augmenting technology * Eric Topol’s book on AI in medicine — Deep Medicine [https://www.amazon.com/Deep-Medicine-Artificial-Intelligence-Healthcare/dp/1541644638] * John Markoff — Machines of Loving Grace [https://www.amazon.com/Machines-Loving-Grace-Common-Between/dp/0062266683] — The source of a title for an influential essay of the same name [https://www.darioamodei.com/essay/machines-of-loving-grace] by Dario of Anthropic. Both draw from an earlier poem about a Sci Fi utopia: https://allpoetry.com/All-Watched-Over-By-Machines-Of-Loving-Grace * Korinek and Stiglitz on AI, capital, and taxation; Lockwood and Korinek on optimal taxation and automation — We covered these topics at the end of our episode with Basil Halperin in the context of “Tax Policy at the End of History” around the 1:19:00 mark * We talk about de-anonymization, and Avi references this provocative paper [https://arxiv.org/abs/2409.15948] from Florian Ederer * Avi brings up Bob Gordon, and his argument, famously in the book The Rise and Fall of American Growth [https://www.amazon.com/Rise-Fall-American-Growth-Princeton/dp/0691147728], that the early 20th century was incredibly important for increases in US living standards, which digital technologies have not lived up to * Digital Hermits [https://www.nber.org/papers/w30920], by Jeanine Miklós-Thal, Avi Goldfarb, Avery M. Haviv & Catherine Tucker, is a paper by Avi thinking about how information spillovers, now from AI, drive some people to be more private than they would otherwise be. In our conversation, we speculate AI will make these hermits even more “hermetic” * We discuss this paper on new forecasts of AI and its impact on economic growth: Forecasting the Economic Effects of A [http://Forecasting the Economic Effects of AI]I * Refine and AI-assisted peer review are discussed in this pod. For more, see our episode with Ben Golub, founder of Refine [https://empiricrafting.substack.com/p/ben-golub-ai-referees-social-learning]. This episode is sponsored by Revelio Labs [https://www.reveliolabs.com/] — a great source of labor economics data for academics and firms. Now available on WRDS. Join our Discord community at this link: https://discord.gg/w3GSapx2d Transcript Introduction [00:00] Seth: Welcome to the Justified Posteriors podcast, the podcast that updates beliefs about the economics of AI and technology. I’m Seth Benzell, your loyal non-fiction machine, coming to you from Chapman University in sunny Southern California. Andrey: And I’m Andrey Fradkin, coming to you from San Francisco, California. And we are very happy that Justified Posteriors is sponsored by the fine folks at Revelio Labs. And we’re very delighted to have Avi Goldfarb, who is a leading thinker in the field of AI economics and has also been a personal mentor on the show. We’re very excited to hear his thoughts on a variety of topics. Welcome, Avi. Avi: Thanks so much and thanks for having me on the show and looking forward to it. Andrey: All right, let’s get started. I have in front of me this book that you might remember writing at some point. Seth: Gaze into the soul of the man in the bookstore. What Did Prediction Machines Get Wrong? [01:12] Andrey: Now, I just think it’s a good cover. And I had to check: when was it released? It was released in 2018. And as I was skimming through it, you know, a lot of interesting points made there are still things that we’re talking about today, almost 10 years after it was released. So let me start off with the following question. And then maybe we can work backwards more into the ideas in the book. But what do you think prediction machines got wrong? Avi: I think prediction may... I’ll start with a hard question. Seth: No softballs on Justified Posteriors. Avi: So on the specifics of which industries and when, to the extent we tried, at least I did not anticipate how quickly language and coding would become prediction problems. And when we talk about disruption and industry disruption, a lot of the examples are things like driving, and we talk about radiology. And we still have plenty of radiologists around. Self-driving cars and trucks. seem like they’re now imminent, but it certainly took a lot longer than we expected back in 2018. Andrey: So is it a fair assessment to say that the large language models, even in 2018, weren’t on your radar? I guess they weren’t on many people’s radar. The Three Ideas of Prediction Machines [02:45] Avi: Not really. We have some discussion of machine translation. So that’s in there as a huge potential use case, but the arrival of ChatGPT and how it sort of changed how we interact with machines and how we think about AI was not really there. Another way to put it is prediction machines had three ideas. So idea number one is AI can be framed as a drop in the cost of prediction. So prediction. As in filling in missing information, statistical prediction is getting better, faster and cheaper. Idea number two is that when something gets cheap, you start using it for unanticipated uses. So when arithmetic got cheap, it wasn’t just that we use computers for accounting. We started to use computers for all sorts of things that we never used to think of as arithmetic problems like imaging and mail and music. And then idea number three is what are the complements to machine prediction? And we talked about data and judgment. The book, and certainly our attention to the book in the first three or four years after it was published, was on idea number one and idea number three. So identify prediction problems in your organization, and then think about what data you need to make those predictions better, and try to understand what matters to you in terms of judgment. And that second point kind of got lost. But in the last four years, it’s become clear to me is that that second point was maybe the biggest one, which is this tool, which still under the hood is computational statistics, enables us to find all sorts of applications for computational stats that we didn’t really imagine before. Judgment and data are still gonna be useful, but that phase one, that step one, that first idea of identifying prediction problems, that’s not really how we think about using AI today. And in some sense, that... was a missing emphasis throughout the book and throughout how we thought about that book, or at least how I thought about that book for the first few years. Does Proprietary Data Still Matter? [04:59] Andrey: Very interesting. You mentioned one kind of underlying idea there, whereas you should identify the data that’s going to make your predictions better. Do you think to what extent is that now true, given that your foundation models seemingly can be very smart without having any proprietary data? Avi: Data is still central to the use of AI, the building of the models. In building a foundation model that, at least in the pre-training stage, that data is essentially interchangeable. You just need more. It doesn’t really matter what. To build a structure of language, and then you can move from there. On later stages of using that model, at least the AI companies seem to think data is valuable to the model companies. And then in terms of use cases within organizations, that’s more a matter of whether you want to delegate sort of the judgment of how to use the model and what the model should output to the vendor or whether it’s something that you need to build in-house. And depending on the organization, some of them are very happy to delegate to the foundation model provider and some of them think they need to fine tune in-house. Andrey: Well, so there are kind of two little sub ideas in there. One is you have choice. You can fine tune a worse model with your own data. And maybe that will outperform as a frontier model. I think for many cases so far, that’s been a bad bet. But there’s a different idea here. Use whatever model you want, but you design the evaluation. And then you optimize via the prompting strategy or scaffolding towards that. that benchmark for your own use case. Is designing a benchmark proprietary? Should we think of that as a proprietary data that an organization has? Seth: Is that the judgment part in the judgment prediction distinction? Vendor Choice as Delegated Judgment [07:01] Avi: Yeah, I think there’s a bunch of judgment. there’s judgment number one: which which vendor do you use? Because you’re delegating a lot of values as in like, knowing what matters to the maker of the model. And then there is judgment in how heavy-handed do you want to be to make the outputs fit your needs? And then there’s judgment on, okay, you’ve decided to be heavy-handed. What exactly does that mean? And is it, guardrails or is it really making sure that the output from the prompts every time fits your organization’s values or what matters to you? Andrey: Have you had an opportunity to kind of advise companies on this judgment decision? Like what has your experience been in these situations? Avi: At a high level, yes. I don’t want to exaggerate my experience, but the things I emphasize and the things that seem to resonate are, one, what I just said, which is recognizing when you choose a vendor, you are delegating your understanding of what matters to that vendor. And then two, that means before you start thinking about choosing a vendor, you need to know what matters to you. So think through, you know, before you go talk to somebody, you should know what your KPIs are and what outcomes you want to see. Because otherwise, once you talk to them, they’ll convince you that their outcomes are the ones you want to see. and so it’s this, I talked to, someone who is running an AI at a... Let’s call it a big healthcare organization. And his job used to be, like five years ago, his job was building tools. He’s like, my job isn’t building tools anymore. There are all sorts of vendors building AI tools for healthcare. Okay. And what my job is now is every week, 20 or more people come in and say, I have a solution for you. And he chooses one or two of them. Seth: Kind of seems like a good job for an AI. Avi: Well, maybe, maybe not. But he understands the individuals, the people, guess, in theory that could happen, but the individuals in his organization, what they’re willing to accept, what they don’t. Which decisions they like to have control over, which ones they’re comfortable delegating. For the ones they like to have control over, he has a sense of what might be negotiable and what might not be. He knows where the power structures are and what things might change. Therefore face resistance from people who have the power to resist. He knows those things that might not face resistance from people because the people don’t have power to resist, but they’re going to be really, really unhappy about it. It’s going to bad for the organization. And so there’s all these things that I guess in principle an AI could do, but we’re a long way away, I think, from that. Can Prediction Eat Judgment? [10:16] Seth: So let me let me just push down that line a little bit longer is the way to think about this sort of prediction and judgment distinction is is that like as the models get better the Prediction is like eating more and more of the stack right? You know we give the information about our organizational structure to the AI and then maybe it can make a couple more of these decisions for us And you could either imagine that asymptoting to, you know, in 20 years, AI does everything, or you could imagine there are higher and higher levels of judgment that humans keep on getting promoted to. Are one of those two ways the way that you think about it? Avi: Yes, Andrea Pratt has a note in our first Economics of AI volume that covers that exact idea. I think actually it’s a comment on our paper or the model behind the Prediction Machines book. it’s, well, in principle, with enough data, you can learn to predict judgment. And so you move up the stack. So absolutely. There are some limits to that. There’s limits on you may never get enough data. on that kind of judgment. Judgment can change over time. To the extent that ultimately you’re trying to predict your tastes, then they can change over time. And there’s some limits on causal inference and the impossibility of seeing the counterfactual, which creates a need for a model. Andrey: But humans have that problem too. Avi: Yeah, yeah, yeah, no, I agree. But in the need for a model. So then the question is, well, how come LLMs and some of these models seem to be pretty good at doing that? And in the process of prediction, I suspect -- though I don’t know rigorous work on this, so I’m being cautious -- Seth: That’s what this podcast is for. Avi: this is building some kind of model of the world that is embedded in the training data, like the language. Taste, Values, and Human Wants [12:16] Seth: So let’s go back to the one of the examples you gave, which is this idea of taste, right? Because I’ve had so many conversations with other economists about this idea that, well, taste will save us as a scientist, right? Because the AI won’t have taste. I have some ideas about what taste might mean, but can you be a little bit more precise about what you think taste means and why it’s something worth saving? Avi: So, okay, let’s operate under the assumption that whatever we want to call the machines, their goals are to help humans. Okay, not all humans. And we can debate about which humans, but like ultimately. Seth: Well, the Anthropic Constitution says, you know, safety first, the idealized anthropic researcher, then the guy that then then like virtue and then like the customer in some order like that. Avi: I’m gonna, all that matters for the point I’m about to make is that it’s not about the machine’s needs. So in that case, at the very limit, humans have wants and needs and those wants and needs, the machines need us, our judgment to know what our wants and needs are. Seth: So taste literally as in, this tastes good to me, I want more of this food. Avi: That would be one specific example of it. Absolutely. Okay. Now, I think we’re a long way from that limit, but that’s what I would argue the limit is. Seth: That’s the Bailey, right? So now let’s go out to the motte. Avi: So then it’s more like, okay, what matters to a set of humans, a group, an organization? What can we codify? If you can codify it and say, like, this is your goal, you’re not quite at that limit, but pretty close to it, then the machines can try to optimize on a goal. Goals have so much that are implicit. And so the machine would have to be able to infer the implicit part. Maybe it can, maybe it can’t, I don’t know. And then you can sort of ratchet back all the way to where we are now, which is you still need to tell your agent what you want. You still need to check on it every once in a while and guide it in the right direction. Prompting still has a role. Ontology, Umbrellas, and Context Shifts [14:45] Seth: Here’s another way of thinking about taste. And I’m curious whether you think this is in one of the categories you already listed or a new idea or you wouldn’t call this taste, which has to do something like with the idea of your ontology that is kind of built into the system, right? It’s your way of sort of dividing the world up into parts and maybe a good tastemaker or a good judger might have a more refined or more adaptable ontology. than the prediction machine. So I’ll give you an example of what I mean. have a couple of examples in mind, but one example I have is, you know, historically in the data, it’s always been the case that if lots of people show up with umbrellas, it means that you can predict that it’s raining. But then we have these Hong Kong protests and in the Hong Kong protests, they’re the umbrella protests and people bring umbrellas to show that they’re protesting, right? And it seems like a human would do better at adapting to like the completely new context for why you would need umbrellas than, you know, a pre-trained system that was only on historical data. So you can say that that’s like a context switch problem. Is that one of your ideas of taste or is that more of a judgment that’s not a taste? Avi: Honestly, that seems like a prediction failure to me. Seth: Right. That’s just we don’t have data on the context that we’ve moved to. The job is to understand when the context has changed, maybe. Avi: The judgment, I would say the judgment is like, what’s the consequential decision that’s going to be a function of, look outside and I see a lot of people in umbrellas. Yeah. What am going to do? And. Seth: You know, I should water my plants. Should I water my plants? Avi: No, I water my plants. Okay. So I look outside, a lot of people are carrying umbrellas and I think, no, I don’t need to water my plants. Okay. And then it turns out it’s a protest. It’s a little bit of weird context, but going with your example. Seth: It’s gotta be a weird context. That’s the reason that the AI is going to make the wrong decision because it’s out of context. Avi: the, the automated sprinkler doesn’t go on and, my plants die. Right. Okay. So, the judgment is, is it then worth it for me to invest more either in my prediction technology or to actually go outside and look and to see if there’s rain, to overcome that downside. So what you described as an error in prediction, there’s ways to reduce that error in prediction. The judgment is whether it’s worth the bother to reduce that error in prediction or to create some kind of insurance system where you would say, you know what, I’m gonna water the sprinklers. I’m just gonna run the sprinklers anyway. That’s how I think about judgment. It’s sort of what goes wrong when your prediction fails or it’s one important aspect of judgment. Seth: Sorry, can I give you an even more abstract? Andrey: Wait, wait, wait. No. I actually disagree with the premise of the example in many ways. I think a reasoning model would be able to handle the situation, especially with internet access, substantially better than many humans already, because you can call an API to get the weather forecast if you’re unsure. You can read the news. You can use reasoning traces. There’s this kind of implicit assumption in your question that like, we’re just using a raw pre-trained model and like asking it to like, if you, like, if you had a gun to your head, what would you do? You know, and not use any reasoning. Seth: Okay, but I can tell you a story, right? The weather API was always reliable in the data, but now there’s been a government takeover and I don’t trust the new government and you shouldn’t trust the API weather data anymore, right? Avi: So Andrey, I actually agree with, like, that seems unrealistic, but I think the idea is what you’re describing is how many resources you wanna put toward making it right, and I would view that as judgment. Andrey: But I guess the model has that judgment, maybe. Already. Already. Yeah, that’s kind of goes out like the stack of when judgment problems become prediction problems, I guess. Avi: But then there’s going to be... well, there’s going to be some places where the model is imperfect. Okay. Yes. Still a prediction tool. It might be better than human. Actually, it doesn’t matter if it’s better than human. But to the extent the model is imperfect, how do you want to behave? Like, let’s say the model is right 99.99 % of the time. Does your behavior change at that versus 99.9999 % of the time, even if the human benchmark is 50? And that ultimately is going to is going to be essential to judgment. We do this with self-driving cars. The models aren’t perfect, but they’re better than human. And yet, I still drove to work today, partly because that’s the law in Canada. Andrey: Do you think there’s hope? I mean, maybe this is kind of too much in the weeds versus the abstract idea, but sometimes people implicitly assume that they’re anchoring on the current technology where there’s an instance of an LMM that does something. But we might be able to design systems of LLMs that are interacting with each other to cover some of these. shortcomings that we can think of. I mean, at a conceptual level, maybe it’s the same thing anyway... Avi: So maybe another way to think through these trade-offs is to talk about whose judgment, okay? Which is Seth’s example was about, or my example was about my judgment, know, the individual’s judgment and should they listen or not. Andre, I think what you’re describing is the model builder’s judgment on which things is it worth investing in making the model better and when is it okay not? Like they have choices on sort of rate and direction. And those require some understanding of what they think is going to matter in terms of the use cases, the model. And on that, yes, there is a limit where a small number of players have extraordinary power because AI scales their judgment because they embedded into the models. But I do think. then there is still a human or set of humans responsible. It’s not like, the AI did it. It’s humans making those kinds of decisions. And I understand, like, at the limit, that actually gets quite nuanced, especially once we have models with continuous learning. But that’s how I think about that problem. Grue, Bleen, and Black Swans [21:41] Seth: All right Andre, can I ask my riddle of induction question? Andrey: Do you need me to induce it? Seth: You already know where I’m going with this. I’m curious if Avi knows where I’m going with this, but this goes back to the question of maybe where taste comes in is having a better or a more human ontology than the machine. All right. Have you ever heard of grue and bleen, Avi? These are colors that are different than blue and green. No? Okay, awesome. So briefly, we have this conceptual category, which is a thing that’s green. And a thing that’s green, we think that if you don’t do anything to it, it should be green indefinitely, right? Avi: Okay, yeah. Seth: All right. There’s this other thing that’s called bleen and things that are bleen are green until the year 2029. And after 2029, they turn blue. Right. Here’s the issue is that bleen and green things are observationally identical until 2029. Right. Yeah. So an inhuman, bad at forming natural kinds, ontology of an AI might decide that something is bleen instead of thinking it’s green. Right? And a human’s role might be to say, no, that’s a bad definition of a natural kind. That’s a bad ontology. And that would be a role of either taste or judgment. Do you buy that? Is this way too abstract? Avi: I think what you’re describing is a failure of prediction. I don’t think that’s taste or judgment. The taste or judgment is if you or a machine aren’t sure if something is bleen or green, do you care? Seth: Okay. Well here’s the thing, you didn’t even have the concept of bleen until I told you about bleen, right? Avi: So this is just the difference, I think, between known unknowns and unknown unknowns. So in Prediction Machines, we have a whole chapter framed on Rumsfeld and his discussion of known unknowns and unknown unknowns. Look, sometimes you don’t have a prior on it, and it’s an unknown unknown. That doesn’t mean that it’s not a prediction failure. It was just off the support of your data, and you didn’t know what to do about it. And I think that happens all the time. Seth: Sometimes you find a black swan. Avi: Yes, exactly. And so like, there might be places where humans are better at that kind of prediction than machines. There might be places where both humans and machines are really awful at that kind of prediction. And if that’s the case, then you want to have robust systems to anticipate those kinds of things. And that’s where judgment comes in. Like, if you’re wrong about the existence of a black swan, you know, does that change anybody’s behavior? I think the answer is no, because black swans and white swans aren’t actually that different from each other. But if there were other examples, like financial crises, where he uses the metaphor of the black swan, then absolutely there are meaningful differences. And you should Andrey: Financial crises. Seth: All right, so you’re saying that jobs that will survive TAI number 7 should be Black Swan, anticipator. Andrey: Not an anticipator. Actually Seth, this is actually kind of the key point. The point is, anticipator of whether Black Swan affects your utility enough that you should plan for it. O-Ring Complementarities and Automation [25:22] Andrey: I think next it will be awesome to talk about automation and some O-rings. Actually, the previous episode we did, we reread Michael Kremer’s classic O-ring paper because it’s been so inspirational for so many. It’s a great paper. They don’t write them like this anymore. Seth: It’s so fun to read. They don’t like to do macro like that anymore, unfortunately. Andrey: So we were wondering, so you have your own spin on the O-Ring paper. Maybe you’ll tell, you can tell us a little bit about that. Avi: Paper makes a pretty simple point. There may be two simple points. First one is that when you think about tasks within a job, they’re not interchangeable and substitutable. So it’s not just like, okay, a machine comes in and takes tasks. Sometimes tasks are complements. Now that isn’t, I’m gonna a little cautious. We talk about that in our O-Ring automation paper. It’s not necessarily a new idea. It’s implicit in the constant elasticity models. you can have a Leontief production function. Seth: We’re talking about the Daron-style task-based models. But if you actually read the papers everything immediately goes Cobb-Douglas. It’s always immediately weird. All the tasks are substitutes and then Cobb-Douglas over all the tasks. Avi: Yes, but it’s possible to, within the canonical model, to have that. So our point number one is tasks can be complements. And I just wanted to be cautious because I don’t want to claim that that’s necessarily our idea. But it’s an emphasis maybe that the existing literature hasn’t had. And then the second is, well, once you have tasks that are complements, if a machine starts doing some of those tasks, human can move their attention to the other tasks that are not yet automated. And when that happens, the human gets better at those tasks, which then makes automation of those remaining tasks even harder because the machine has to be better than now the human who’s spending all of their time focused on the remaining few tasks. Skills Versus Tasks [27:40] Seth: So let’s pause right there because I have a couple of questions right there immediately. So one way to think about automating part of your job is you’ve automated part of your job and now I can reallocate to the stuff that’s not automated. also another way to think about tasks within a job that are complementary is to think about them as sort of like innate skills or abilities. So think about the job of being a basketball player. The job of being a basketball player involves being tall and being agile. If you somehow automated being tall, I can’t reallocate my skill points into being agile, right? If we think about my performance as more as a combination of my skills, then automating part of it or taking part of it away, it’s not necessarily obvious to me that I can get better at the thing that’s not automated. Avi: The way we, okay, so first the way the literature usually thinks about jobs is generally at the task level, not the skill level. Okay. So a worker does a bunch of tasks. Okay. Those tasks require skills, but the worker does a bunch of tasks and the A machine comes along and can do the task and not the skill. So I’m not sure what it means for a machine to be tall. What it means for a machine to slam down. Seth: Well, let’s think about being a doctor. Let’s assume you might imagine being a doctor involves bedside manner and judgment about and diagnosis right it’s not clear to me that if you automate my diagnosis I can reallocate more effort into bedside manner some people are just level five at that and some people are level one at that AI Doctors and the Future of Medical Work [29:25] Avi: It is obvious to me that there’s a bunch of tasks in a doctor’s workflow. Some of them involve diagnosis. Some of them involve talking to patients and making the patients feel better. And within those, there are skills in being good at filling in the missing information of what’s wrong with the patient and skills of making the patient feel comfortable. And actually, for some of those tasks, you might even need both. A machine comes along and automates the diagnosis skills. Okay. That means medical professionals are going to be spending more time on the other skills. This is actually an Eric Topol’s deep medicine book. I’m not sure if you’ve read it. It’s, it’s like a pre-ChatGPT, but like how AI might transform medicine. And that is his core thesis. The idea is that AI is going to make healthcare human again, because doctors are going to spend less time looking at screens and focused on diagnosis and more time. interacting with patients and making patients feel better. So in that sense, we get the automation of the diagnosis task and some of the computer tasks that should exactly lead to reallocation toward the human part. But then you brought up something else, which is, do our current doctors, if they spend that much more time interacting with patients, are they the right people for this job? Or alternatively, could we have a different set of medical professionals who we could train because now the machine can do some of those tasks who would be way better than our current doctors at the remaining tasks? I suspect if the machines get good enough at diagnosis and identifying appropriate treatments, there is an enormous opportunity for a new kind of medical professional who is focused on essentially interacting with patients. Seth: Yeah, so you’re making the occupational reorganization point and that’s that’s obviously essential and we’re going come back to that in the second. Yeah, I just I’m just pointing out that maybe maybe my example of basketball wasn’t so good. Maybe my medical example wasn’t so good. But I bet you I could pick out some domains where the elasticity of task output to effort is very inelastic. Avi: Okay, trying to think. You’ve switched from skills to task and that makes me much, much happier. Seth: Well, I mean, you would only need to worry about skills is if you were inelastic to effort, right? Then it’s just the skill. Rare Skills, Common Skills, and Wages [32:04] Avi: So there’s the new Autor and Thompson paper on automation, which I think gets at some of the things you’re talking about, which is if the things the machine does are relatively rare skills, like are tasks that involve relatively rare skills, to be precise, then what happens is we get entry into that profession. More people can do it and very likely wages go down. And if the machine things that the machine does are things that many people can do, they require less specialized skill, then the remaining humans in that job will, there’ll be fewer of them and they’ll likely be higher paid. Seth: Right, think that’s right, but I think maybe a missing component here is within the job already, what is the correlation in abilities between people who are good at the automatable and non- automatable part of the task, right? Avi: Yeah, but I think that’s the statement about that. Like in the short run, we’ll get the Autor and Thompson results. And in the long run, we’ll get a reallocation of jobs, right? There’s a system of professions and the system of professions will change. Are Tasks More Complementary Than Cobb-Douglas? [33:23] Seth: In the long run, you get the reorganization of jobs. Maybe one other thing I want to talk about before we get into reorganization of jobs is just this question about, tasks more complimentary or less complimentary than Cobb Douglas? Do you have a sense of that with tasks within a job? I mean, it seems like would vary a lot, a lot from occupation to occupation. I think we all have this intuition that they should have some kind of complementarity. That’s why they’re a job in the first place. That’s why they’re bundled. But you might bundle them and they still might just be, you know, gross substitutes that have a little bit of complementarity. Avi: I suspect there’s a lot of heterogeneity across jobs and I don’t think we have good data on that yet because sometimes we haven’t been looking because our model is substitute model and so our papers are fundamentally focused on the substitute. Seth: And I think this is an example of somehow the theory is sometimes a little bit downstream of the data, right? We just have so little data on people reallocating effort across tasks within a job that of course it makes sense to aggregate up to just add up all of the tasks done by all of the workers. That’s kind of, that’s my guess of why Acemoglu gets there. Avi: So of the task papers, the Eloundou et al., Dan Rock’s paper, is incredibly careful on every page. Seth: This is not an automation measure. Do not use this to measure automation. Avi: This could be a complement, it could be a substitute. These are just jobs that change. So like kudos to them, the four of them for being super, super careful. Nevertheless, when that paper is cited both in the academic literature and in the press, that idea seems to get lost. I’m not exactly sure why, maybe that’s because of the model. Seth: Question people want to answer, right? The people don’t want to know what job’s going to change. People want to know what job should I get, right? And so... Avi: Well, okay, but if it’s a question people want to answer, then the complements matter just as much as the substitute. I wonder if the answer that people want to know, like the answer that people want, and then they just... Andrey: I actually think it’s I think take has always been that just most people are pretty, they’re very sophisticated users of this data, but a lot of people don’t have a sophisticated economics model. And therefore to them, it’s just obvious that what’s going to happen is the machines are going to take our jobs. As a result, that’s just, they don’t have a more nuanced model of economic activity and therefore that’s how they interpret it. Now there are more sophisticated readers, think, we know some of them, where they’re just really just think that AI is going to be able to do everything in a very short period of time and then it all kind of becomes moot. You know, if you think that every single task can be done by an AI. Why the Impact of AI Was Ambiguous in Earlier Work [36:15] Seth: Yeah. Well, I guess this kind of brings us to your 2019 Journal of Economics paper, which is about where you guys kind of where you kind of throw your hands up. That’s not that’s a positive part and say there’s an ambiguous impact. So I guess I want to push you there on is the ambiguous impact because. We just don’t know all of the relevant elasticities, right? We need to know the elasticity within tasks within a job. We need to know elasticity across jobs within an organization, the elasticity across sectors of demand. And if we could put all of those together, we would be able to answer the question. Or is it more ambiguous than even that? Avi: No, I think you need to understand when that paper was written in order to understand the paper, which is in 2019 or late 2018 when we were writing it, we had no concept of anything but a task- based model with substitutes. Okay, maybe that was on us. We should have. But Acemoglu and Otter and Rastrepo were the dominant- Paradigm. ... working in literature, especially Acemoglu. Seth: Are you saying our ontology was limited? Avi: I’m not exactly sure what you mean by that, but... Andrey: You forgot about the O-ring which was the black swan of papers. Avi: Yeah, yeah. So like, we did. Seth: I mean in Kremer, I mean, presumably you looked at Kremer again before writing your paper. You can almost see he’s almost there. He’s almost at, and this is within workers too. He doesn’t exactly say it. Avi: Exactly. So when we wrote that paper, we were thinking task-based substitution. That was the model that we had. And actually, in the process of writing that paper, in some sense, we learned what was wrong with that model and ended up with, we just don’t know. And part of that is, we wrote it in 2018, 2019. We were looking for new tasks from AI. So this is before ChatGPT, like four years before ChatGPT. So new tasks hadn’t really come up yet. All we had was identifying space junk and treatment for complex disease, which actually wasn’t our idea. It was Tim Taylor’s idea, our editor. Andrey: Well, you already had AlphaFold, right? Avi: Yeah, but it’s not clear what the new task is because of AlphaFold. Yeah, fair enough. In terms of... So, and actually that paper in some sense directly led to our work on system change and GPTs, because Tim Bresnahan pulled me aside that summer at the Summer Institute and told me he hated our GPT paper. I’ve told you guys this before. Because it was a task-based model and that’s not how meaningful change happens. That then led to all this work on trying to understand, well, if it’s not a task-based model, how does the system change? Andrey: Okay. And we’ve covered that to Bresnahan paper on this podcast. Reorganizing Jobs Around AI [39:22] Seth: I guess let’s talk about reorganization of tasks. Obviously that seems to be, that’s the best case answer. The best case answer is you split off the, I guess from the perspective of a firm trying to boost productivity, maybe not necessarily from a worker’s perspective. From the firm’s perspective, you want to slice off the automatable thing, let that rip, and then figure out what you have to leave behind for humans. Is there any good research about... How do you do that? What industries are better than that at others? Like, what’s the next research frontier on that question? Avi: I think you just defined it. there are two. One is like within the firm, how do we think about where the complements are and what’s left for humans and how does that vary across organizations? The second part, and Alex Emas has highlighted this recently, is it also depends on elasticity demand for the... Seth: products. Avi: Like, you know, even if within an organization workers reallocate and they become hard to automate because they’re more productive, but then the organization is producing more, well, someone has to want that more or else then, you know, at least that organization or its competitors are going to to business. Seth: Well it’s factor, well its price will come down, know there’s a kind of a nebulous connection between price and profitability. Avi: Right. Price goes down. It’s got to go down like, well, quantity has to go up enough that we still need the workers. Andrey: There might be a paradox in there that’s not really a paradox. The misnamed Jevons paradox. Avi: Maybe. Should We Want Less Automation? [41:05] Andrey: Following up on this idea, think several prominent economists have called for a government push or ideological push to make AI that complements humans rather than substitutes for humans. Seth: Friend of the show, Erik Brynjolfsson has written about the Turing Trap. Is the Turing Trap misnamed? Is it not a trap? Should we embrace the Turing? Avi: Okay, so this is our science paper. Seth: Let’s get the hot takes. This is where we brought you on. Avi: Do want more automation? Yeah, so Eric has said it. Doron has said it. There’s lots of policy. We should complement humans, not replace them. And John Markoff is a journalist. He has this book called Machines of Loving Grace, same title as Amodei’s essay, essay, but older book. It is about the history of computing. Seth: When you’re a tech billionaire, you’re allowed to use cool phrases unsighted. I’ve noted this. Augmenters, Automaters, and Inequality [42:10] Avi: Well, they’re both referencing a poem. And in Markov’s book, there’s these two streams of computer science. There’s the, I forget exactly how he labels them, but essentially there’s the augmenters and the automaters. And at least from my perspective, the augmenters seem like the heroes of his story. And the automators who start to become prominent as this book is getting written around 2014-2015 Seth: They’re trying to trap us. They’re trapping us. Avi: But we also know that the rise of computing the internet massively increased inequality. They generated enormous wealth, but they massively increased inequality. And I hypothesize that the reason for that is, yes, they were augmenting what humans do, but they weren’t augmenting what all humans do. They were augmenting what a set of humans who are good at abstract thinking do. And those people were already doing pretty well. And so in the process of augmenting humans, right, because no human can do what the internet does or what a computer can do, they augmented folks at the top and left others with relatively stagnant incomes. Seth: Is this story there really at the task level? The way I think about that inequality story is that it’s kind of at the firm level, right? It’s we’ve now put the corner store into competition with Amazon and so Amazon wins and whatever Amazon takes as input wins. Avi: There’s a bunch of different pieces. The one I’m emphasizing is like the Autor, Katz, and Kearney framework, which is about skills. Andrey: I mean, it has to be both, right? There’s a set, right? Like, the humans who are now able to market their unique skills match with the firms that are larger, but you kind of need both to create the inequality or some of the humans become superstars without like needing the firm in first place, right? Avi: I think in principle you could get within firm inequality without getting across firm inequality. We ended up getting both. Seth: Yeah, both. Both happened. Andrey: Fair enough. Avi: but as I’m thinking like Autor, Katz, and Kearney with computing and then Shane Greenstein, Chris Foreman and I have some work on sort of the internet inequality, same kind of idea. so on the other hand, automation technology, if it’s automating things that folks at the top do, could superpower everybody else. Okay. And this is a could, cause we hasn’t really happened. So what we hypothesize, so the question, the paper is called, Do We Want Less Automation? And our answer isn’t no. Our answer is, here are reasons why it’s not obvious. Okay? It’s very economist-like. And the essence of it is, we were just talking about this medical example. Well, if what doctors are paid for is 10 years of post-secondary schooling, that essentially is about prediction, diagnosis and treatment. Then someone potentially with two to four years of post-secondary schooling who was much better at managing patient stress and all these other things, training like a social worker, combined with a diagnosis machine could be super hard. And so their productivity goes up. And there’s a bunch of industries where What people at the top do seems a lot like filling in missing information. Are Intellectuals Giving Biased Advice About AI? [45:58] Seth: One might even cynically say that these thought leaders who have been so augmented by the internet are maybe not giving the populace the best advice. Avi: Maybe. So I had an undergrad RA write an essay for me. She’s a philosophy major. you know, a couple summers ago, it’s Amelia Agarwal. I feel like I should call her out. Seth: Love undergraduate research on the pod. Avi: Yeah, the opening of her essay was, part of her assignment was to read and hear about all these people who said AI is going to automate work. And so I’m going to have to have leisure, like essentially. And she’s like, that doesn’t strike me as bad. And then she dug into it and her framing was essentially the people whose identity was driven by their, you know, intellectual abilities, public intellectuals are exactly the people most threatened by AI. And so anyway. Andrey: You know, it’s very interesting. I actually disagree. Yeah, I think lots of intellectuals are threatened by AI but not public intellectuals and that’s because humans are going to want other humans to communicate to them in many ways. So, the role of the public intellectual is not going to go away. The role of the maybe the scientist toiling away on their research. That is in my opinion much more a threat. if you’re... one might even deduce that Seth and I have started this podcast as a hedge for that world. Seth: Well, what I say is as the price of writing papers goes down, the return to reading papers goes up. But maybe this goes back to the taste idea, right? Which is one way you might think of taste is a public intellectual doesn’t let’s let’s be cynical for a minute. The public intellectual, the public art critic doesn’t actually know art better than anybody else, but they serve a role as a coordination mechanism. Right. Everybody trusts Andrey. So when Andrey points at the thing and says it’s good, everybody converges to that. And then maybe that’s one notion of taste that will be preserved. Avi: Yes, and so you started in science and moved to art. There’s probably differences between them, but in the sciences, there’s a question, or a scholar’s, what’s our goal? What are we trying to accomplish? And I think different disciplines have different goals. And depending on the goal, the role of the human curator changes. If the goal is so that humans understand the world, and have sort of a consistent model, then there’s a real role for a curator. If the goal is to build a better spaceship, then maybe there’s not such a role for a curator. And so I haven’t been following that literature, so I don’t know really what the formal academic take on what I just described is. Can Policy Steer AI Toward Augmentation? [49:27] Andrey: Yeah, I agree. I haven’t seen much formalization. So listeners, if you know of any, send it along. Yeah, I mean, I sorry, I just want to make a final point is that I think I like your criticism of this augmentation idea. But to me, there’s like a much deeper criticism, which is there’s there’s just kind of a whiff of central planning involved in it. like, how how do you know? What technologies are going to automate versus augment. Like this is very hard to predict in my mind. And to think that the government is going to like somehow implement a system of taxes on technologies that are augmentation versus substitution, it’s ridiculous in my opinion. Avi: So I was taking as given that you can understand what is automation and what’s augmentation. I agree it’s a very hard challenge. There, I think the narrative, I’m gonna be careful. I think the argument is if even without choosing winners, we might be able to tax capital relative to labor or something like that. in order to push things in a particular direction. I think that’s it. Andrey: Yeah, that’s the most plausible. Seth: That’s pretty plausible, but when you actually hear versions of the Turing Trap articulated, it’s really like go and burn down the houses of the people who want to automate you. Avi: Okay. So Korinek and Stiglitz have a chapter that’s really about tax and capital that’s in our economics of AI book. And I think like the Acemoglu Johnson argument is really about tax and capital. I’m not enough of a macro economist to have a strong opinion about one way or the other, but that I agree seems more Seth: Right, and then there’s a deeper, deeper argument there about whether or not you want to tax capital, right? There’s the old Chamley-Judd result about, well, know, labor is inelastic and capital is elastic, so really you don’t want to tax it. There’s obviously international considerations about if you have a fully automated technology, isn’t that just going to locate itself in the lowest tax jurisdiction? And so it might be very hard to tax capital. And then of course the Iván Werning follow-up research kind of complicating the original Chamley-Judd results. So this gets in the weeds really fast. Andrey: And it’s also very blunt in many ways, right? A lot of capital is not about automation. it’s a... I don’t know. Avi: Yeah, and there’s all sorts of questions in public finance and how that all plays out to like the there’s under the names Trammell and Korinek. I think it’s Trammell. No, it’s not. Andrey: That’s Lockwood. Avi: Lockwood and Korinek, thank you. have a relevant paper there. AI Growth Scenarios Through 2030 [52:36] Andrey: Next topic. Yeah. So there was a very well-circulated survey of economists about their expectations of economic growth in different AI scenarios. Seth: Now Avi, I understand you have intentionally not read this so as to have an unbiased take, so you will not be contaminated by the opinions of everyone else. Is that right? Avi: That is absolutely right. Andrey: Excellent. You’re definitely not in the same university as many of the authors. Avi: I probably will, but we’ll see. Andrey: All right. So the first conceit is that there are three scenarios for AI progress that they want us to consider. The first one is slow progress, where by the end of 2030, the AI can do PhD student level assistance, half of eight hour long coding tasks, passable stories and songs. Robotics navigate homes with some help. So that’s kind of the slow. Moderate is you have semi-autonomous labs, five-day coding tasks, high-quality novels and hit songs. Robotics can perform basic tasks. And then rapid progress outperforms top humans in research coding and leadership, award-winning creative works, nearly all physical tasks. So those are the three scenarios by 2030. So the first question is, how do you allocate the probabilities between slow, moderate, and rapid by 2030? Avi: So, okay, so with the exception of the statement about hit songs and award-winning, those are all about the models and not about the outcomes. So I’m going to ignore the hit song and award-winning part because I think that’s... Andrey: It’s of the quality of the quality that could win it. Avi: Okay, because at a high level, what I think is the technology is going to accelerate rapidly, but there are all sorts of meaningful barriers to widespread diffusion and having an impact on the economy. and sometimes I think we’re already in the slow and for aspects of the medium versus the fast, I feel like I should call it 50-50 because I’m skeptical of the like, I’m skeptical of the robotics stuff, but the five day coding task seems very, likely. And so just. Andrey: Yeah, there’s some other things. CEO level agency, you know, like is is one of the criteria. Seth: I don’t know whether or not they can run a vending machine. Avi: But don’t like part of it. So much of what a CEO does is like is charisma and creating followers, right? And I’m not sure that’s a mission. Seth: Is it charisma judgment task? Is it charisma judgment? Avi: It’s a skill. I’m not sure it’s a prediction or judgment. It’s more like an action. Andrey: Yeah. But okay, fair enough. Just to give you like a sense of where economists came in and they took this in the fall, 39 % that were still in slow by 2030, 47 % that were in moderate and 14 % then were in rapid. So you are more bullish than a typical economist. Avi: I’m more bullish. I probably shouldn’t have said zero for slow. In retrospect, I was just going to be something five to 10 or something like that. GDP Growth by 2050 [56:22] Andrey: Okay, great. Now, and I think this is the question that really there was a lot of controversy about. So, the question was, by 2050, what is the annual change in GDP on average? Avi: GDP or GDP per capita. Andrey: This is GDP. Avi: I like I have to make a population assumption. somewhere between two and 3%. Andrey: All right. You are well within the economists’ answer here: 2.5%. Avi: duplicate. And so we’ll be a little above that. Andrey: So 0.5%, that’s all we get. okay. Extra from AI over and above. Avi: Well, no, I don’t think you want to say that because the reason we have 2 % is because of innovation in past. Andrey: Okay, so fair. I agree, I completely agree with you. Avi: Like it’s possible, especially with, you know, it’s possible we would have gotten zero. Seth: 5 % better than historical rate of technological growth. Avi: Yes, something like that. Andrey: Now, what if you were for sure, what if you for sure knew we were in the fast scenario by 2030? How would that like change your predictions? Seth: It’s hard to get to above three. Avi: Like, yeah, I just think there’s a lot of bottlenecks in the economy. I think that, and we’re going to figure out what they are. Seth: We’re gonna find out fast and that guy is gonna be rich. Avi: Yes. Andrey: So you’re once again, like a very down the median economist. Avi: On growth. Yeah, okay. Seth: Can I ask you, you think that’s mostly about bottlenecks? You don’t think that’s mostly about people taking leisure? Avi: I think it’s mostly about bottlenecks. What Are the Bottlenecks? [58:36] Seth: So gun to your head, what’s the biggest bottleneck in that high growth robots are awesome scenario. Avi: I feel like my best answer is we’ll find out. Andrey: Okay. I guess the pushback that folks gave is this is a scenario where by 2030 robots can do nearly all home and industrial tasks and faster than humans, right? So you might say, well, manufacturing and physical tasks are a tiny, not tiny, but they’re not that big of a portion of the GDP already. maybe- Avi: be essentially zero is the point. If they’re that efficient and that cheap, then they won’t mean like, I guess it depends on how we calculate the deflator. agriculture is way more productive. GDP hasn’t grown by that much. Andrey: But what if we have, you know, you know, robot doctors that can do, you know, like, Avi: Great, then medicine will be cheap. It’ll be less of GDP. Andrey: I guess, all right, so here’s a hypothetical. Here’s a hypothetical. Let’s say we had a cure for cancer as a result of this, which is very plausible in the rapid scenario, and that we also, at least in principle, have the technologies to administer it through robots very efficiently because we are in a world of just true abundance. My sense is that people would value that medical care extremely highly. And if one were to properly deflate the existing cost of cancer treatment, wouldn’t that imply a very large GDP effect? Now you can say maybe we’re not going to calculate that correctly. GDP, Consumer Surplus, and Health Breakthroughs [1:00:25] Avi: Now I feel like I’m going to, you know, it’s sort of the Bob Gordon sense. I don’t think we deflated antibiotics properly. I don’t think we deflated flush toilets properly. So if you’re talking about consumer surplus, then maybe consumer surplus will be found, especially, you know, to the extent that it’s health outcomes, then huge increase in consumer surplus, much more than the argument that we’ve had for digital. Because the that debate on whether digital really made us better compared to what was happening in the 20th century, I reasonable people can be on both sides of that debate. what you’re describing, is can’t secure people living wonderfully and healthy to 100, there might be some limits to how long, but that would be wonderful and great for consumer surplus. But if that happens, I guess it might and it’s that easy, it might become so cheap that it’s it’s like agriculture. Because food is pretty essential too. And food is so cheap that we don’t worry about it so much anymore. Seth: Inelastically demanded. think people will elastically demand years of life in a way that they won’t elastically demand calories, right? Avi: Potentially. Seth: You think people will get sick of it. I thought you were to go to maybe you’ll recall in Doron’s simple macro economics of AI, a favorite paper of this podcast. He actually predicts that actually consumer surplus might raise by less than is implied by the GDP growth rate, because we’ll invent evil jobs like social media manipulator. Do you are you still convinced that consumer surplus growth will be faster than GDP growth evolves? Or are you open to this idea of the invention of evil tasks? Avi: I feel like we are not in my expertise. Seth: Turn it up. Andrey: Seth is really trying to get the hot takes. Avi: I don’t like to judge what particular products, a particular. Seth: Well, you can’t judge, you can’t predict. Avi: Yeah, you know, what am I in a- Andrey: Then you become a economist. Avi: Actually, let me give... So I think it’s reasonable for people to say some roles, some jobs, some products are better than others. I don’t think that has a meaningful role in GDP calculation. And I also worry if in our consumer surplus calculations, we economists say some things are better and some things are worse because then... So much of it is just obviously to the taste of the... Seth: It’s such a normative can of worms, right? GDP we can measure, consumer surplus. I mean, we do things at the Stanford Digital Economy Lab around trying to do willingness to accept experiments, but obviously those are highly limited too. Avi: So consumer surplus as in figuring out the area under the demand curve, that’s the kind of task I think we’re good at. It’s within our domain. whether the demand curve is morally right or wrong, that’s not something I’m going to be finding out this day. Andrey: I wanted to just like close off that loop a little bit by just saying that you just gave me an answer that said that for our evaluation of how good of a world we’re gonna get in 2050, GDP is no longer the correct sufficient statistic, which obviously makes me question like why is this such a bench? Why are people so interested in forecasting GDP in 2050 if we think it’s going to get pretty uncoupled with consumer surplus in these scenarios? Avi: Well, I’m not sure it’s more or less uncoupled than it has been in the past. I think reasonable people can disagree on that. I think the debate between Bob Gordon and Erik Brynjolfsson or Bob Gordon and others over the years is sort of is really informative about how hard it is to say, you know, what’s better versus today versus the past. What happened in the early 20th century is pretty amazing. okay, that’s point one. Point two is it’s not obvious to me that GDP like GDP tells you your national capacity. That’s what it tells you. Seth: That’s useful for things like wars and public finance. Avi: If I remember my first year econ, haven’t taught first year econ for a long time. That was the idea. What’s the industrial capacity of the country? Or what’s the economic capacity of the country? It turns out it’s highly correlated, as I understand it, with lots of welfare measures. You guys know this. And so we use it for that. Once you start deviating, then... then that’s fine, but you’re now embedding a whole other set of values. At least with GDP, we know what the values are. It’s not it’s not value laden, but we at least know what the values are that we’re embedding in that measure. Andrey: But guess I’m not sure we know, just in many conversations with economists, this question of deflators has come up and most of us haven’t spent much time thinking about what actually goes into that and how well that’s don

4 mei 20261 h 20 min

Litigating the Pope's AI Encyclical with the Lawyers of Scaling Laws Pod

Beschrijving

Reacties

Probeer 14 dagen gratis

Alle afleveringen