Seb Krier on AGI, the Coasean Singularity, and EDM
Seb Krier on AGI, Scaffolding, and Coasean Bargaining at Scale
In this episode of Justified Posteriors, we welcome Seb Krier [https://x.com/sebkrier] — policy lead for AGI at Google DeepMind and excellent Twitter poster. Speaking in his personal capacity, Seb walks us through his understanding of AGI, why AI alignment has gone better than expected, the potential and limitations of a world where agents constantly barter on our behalf, and — of course — electronic music.
We also cover AI in London vs. New York, how Seb went from reading Marginal Revolution for 15 years to becoming a recurring character on it, and Seb’s side-splitting humor on mediocre AI conferences.
Related Links
* Seb Krier on X: @sebkrier [https://x.com/sebkrier]
* Seb’s Substack, Technologik [https://technologik.substack.com/]
* “Coasean Bargaining at Scale” [https://blog.cosmos-institute.org/p/coasean-bargaining-at-scale] — Seb’s essay at the Cosmos Institute (also republished here [https://www.aipolicyperspectives.com/p/coasean-bargaining-at-scale])
* “Musings on Recursive Self-Improvement” [https://technologik.substack.com/p/musings-on-recursive-self-improvement] — Seb’s essay separating model-side RSI from societal-side
* “The Cyborg Era: What AI Means for Jobs” [https://aleximas.substack.com/p/the-cyborg-era-what-ai-means-for] — Seb’s guest essay on Alex Imas’s Substack, defending the scaffolding view
* Anthropic’s Project Deal [https://www.anthropic.com/features/project-deal] — the agent-bargaining experiment among Anthropic employees
* Fradkin & Krishnan, “MarketBench” [https://andreyfradkin.com/assets/marketbench.pdf] — Andrey and Rohit experiment of LLMs bidding in procurement auctions as an investigation of the future of AI marketplaces and the companion writeup: Rohit Krishnan, “Agent, Know Thyself! (and bid accordingly)” [https://www.strangeloopcanon.com/p/agent-know-thyself-and-bid-accordingly]
* Edge Esmeralda [https://www.edgeesmeralda.com/] — Devon Zuegel’s pop-up village in Healdsburg, CA
* MATS [https://www.matsprogram.org/] — for junior economists looking to skill up on AI safety/governance
* Cosmos Institute [https://cosmos-institute.org/] and FIRE [https://www.thefire.org/]
* bianjie.systems [https://bianjie.systems/] — the art platform Seb is co-organizing a dinner with in NY (Seb’s announcement [https://x.com/sebkrier/status/2054941198406602861])
* Drexciya [https://en.wikipedia.org/wiki/Drexciya] — James Stinson, Gerald Donald, and the Detroit electro-afrofuturism canon
Timestamps
(00:00) Intro (01:16) What is AGI? (07:30) In defense of scaffolding — Hayek, division of labor, and why one giant model won’t do it (13:00) Markets for cognition: will agents bid in procurement auctions? (18:40) Recursive self-improvement — separating the model side from the societal side (24:44) Alignment has gone better than 2017-Seb expected; prefer “intent following” (31:14) What economists should actually work on to inform AI labs(33:32) What does a DeepMind policy lead’s day look like? (38:20) AI Conferences(41:52) Coasean bargaining at scale — the positive vision(55:00) Inequality, property rights, and who gets the initial allocation (01:03:00) The Helldivers 2 “Managed Democracy” dystopia as Coasean bargaining gone wrong (01:09:00) Sponsor: Revelio Labs (01:09:30) Lightning round
Justified Posteriors is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber. You’re also invited to our discord community at: https://discord.gg/b8VpPbBUt
Transcript
00:00:00,100 --> 00:00:20,480 [Seth]
[upbeat music] Welcome to the Justified Posterior’s podcast, the podcast that updates beliefs about the economics of AI and technology. I’m Seth Benzell, the number two biggest fan, after Tyler Cowen, in the Seb Krier fan club.
00:00:20,480 --> 00:00:20,740 [Andrey]
[laughs]
00:00:20,740 --> 00:00:24,660 [Seth]
Coming to you from Chapman University in sunny southern California.
00:00:24,660 --> 00:00:34,120 [Andrey]
And I’m Andrey Fradkin, coming to you from San Francisco, California. And Justified Posterior’s is sponsored by the fine folks at Revelio Labs.
00:00:35,560 --> 00:00:45,600 [Andrey]
We’re very excited to have Seb Krier here with us today. He is the policy lead for AGI at Google DeepMind, and is,
00:00:46,840 --> 00:00:52,400 [Andrey]
dare I say, a thought leader in this space. Welcome to the show, Seb.
00:00:52,400 --> 00:00:54,200 [Seb Krier]
Thank you very much. It’s great to be here.
00:00:55,380 --> 00:00:58,160 [Seb Krier]
Yeah, I’m Seb, calling in from New York.
00:00:58,160 --> 00:01:00,320 [Andrey]
And we should remind our listeners that
00:01:01,340 --> 00:01:08,410 [Andrey]
Seb is, during this podcast, expressing his personal opinions, and is not speaking on behalf of DeepMind. All right.
00:01:08,410 --> 00:01:09,740 [Seb Krier]
Indeed. [laughs]
00:01:09,740 --> 00:01:11,060 [Andrey]
[laughs]
00:01:12,780 --> 00:01:13,900 [Andrey]
The usual caveat.
00:01:15,260 --> 00:01:16,760 [Andrey]
Seb, what is AGI?
00:01:18,080 --> 00:01:19,450 [Seb Krier]
What is AGI? [laughs]
00:01:19,450 --> 00:01:19,570 [Andrey]
[laughs]
00:01:19,570 --> 00:01:19,580 [Seth]
[laughs]
00:01:19,580 --> 00:01:19,780 [Seb Krier]
Great question.
00:01:19,780 --> 00:01:21,900 [Andrey]
We’re going to start with the big questions.
00:01:21,900 --> 00:01:22,880 [Seb Krier]
Yeah, might as well.
00:01:24,259 --> 00:01:54,840 [Seb Krier]
[sighs] I think there’s so many definitions out there of what AGI is, and I think most of them are kind of unsatisfactory in one way or another. I’ve seen stuff like many definitions are indexed on the societal transformations or economic impacts of the technology, which I don’t really like very much because it makes it very dependent on external factors whether or not we have AGI. If it’s banned, we don’t have AGI, and if it’s not banned, we have AGI. Is it?
00:01:54,840 --> 00:01:55,480 [Andrey]
[laughs]
00:01:55,480 --> 00:02:04,670 [Seb Krier]
And there are other tests, like if an AI makes $1 million or something, which I find is very weird because most humans do not make $1 million in the first place.
00:02:04,670 --> 00:02:05,080 [Andrey]
[laughs]
00:02:05,080 --> 00:02:11,359 [Seb Krier]
So the one I kind of like is actually Shane Legg’s definition-
00:02:11,360 --> 00:02:11,620 [Andrey]
Mm
00:02:11,620 --> 00:02:12,420 [Seb Krier]
... who’s at Deep Mind, who is
00:02:13,640 --> 00:02:16,980 [Seb Krier]
more of a capability-based definition, which is something along the lines of
00:02:18,420 --> 00:02:20,960 [Seb Krier]
an AI or a system that does most
00:02:22,380 --> 00:02:30,360 [Seb Krier]
standard cognitive tasks that people typically do. [lips smack] So it’s kind of the bar isn’t too low, and it’s also not too high either.
00:02:32,220 --> 00:02:35,480 [Seb Krier]
And so I think he’s got this definition of a minimal AGI,
00:02:36,580 --> 00:02:43,020 [Seb Krier]
and I think that we’re not exactly there yet. I would disagree with people saying that we have AGI today because I think
00:02:44,220 --> 00:02:48,900 [Seb Krier]
a lot of the systems we have, there’s many things that a human can do that they don’t really do very well.
00:02:48,900 --> 00:02:50,360 [Seth]
What’s the biggest gap that we’re missing?
00:02:52,020 --> 00:03:47,740 [Seb Krier]
I’d say there’s a few. One of them might be continual learning, or at least the ability to adapt and learn over time, and in different contexts and situations, just kind of update your own world model or whatever. If I think of a new joiner in a company, they’re not super useful the first day, but their value goes up over time because they learn all sorts of things. And so [lips smack] that might be one of them. A lot of the systems we have today, I think, are not very good at software, and you’re using graphical user interfaces and software and whatnot. If I ask an agent right now to go and use a music production software and make a track, I think they’d generally struggle. That doesn’t mean it’s impossible to solve or anything like that, but I think, in many respects, they’re not as general as you’d want them to be. And then the other bit also is, [lips smack] and of course they still make some silly mistakes here and there, but I think that’s getting it fixed. But the creativity point is one that I’m really interested in as well, in that I think they’re really good at kind of
00:03:48,780 --> 00:04:02,700 [Seb Krier]
exploiting maybe an existing paradigm or an existing knowledge and so on, and recombining knowledge and whatnot. But I think really coming up with new concepts and abstractions entirely is something I think humans can do, but I don’t see our current systems really doing either.
00:04:02,700 --> 00:04:10,060 [Andrey]
How do you measure whether humans can do creative tasks? One of the things that
00:04:11,200 --> 00:04:15,940 [Andrey]
strikes me as a bit of an unfair test in that,
00:04:17,060 --> 00:04:23,290 [Andrey]
let’s say you ask an LLM to write a poem or to write a story. It’s very-
00:04:23,290 --> 00:04:23,290 [Seth]
[laughs]
00:04:23,290 --> 00:04:32,050 [Andrey]
... times more entertaining than what a random human would write. So, do you have a benchmark for creativity?
00:04:32,050 --> 00:04:35,390 [Seth]
This is the meme where the robot asks Will Smith if he can compose an opera.
00:04:35,390 --> 00:05:14,700 [Seb Krier]
[laughs] Can you? Yeah, exactly. It depends, and you’re right. Obviously, most people aren’t creating new abstraction and concepts on a day-to-day level. But I imagine there’s still something qualitative about that kind of creativity that I think does get applied in everyone’s day-to-day life in various kind of ways. Maybe they’re not as big or significant as creating a symphony. But I don’t really have a strong test. There’s actually an interesting podcast that had Ben Goertzel and Yoshua, I think a few years ago, where they were saying something like, if you had a model that was trained knowing only classical music and West African drumming, could it come up with jazz in the first place, or recreate jazz?
00:05:16,460 --> 00:05:27,880 [Seb Krier]
And I quite like that test. And in principle, I can imagine it being possible. You could kind of decompose all sorts of different kind of elements and variables here and just get something jazz-like. But it still feels a bit...
00:05:29,580 --> 00:05:40,580 [Seb Krier]
It’s not the same as just coming up with the idea of jazz in the first place and saying, oh, I’m going to try these things out. And for whatever reason, I’m going to stick to that. And I don’t know. It’s-
00:05:40,580 --> 00:05:53,190 [Seth]
Recombination versus paradigm shifting. I’ve also heard one test people would want for AGI is, can you train the model on the 1900s corpus and it comes up with Einsteinian physics?
00:05:53,190 --> 00:05:53,200 [Seb Krier]
Yeah.
00:05:53,200 --> 00:05:54,720 [Seth]
That would be really impressive.
00:05:54,720 --> 00:06:36,151 [Seb Krier]
Yeah, I think actually Demis uses that test sometimes, or I think Pele Gritzer as well mentioned it before. And there are some people, I think David Duvenour and Nick Levine, I think, had this recent kind of language model talky that was trained up in, I think, the 1930s or something. And I tried to play around with it a lot. It was like, let’s try to get it to create something new, and it’s pretty tricky. Although they have apparently recently, some people kind of fine-tuned it on a very few examples of coding and gotten it to be good at coding. But for some reason, that doesn’t impress me maybe as much as other things I would’ve expected. It’s like [laughs] there’s the-I agree that the goalposts also kind of move a little bit over time, and it’s also maybe unfair of me. It’s like, oh, well, can it create a new programming language from scratch or something?
00:06:37,272 --> 00:06:43,052 [Seb Krier]
So it’s a tricky one to kind of square off, but it does still feel like there’s a lack of that kind of true creativity, at least in my
00:06:44,212 --> 00:06:45,072 [Seb Krier]
interactions with them.
00:06:46,392 --> 00:06:57,342 [Andrey]
I am really worried that it is a goalpost moving exercise here. We don’t have a benchmark for creativity and therefore,
00:06:58,432 --> 00:07:03,211 [Andrey]
all these claims are not quantitative in a way that I’d like. And let-
00:07:03,212 --> 00:07:10,612 [Seth]
Right. What about all those IS papers we see where one of the axes is creativity and we instrument for something? [laughs]
00:07:10,612 --> 00:07:11,032 [Andrey]
Yes.
00:07:13,132 --> 00:07:13,592 [Seth]
There’s a lot of bad measures of creativity.
00:07:13,592 --> 00:07:19,762 [Andrey]
Those are not creative, to be clear. I’m sure I’ve offended a ton of people. Sorry.
00:07:19,762 --> 00:07:20,992 [Seth]
It’s okay.
00:07:20,992 --> 00:07:56,432 [Seb Krier]
I think it’s fair. I agree that it’s a bit like... But I still feel like there’s, at least if part of the reason you’re going to create these systems is to come up with kind of also new sorts of theories and so on. And I think you can probably get that through good search and a lot of inference compute and trying out lots of different things. And I think there are many low-hanging fruits there, to be clear. So it’s not like I think, oh, we’ve hit some sort of wall or something. And I think there’s a lot that you can kind of get in terms of new knowledge and new creative knowledge from that. But I feel like there’s maybe something more needed. It’s maybe not that kind of magical or anything, right? Maybe you just need better scaffolding or better multi-agent systems. But
00:07:58,992 --> 00:08:02,072 [Seb Krier]
yeah, at least so far, I would say that I see a bit more creativity, say, in
00:08:03,652 --> 00:08:11,612 [Seb Krier]
humans so far as a collective. And maybe that’s, again, an unfair comparison. You don’t have a culture of AIs and AGIs to compare that against. So-
00:08:11,612 --> 00:08:11,682 [Andrey]
Yeah
00:08:11,682 --> 00:08:15,092 [Seb Krier]
... the right comparison is also a hard one to do.
00:08:15,092 --> 00:08:52,772 [Andrey]
So, you mentioned scaffolding, and I guess a question, you recently wrote about a defense of scaffolding, and I think just to frame things, some people you talk with, especially very AGI-pilled people, are like, “Scaffolding, it’s an epiphenomenon. It doesn’t matter. In the end, we are going to train a smarter model with more parameters and more training data, and it’s just going to do it out of the box. And so all these scaffolding hacks are just very temporary.” And then other people like yourself, I guess, argue the opposite. So what do you think about scaffolding?
00:08:54,832 --> 00:08:55,052 [Seb Krier]
Yeah.
00:08:56,572 --> 00:08:59,372 [Seb Krier]
The first thing is I’m definitely not sure. This is kind of
00:09:00,532 --> 00:09:39,672 [Seb Krier]
one of many hot takes, but I think, I guess there are a few reasons why I see it as, I think it’s going to stay over time. The first is that I think it’s plausible that as, I think scaling laws continue, I think you scale models and they get better over time and so on, but I think the inputs are expensive and grow over time. And I also think that it’s plausible that you might get more and more diminishing returns over time. And if that’s the case, I see the kind of utility of the scaffolding side and the harnesses as going up because you’re going to want to make more, you’ll want more bang for your buck kind of thing. You’re going to want to extract this intelligence and use this resource as efficiently as possible.
00:09:40,772 --> 00:09:51,532 [Seb Krier]
So that’s maybe one reason. The other one is a bit more, I guess, Hayekian in nature or something, in that I see a lot of, I think there’s a lot of local knowledge, a lot of
00:09:53,212 --> 00:10:18,592 [Seb Krier]
stuff that isn’t necessarily kind of codified. And I don’t really see one big giant AGI model now kind of perfectly guessing everything forever at infinite scales. And in a way, I see this as a little bit like a division of labor in that I think it’s actually more efficient to have this kind of integration layer that is closer to the local information or to the ground or to demand side that can better integrate this kind of cognitive resource
00:10:19,812 --> 00:10:23,632 [Seb Krier]
to satisfy and create value and satisfy whatever consumers and businesses want.
00:10:25,552 --> 00:10:31,352 [Seb Krier]
So to help with all the sorts of constraints and the context they’re dealing with, I think it’s very useful to have that.
00:10:33,712 --> 00:10:39,112 [Seb Krier]
Of course, I don’t think this necessarily also implies or means that you’re going to get complete, full decentralization or something.
00:10:40,772 --> 00:10:42,212 [Seb Krier]
Walmart gets huge
00:10:43,872 --> 00:10:48,872 [Seb Krier]
returns from the scale that they have, and you don’t have loads of businesses downstream kind of reselling their stuff.
00:10:51,252 --> 00:10:53,932 [Seb Krier]
But there’s two things. The first is that-
00:10:53,932 --> 00:10:56,812 [Seth]
We have bodegas reselling stuff from Walmart on the corner.
00:10:56,812 --> 00:11:18,992 [Seb Krier]
Actually, that’s a good point, yeah. And also, there are all sorts of other businesses kind of selling different things, right? If the task is generic and the demand is homogenous, then sure, maybe you can do more of that. But also, even Walmart relies on all sorts of kind of suppliers, local labor, compliance system, inventory systems, third parties, and whatnot, that help with this kind of integration and the delivery of these services.
00:11:18,992 --> 00:11:25,862 [Seth]
So if I may summarize your answer, you’re very Hayek-pilled, but maybe not as Bitterlesson-pilled as most.
00:11:25,862 --> 00:11:25,972 [Seb Krier]
Well,
00:11:27,212 --> 00:11:31,052 [Seb Krier]
I think I’m definitely Bitterlesson-pilled in the sense that I don’t think you should
00:11:33,652 --> 00:11:48,992 [Seb Krier]
try to kind of cement some sort of rules-based system you either devise or something and kind of hope that this just takes forever. If anything, I think the scaffold needs to be a lot more adaptive and evolve over time. In the same way as if you have a small startup and they have all sorts of kind of rules and,
00:11:50,332 --> 00:12:02,772 [Seb Krier]
sorry, not rules, different functions. When the startup grows and gets more capabilities, they also kind of change from the inside. So I think that, of course, if you have some sort of light GPT-type wrapper that kind of makes your system a little bit better, whatever, yeah, that was not going to
00:12:03,812 --> 00:12:23,652 [Seb Krier]
work out over time. But I think there are kind of scaffolds that help better integrate the wider environment, private data, deals with permissions or liability regimes or user preferences and whatnot. And also, at a somewhat higher level, kind of more coordination-type scaffolds maybe in terms of market interfaces, like clearing house equivalents or something.
00:12:24,516 --> 00:12:33,536 [Seth]
The third example you gave is maybe it’s not the super frontier model that are going to these scaffolds, but simpler models that are still very useful and cheaper to run with a scaffold.
00:12:33,536 --> 00:12:46,176 [Seb Krier]
Yeah, totally. Because I think you’re not going to need the enormous, super expensive brain for every single random task. And so it’ll make, for most kind of basic queries, people aren’t using Opus’s latent space or something as-
00:12:46,176 --> 00:12:46,186 [Seth]
[laughing]
00:12:46,186 --> 00:12:48,236 [Seb Krier]
... it’s a big waste in some sense.
00:12:48,236 --> 00:12:50,036 [Seth]
What toothbrush should I buy? [chuckles]
00:12:50,036 --> 00:12:51,196 [Seb Krier]
Yeah. Exactly.
00:12:51,196 --> 00:12:53,896 [Andrey]
Wait. That is an important question, Seth.
00:12:53,896 --> 00:12:54,516 [Seb Krier]
I mean-
00:12:54,516 --> 00:12:56,536 [Andrey]
I would definitely use Opus for that.
00:12:56,536 --> 00:12:57,385 [Seb Krier]
It’s funny because I’ve actually-
00:12:57,385 --> 00:12:59,696 [Seth]
Use all the collective intelligence of reality. [chuckles]
00:12:59,696 --> 00:13:02,266 [Seb Krier]
I have actually used Opus for that exact question not long ago-
00:13:02,266 --> 00:13:02,626 [Seth]
[laughing]
00:13:02,626 --> 00:13:06,256 [Seb Krier]
... in trying out this new electric toothbrush that I found out as a result. But,
00:13:07,636 --> 00:13:22,076 [Seb Krier]
so yeah, I agree there’s that and also there’s all sorts of ways in which actually kind of using tools or specialized kind of tools is just more effective and more efficient. Why would you expect a large model or something to kind of calculate things innately or something when you can just access a calculator? It’s a much better use of tokens.
00:13:22,076 --> 00:13:36,856 [Andrey]
But it should kind of know that the calculator is available and then use it when it’s there. So that’s the argument against scaffolding, or you’re giving it a general environment, but you’re not scaffolding it much. I think a curious thing is just,
00:13:38,376 --> 00:13:40,356 [Andrey]
it seems like most people who are using
00:13:41,416 --> 00:13:49,156 [Andrey]
scaffolded agents today are using them with essentially one of two scaffolds, with Cloud Code or Codex. And
00:13:50,236 --> 00:14:00,475 [Andrey]
those seem to be good enough maybe. I guess, do we see a lot of people customizing, a lot of people, whatever, companies customizing their scaffolds?
00:14:00,476 --> 00:14:03,856 [Seth]
CladBot, do the CladBots count as that, I guess?
00:14:03,856 --> 00:14:04,236 [Andrey]
Yeah.
00:14:05,396 --> 00:14:39,676 [Seb Krier]
They are a form of it. I don’t know. I think a lot of power users and people in our immediate communities use a lot of Cloud Code and Codex, and particularly software engineers. But I don’t think most legal departments and most kind of firms out there are necessarily using Cloud Code either. And it’s not clear to me that this is necessarily the optimal interface or, there may be better systems that are Cloud Code-like, or CLI-like perhaps in some way. But, so I don’t know, maybe they’re sufficient, but even these tools end up kind of calling on loads of other external APIs and tools and so on in how they
00:14:40,836 --> 00:14:57,576 [Seb Krier]
function. So if anything, these are actually scaffolds. You’re not kind of calling the model directly. There’s all sorts of different sub-agents behind the scenes. It’s not just a one-shot call. There’s quite a lot going on, which is in fact this more, I don’t know, dynamic scaffolding thing I was mentioning earlier, I guess.
00:14:58,976 --> 00:15:06,736 [Andrey]
Okay. The natural question here is, what is going to be the role of the market in coordinating-
00:15:06,736 --> 00:15:07,375 [Seb Krier]
Mm
00:15:07,375 --> 00:15:11,276 [Andrey]
... AI here? And I’ll just very shamelessly plug-
00:15:11,276 --> 00:15:11,285 [Seb Krier]
[chuckles]
00:15:11,285 --> 00:15:24,796 [Andrey]
... some recent work with Rohit Krishnan, where we’re kind of playing around with the idea of LLMs bidding in a procurement auction and seeing whether that results in more efficient use of AI.
00:15:26,696 --> 00:15:29,655 [Seb Krier]
Well, first of all, I need to properly read that again. But the-
00:15:29,655 --> 00:15:30,476 [Andrey]
[laughing]
00:15:30,476 --> 00:15:31,016 [Seb Krier]
In terms of,
00:15:32,496 --> 00:15:32,916 [Seb Krier]
I guess,
00:15:34,556 --> 00:15:46,396 [Seb Krier]
at a very high level, markets are good at just coordinating in general, including AI. And so, assuming they function as intended in it, you’ve got the pricing mechanism to get...
00:15:47,556 --> 00:15:49,396 [Seb Krier]
I don’t know. I expect that to kind of work as well with
00:15:50,476 --> 00:15:52,616 [Seb Krier]
matching, I guess, supply and demand or something.
00:15:54,016 --> 00:15:55,196 [Seb Krier]
The supply of this
00:15:56,216 --> 00:16:00,036 [Seb Krier]
raw resource of cognition or something, and the demand of all sorts of different businesses and users.
00:16:01,696 --> 00:16:05,516 [Seb Krier]
So maybe, at a very high level, I don’t know. What exactly do you mean by the role of the market or something here?
00:16:09,076 --> 00:16:21,356 [Andrey]
Obviously the market is involved in many parts of the AI vertical supply chain, right? From competition in chips. There’s competition between models. There might be also competition between
00:16:22,516 --> 00:16:28,576 [Andrey]
scaffolds, bundles of environments, scaffolds, and LLMs.
00:16:28,576 --> 00:17:06,496 [Seth]
I guess maybe it would be useful to juxtapose this versus, so what Andrey, one of the things he’s imagining is, I have a job. I post it to some sort of Upwork-like future platform. Different companies that host different AI models bid to do that job. “Oh, I think I can do that job with $1 of electricity and tokens,” versus another model, and then we get efficient allocation of intellectual tasks to models, right? So do we think that that’s going to be important, or is it going to be more like I ask the super model what the best model is, and I just get allocated in a non-market way? Might be one version of this question.
00:17:08,156 --> 00:17:18,836 [Seb Krier]
I guess intuitively, my mind goes to the former question. But, or there’s a little bit of both in some sense, because even in the former one, you’re going to be using the large model for some sort of
00:17:20,436 --> 00:17:26,686 [Seb Krier]
cognitively demanding task or something. It kind of depends what kind of quality of output you also need and want.
00:17:26,686 --> 00:17:26,706 [Seth]
[chuckles]
00:17:26,706 --> 00:17:27,056 [Seb Krier]
But then
00:17:28,376 --> 00:17:49,636 [Seb Krier]
you’re still going to be constrained by your own resources or something, and depending on what you have to spend, if you can get the output for cheaper by kind of relying on this kind of competitive marketplace of smaller models or something, not even smaller models, they might just be all be big and kind of just scaffolding different, you’re offering a slightly different thing. Why wouldn’t you go for that, and why wouldn’t that exist in the first place? Unless the very first-
00:17:49,636 --> 00:17:52,216 [Andrey]
Doesn’t exist yet, just to be clear.
00:17:52,216 --> 00:17:52,716 [Seb Krier]
Um-
00:17:52,716 --> 00:17:58,416 [Seth]
A, it doesn’t exist yet, and as Andrey proves, at least current models are bad at understanding their own capabilities.
00:17:58,416 --> 00:17:58,666 [Andrey]
Oh, yeah.
00:17:58,666 --> 00:18:00,496 [Seth]
Now maybe that’s going to be fixed.
00:18:00,496 --> 00:18:08,096 [Seb Krier]
Yeah. Oh, no, I agree. I think that we’re not there yet, right? I think, again, and that goes back to the earlier AGI question, is there’s all sorts of, then again, what’s the right comparator? But,
00:18:09,476 --> 00:18:21,316 [Seb Krier]
yeah, I don’t think we’re exactly there. Yeah, I think a lot of this will have to be built as well. The kind of an ability for a model to just better kind of operate in a more multi-agent environment, kind of have a better sense of
00:18:22,596 --> 00:18:32,556 [Seb Krier]
delegation. I think the kind of, yeah, industrial intelligence or something seems to be maybe more neglected, as opposed to just single-agent intelligence or something, if that makes sense.
00:18:32,556 --> 00:18:34,776 [Seth]
Do we need to bring the word cybernetics back?
00:18:34,776 --> 00:18:35,496 [Seb Krier]
Yeah.
00:18:35,496 --> 00:18:36,116 [Andrey]
[laughs]
00:18:36,116 --> 00:18:38,816 [Seb Krier]
Somewhat. [laughs]
00:18:40,756 --> 00:18:51,256 [Andrey]
All right. A little change in subject, but I know this has been in the discourse, the topic of recursive self-improvement, RSI.
00:18:51,256 --> 00:18:52,956 [Seth]
Ooh, very scary.
00:18:52,956 --> 00:18:54,896 [Andrey]
Jack Clark recently had an essay about it.
00:18:56,376 --> 00:18:58,876 [Andrey]
Seb, what is your take?
00:18:58,876 --> 00:18:59,206 [Seb Krier]
[chuckles]
00:19:00,316 --> 00:19:07,896 [Seb Krier]
What is my take? I don’t know. I think it depends what exactly we mean by recursive self-improvement.
00:19:09,096 --> 00:19:50,336 [Seb Krier]
I had a blog post not long ago, I guess, when trying to disentangle a little bit what I have in mind when I think about this. On the one hand, there’s the model getting recursively better through the usage of more AI and whatnot. And on the other hand, there’s the more kind of societal side of things, the transformation side, which I think very often, these two worlds are a little bit blurred in the discourse. It’s like, oh, you get RSI, and then X, Y, Z about the world or something. Things go really fast or they don’t go fast. And, I think these should be separated very neatly because on the model side, of course, I expect, already there’s a lot of AI being used everywhere to kind of create models. And I expect that to continue.
00:19:52,536 --> 00:19:55,976 [Seb Krier]
But it’s not clear to me that this necessarily now leads to a dynamic by which
00:19:57,156 --> 00:20:16,596 [Seb Krier]
the model now gets extremely or exponentially intelligent in a very short amount of time. It’s still kind of bottlenecked by all sorts of resources. And as I was saying earlier, I still see them as better at kind of paradigm exploitation than kind of exploration, which I think is the thing you might need to get to the next step. But, first of all, what do I know? But secondly,
00:20:17,616 --> 00:20:19,986 [Seb Krier]
the other thing is, yeah, on the societal side of things,
00:20:20,996 --> 00:20:29,756 [Seb Krier]
people sometimes talk about foom or hard takeoffs and whatnot, and these have very clear kind of real-life implications. It’s not just kind of a model of getting better in a
00:20:31,216 --> 00:20:34,576 [Seb Krier]
data center somewhere. And that side, I think, is where you have to think about
00:20:36,116 --> 00:21:27,056 [Seb Krier]
[lip smack] all the kind of usual bottlenecks, adoption, deployment, diffusion, the kind of productive integration of all these systems at scale, both in terms of manufacturing and so on and so forth. And, I guess it’s not clear to me that the shift from GPT-2 to GPT-3 or coming up with kind of, we’re just very classic kind of software engineering, meat and potatoes type tasks that you can just easily just automate away. It’s maybe one of these things that’s maybe easy to say ex post, but, I’m not sure. And certainly, my expectation is you’re going to get loads of gains in the coming years of kind of automating part of that pipeline. But that seems good. You just get better models, and that’s just overall helpful for all sorts of other things, even if you’re doing safety work and kind of governance work and whatnot, we benefit a lot from that cognitive resource, I guess.
00:21:27,056 --> 00:21:40,696 [Andrey]
What would happen in the world for you to change your mind? Is there any, let’s say that recursive self-improvement is actually kind of this much more profound change than you’re painting.
00:21:41,816 --> 00:21:42,036 [Andrey]
What
00:21:44,136 --> 00:21:45,696 [Andrey]
signs would there be, I guess? Yeah.
00:21:45,696 --> 00:21:51,656 [Seb Krier]
But to be clear, I’m not claiming it’s just business as usual, nothing to see here or whatever, right? I’m
00:21:52,796 --> 00:22:14,936 [Seb Krier]
kind of just claiming that some of the stronger versions of the claim aren’t kind of self-evident. And so I see a lot of this happening in some sense. Certainly, in 10 years, I expect to have larger kind of more, again, acceleration of economic growth and whatnot and kind of faster diffusion across the board. I certainly don’t expect diffusion to take the same amount of time as, say, electricity or these other technologies.
00:22:16,576 --> 00:22:23,236 [Seb Krier]
So it depends what exactly you mean, because what specifically am I looking to change my mind on?
00:22:23,296 --> 00:22:30,656 [Andrey]
Well, let’s say the scenarios of AI 2027, right? Presumably,
00:22:31,996 --> 00:22:45,176 [Andrey]
in 2027, you’ll see something that’s like, “Oh, wow, I was wrong. This is not going to be so gradual. This is going to be this sudden foom,” that you’re criticizing. Yeah.
00:22:45,176 --> 00:22:52,236 [Seb Krier]
The original foom or hard takeoff definition literally talks about this change happening within hours or days.
00:22:52,236 --> 00:22:53,236 [Andrey]
[chuckles]
00:22:53,236 --> 00:22:56,056 [Seb Krier]
Which is not even, it’s not what the 2027 scenario, I think, predicts.
00:22:56,056 --> 00:22:56,296 [Andrey]
Yes.
00:22:57,556 --> 00:23:00,446 [Seb Krier]
But the 2027 scenario, from what I remember, again, it’s been a bit of time now.
00:23:01,796 --> 00:23:08,816 [Seb Krier]
One thing with the scenarios there is that there’s the kind of misalignment assumption, and which I’m kind of uncertain about.
00:23:08,816 --> 00:23:09,255 [Andrey]
Mm.
00:23:09,256 --> 00:23:17,296 [Seb Krier]
And it also talks about a lot of progress in robotics, which I think is a bit further away. I think it’s close. We’re getting there, too.
00:23:19,116 --> 00:23:19,476 [Seb Krier]
But
00:23:21,156 --> 00:23:25,916 [Seb Krier]
I don’t know. Probably kind of AI, if in 2030, we start seeing AI is making all sorts of crazy
00:23:26,956 --> 00:24:06,196 [Seb Krier]
inventions, innovations in fields other than just kind of perhaps math and coding across the boards, and I’m like, okay, this is clearly-- And you get extremely fast adoption, too, right? You have entire businesses doing completely, it’s not business as usual, clearly, in the economy or something and wide adoption. But it’s hard to say because I expect all that to some degree, right? It’s not that I’m saying, “Oh, this is never going to happen.” I just think of it as a little bit more elongated and the implications of that being maybe not as like, we have Dyson spheres in five years or something like that, so. It’s more of a disagreement maybe on the extremes or the margins or something, but not so much at the core of the claim that yes, models are going to make models better and...
00:24:07,276 --> 00:24:27,536 [Seb Krier]
But, again, even having-- In fact, actually, here would be a thing. If Anthropic or DeepMind or something in 2037 have fewer and fewer employees, fewer people kind of just doing AI research, engineers and so on, you’re clearly seeing kind of that profession. Because of course, I can imagine these jobs to change, right? Maybe you’re kind of managing more agents or something. That
00:24:28,616 --> 00:24:35,966 [Seb Krier]
I expect. But the fact that you just need far fewer people to kind of do not only these large training runs, but the kind of
00:24:36,976 --> 00:24:43,476 [Seb Krier]
large training runs that give you just much, much better systems, then I think I’d be like, okay, this is going a little bit faster than maybe expected or something.
00:24:44,656 --> 00:24:51,676 [Andrey]
Okay. One thing you mentioned in that kind of hints at another hot take you have, which is about alignment.
00:24:51,676 --> 00:24:52,026 [Seb Krier]
Uh-huh.
00:24:54,596 --> 00:24:55,926 [Andrey]
What’s the deal with alignment?
00:24:57,196 --> 00:24:58,086 [Andrey]
[laughs]
00:24:58,086 --> 00:24:58,136 [Seb Krier]
[laughs]
00:24:58,136 --> 00:25:02,136 [Seth]
Is it hard? Is it easy? Is it different than we would’ve expected going in?
00:25:02,136 --> 00:25:19,646 [Seb Krier]
Yeah. It’s perhaps that. I think my take about alignment is something-- Well, first of all, I just don’t like the word. I think it’s a bit of an annoying word because it’s being used for all sorts of things. The AI says something that we just kind of don’t like, or you say, “Oh, it’s misaligned.” No one pre-registers what they expect the aligned behavior to be, and then just kind of tests.
00:25:19,646 --> 00:25:20,116 [Andrey]
[laughs]
00:25:20,116 --> 00:25:35,626 [Seb Krier]
But I think my general claim is maybe the fact that it’s been easier than we would’ve predicted a decade ago or so. Then when I first got into AI in 2017, that was partly as a result of reading things like “Superintelligence” by Bostrom.
00:25:35,626 --> 00:25:36,236 [Andrey]
Mm-hmm.
00:25:36,236 --> 00:25:48,496 [Seb Krier]
And you’d read these books, like Stuart Russell’s “Human Compatible” and others, that kind of had all these analogies like King Midas and you ask a system to optimize for goal X, and in pursuit of that goal, it does all sorts of other things that you don’t want it to do.
00:25:48,496 --> 00:25:51,916 [Seth]
Right. The paperclip maximizer, and we seem to not have those.
00:25:51,916 --> 00:25:57,476 [Seb Krier]
Yeah. It’s like one version of it or one variant of it. And certainly at the time you didn’t really have language models. A lot of these intuitions were kind of based off
00:25:58,596 --> 00:26:48,236 [Seb Krier]
reinforcement learning systems in very basic kind of game scenarios where they were actually given a single goal to optimize for. And this is not actually what we do, I think, with models. And you had these kind of examples, even the value loading problem was something discussed at the time where actually specifying these complicated nuanced human values in mathematical terms would be extremely hard. So even if you managed to tell a robot to clean the room, it would then just pick up a baby and put it in the trash or something. And I think it turns out a lot of this stuff is actually much easier. You have problems. You’ve got things like reward hacking. You’ve got AIs behaving in weird ways that we were not always kind of anticipating because of the ways they were post-trained. So my claim is not like, oh, again, it’s all fine, and safety is a scam or whatever. It’s more that it’s certainly much easier than, or at least we’re in a much better track than I would’ve at least guessed perhaps a decade ago. And secondly, I think it
00:26:49,916 --> 00:26:54,816 [Seb Krier]
just seems tractable. There’s a lot of progress in terms of chain-of-thought monitoring and all these other things. And
00:26:56,696 --> 00:26:57,796 [Seb Krier]
I also think that the
00:26:59,016 --> 00:27:05,825 [Seb Krier]
hard part is maybe more the kind of normative question of whose values and when, and what and everything. That’s the kind of thing that we’re looking into more. But
00:27:07,096 --> 00:27:13,696 [Seb Krier]
yeah, I prefer the word actually instruction following or intent following or something instead of alignment. And I think by and large, they’re actually pretty good at that.
00:27:14,796 --> 00:27:31,636 [Seb Krier]
So again, that doesn’t mean you have to dismiss all sorts of theories and all the kind of power optimization stuff. But I guess my immediate outcome is this goes rather well. Or if I am more concerned by other things like misuse, if you’d like, than kind of the AI’s being innately, inherently kind of internally misaligned.
00:27:31,636 --> 00:28:03,676 [Seth]
This really seems related to your take that intelligence is not at odds with being a tool, right? So a lot of people have this intuition where if you had a super-duper intelligent genie or oracle, it would develop even implicitly some sort of value or goal that orthogonality thesis might have nothing to do with what we want. But you’re more optimistic about the idea that the LLM doesn’t want anything. It’s incorrect to take the intentional stance towards an LLM.
00:28:03,676 --> 00:28:09,236 [Seb Krier]
Not incorrect. It’s actually kind of descriptively useful, even functionally sometimes to use that language.
00:28:10,796 --> 00:28:18,836 [Seb Krier]
But that’s the thing, right? I think we kind of lack the language to properly delineate and differentiate when it’s useful to use that or appropriately descriptive and when it’s not.
00:28:20,076 --> 00:28:41,496 [Seb Krier]
And so I agree that, of course, I think the take I had on this was something like, and I can imagine a tool being an agent and an agent being a tool. Or in principle, I can imagine something being hyper-capable and still being broadly instruction following rather than at a certain level of capability, aha, that’s when the goals change and things get... And it kind of depends on the type of system as well. I imagine not all
00:28:42,656 --> 00:28:45,116 [Seb Krier]
paths lead to the same kind of outcome. But,
00:28:46,256 --> 00:29:13,596 [Seb Krier]
so again, I can see plausible versions of the world where homo hundrio drives or something are a more salient feature of the way we kind of train models. Right now, it doesn’t seem to me very likely that this is a core feature that they have. But of course, it’s hard to kind of either prove or disprove, right? Because someone might just say, well, that’s because they’re very good at hiding this or something, or once they’re capable enough or whatever. So there’s always a bit of this kind of gotcha thing. It’s like deception. But
00:29:14,936 --> 00:29:39,896 [Seb Krier]
yeah. So in principle, I guess I can totally conceive of at least a superintelligence that is controllable, that is benign, that is at least subservient to the goals of humanity or a user or principle or whatever. That could still be used to cause enormous harm, but it’s just I don’t necessarily think the analogies of, oh, I think Tegmark was thinking, look at the zoo where the monkey’s going. I think these are just not really
00:29:41,736 --> 00:29:43,136 [Seb Krier]
helpful kind of analogies.
00:29:44,276 --> 00:30:02,396 [Seth]
Monkey at the zoo, but you’ve also got the monkey’s paw, right? Maybe the reason some prefer alignment to instruction following is we all know the story of, be careful what you wish for. You wish for something, and it’s under-specified, and you get the bad version of it because the AI doesn’t understand the context.
00:30:02,396 --> 00:30:08,336 [Seb Krier]
I think that’s why, yeah, I think maybe instruction following is maybe too... Intent following or something gets to it more.
00:30:09,936 --> 00:30:18,316 [Seb Krier]
But of course, that problem doesn’t go, even if it follows intent or something, you could still have all the problems because your intent is nefarious or whatever. So
00:30:19,436 --> 00:30:19,816 [Seb Krier]
I think the
00:30:21,356 --> 00:31:06,756 [Seb Krier]
way you deal with that is all sorts of, I don’t know how to conceptualize it, but in fact scaffolds. It’s a bit more this outside of the model or something. I’m kind of almost indexing on a world that will indeed have agents that are trained to be bad or whatever, or someone going to be instructed to do bad things. But just like with humans, you come up with all sorts of kind of systems, rules, laws, norms, kind of protocols that either discourage the kind of bad behavior, or punishes it, or makes it just not worthwhile or something. But I’m not going to put all my bets on the, oh, it has to be pure-hearted, and that will be sufficient. And then you just scale it forever, and it’s going to be an amazing goal. I just think that the way of seeing or thinking about AI is that I just find kind of a bit
00:31:08,096 --> 00:31:12,656 [Seb Krier]
too narrow, I guess. I think it’s important, it’s just insufficient, and it’s certainly not my main kind of a-- yeah.
00:31:14,946 --> 00:31:15,206 [Andrey]
Okay.
00:31:16,666 --> 00:31:20,086 [Andrey]
Our audience is very much composed of economists.
00:31:22,586 --> 00:31:30,506 [Andrey]
If you’re an economist and you’re very interested in AI, what sort of work would you be trying to do?
00:31:30,506 --> 00:31:32,146 [Seth]
Maybe to be useful to AI people-
00:31:32,146 --> 00:31:32,216 [Andrey]
Yes
00:31:32,216 --> 00:31:37,466 [Seth]
... in particular. What would you want, what did the DeepMind team want to read from economists?
00:31:37,466 --> 00:32:20,766 [Seb Krier]
I think kind of engaging with their assumptions or something, right? If you assume, let’s say, an AG-- and I think some do, to be fair. I actually think there’s a lot more, I think, discourse now going on between economists and AI people, whatever. But assuming that you do have AI systems that are interchangeable or almost quasi-fully substitutable with humans, that come up with good ideas, that are parallelizable and whatnot, what does that change to your kind of growth function and so on? So, maybe that’s useful. Right now, in the short term, at least, there’s all sorts of questions around labor, there’s questions around productivity or adoption. Clearly, there’s useful work to be done there. But I think in terms of AGI specifically, given that a lot of the field just thinks you’re going to get to AGI in the next five to 10 years,
00:32:22,746 --> 00:32:26,806 [Seb Krier]
what are the implications for taxation? What are the implications for
00:32:28,626 --> 00:32:37,786 [Seb Krier]
how that’ll affect different states across the world? I think I’m probably more worried about a call center in Hyderabad than I am about the white-collar worker in North America or something. So,
00:32:39,066 --> 00:32:57,306 [Seb Krier]
yeah. I think all these kind of questions, but just indexing more and making fewer, I guess, assumptions around the limits of capabilities. Because sometimes you see them kind of being implicitly snuck in somewhere or something of like, well, because AIs can’t do XYZ, therefore... And yeah, fine, but maybe they will do XYZ. And then what? How does that change your thinking? Yeah.
00:32:57,306 --> 00:32:59,506 [Seth]
Maybe more scenario planning than,
00:33:00,526 --> 00:33:04,746 [Seth]
here’s my median projection, or here is one projection I think is plausible.
00:33:04,746 --> 00:33:22,846 [Seb Krier]
Yeah. And embedding the kind of thoughtful models and thinking that economists have within these scenarios and making them more salient to the kind of computer scientists, right? Even when I brought up competitive advantage, people will be like, “Oh, but what if the AI is cheaper and better?” It’s like, well, that’s not the point. The opportunity cost point of competitive advantage, there’s a difference.
00:33:22,846 --> 00:33:23,286 [Andrey]
[laughs]
00:33:23,286 --> 00:33:31,786 [Seb Krier]
And again, there are answers to that as well, but I think just kind of better translating, I think, some of these insights to the AI tribe, the thing is useful.
00:33:32,846 --> 00:33:40,526 [Andrey]
So that’s very naturally leading us to this question about yourself. And you do lots of different things.
00:33:41,946 --> 00:33:50,426 [Andrey]
You’re prolific on Twitter, for sure. But also, you’re doing internal work for DeepMind. How do you allocate your time?
00:33:52,066 --> 00:33:52,166 [Seb Krier]
I don’t know.
00:33:52,166 --> 00:33:53,266 [Seth]
What percentage is Twitter?
00:33:53,266 --> 00:33:54,646 [Andrey]
Yeah. [laughs]
00:33:54,646 --> 00:34:04,686 [Seb Krier]
Twitter is actually not that much today. It must be an hour max or something, an hour and a half, two hours, maybe, something. But that is maybe much by others’ standards. But the-
00:34:04,686 --> 00:34:06,476 [Andrey]
[laughs] What is the optimal amount of Twitter? [laughs]
00:34:06,476 --> 00:34:29,866 [Seb Krier]
[laughs] Yeah. It’s the Pareto optimal. I guess, in my day-to-day work, it’s a mixture of proactive and reactive. Proactive in the sense that I think, oh, these questions of agents and cybersecurity and liability and whatnot, and biosecurity are kind of important things to look into, and therefore, there’s a lot of research that I do and colleagues do, and a lot of coordination across the org.
00:34:31,026 --> 00:34:39,486 [Seb Krier]
But there’s also more reactive stuff because we’re a policy team, and so there’s things happening in the external world like CA 53, the preemption debates.
00:34:40,546 --> 00:34:48,386 [Seb Krier]
So it’s a bit of a mix of that. And of course, all sorts of internal dynamics. But, yeah. I guess I’m curious about all sorts of other things, and so when I do have time, and I’ve kind of
00:34:50,006 --> 00:34:58,106 [Seb Krier]
completed the main quests, I try to keep some time for other stuff I’m interested in. I work with some research teams and kind of look into what they’re into. I’ll
00:34:59,266 --> 00:35:09,826 [Seb Krier]
find topics or themes that I think are maybe kind of neglected or underrated or I just don’t see out there as much, and like, “Oh, cool. We’re going to try to find out about this more.” But I think it’s just very kind of curiosity driven, and the allocation of time is
00:35:11,566 --> 00:35:16,705 [Seb Krier]
not super thought out. It’s more like, oh, I think these things are interesting, and I’m going to get into that for a bit. [laughs]
00:35:16,706 --> 00:35:22,306 [Andrey]
So it wasn’t a deliberate strategy of getting Tyler’s attention and adoration. [laughs]
00:35:22,306 --> 00:35:25,126 [Seb Krier]
No, not at all. Not at all. But I’m very-
00:35:25,126 --> 00:35:25,746 [Seth]
The long play
00:35:25,746 --> 00:35:30,565 [Seb Krier]
... very grateful for his... [laughs] For the meme. But-
00:35:30,566 --> 00:35:41,766 [Seth]
What kind of, but I know you can’t be specific, but for your sort of internal work, what does a work product look like? Are you participating in a meeting and giving hot takes? Are you writing internal memos? What is-
00:35:41,766 --> 00:35:42,026 [Seb Krier]
Yeah
00:35:42,026 --> 00:35:42,276 [Seth]
... in-
00:35:42,276 --> 00:35:56,406 [Seb Krier]
It’s a mixture. Obviously, meetings. Any large bureaucracy will have meetings. But I think a lot of analysis, memos to execs sometimes. Just research, managing researchers sometimes, depending on the project.
00:35:57,626 --> 00:36:04,106 [Seb Krier]
We’ll have a lot of coordination. Actually, I’m realizing through a lot of these kind of meetings, a lot of it is just kind of coordination and information transfer, right?
00:36:04,106 --> 00:36:04,146 [Andrey]
[laughs]
00:36:04,146 --> 00:36:07,006 [Seb Krier]
It’s maybe why I’m so obsessed with the Coasean bargaining thing. Just let-
00:36:07,006 --> 00:36:07,326 [Seth]
Ah
00:36:07,326 --> 00:36:08,546 [Seb Krier]
... the agents do it. But,
00:36:09,806 --> 00:36:34,116 [Seb Krier]
yeah. I think the day-to-day work is a lot of reading, a lot of meetings, a lot of writing, and distilling and translating information, I think, across different tribes also. So if I’m talking to legal people, like lawyers, about what’s going on in, say, the more technical side of the org, or if I’m speaking to the researchers about something that’s more... But yeah, there’s a lot of translating of concepts across different stakeholders, I guess.
00:36:34,116 --> 00:36:45,726 [Andrey]
So how does that work in an org like Google? Because I think in a lot of orgs, they’re really obsessed with KPIs and output metrics.
00:36:45,726 --> 00:36:46,156 [Seb Krier]
Mm-hmm.
00:36:46,156 --> 00:36:48,746 [Andrey]
And what you’re describing sounds very-
00:36:48,746 --> 00:36:49,706 [Seth]
Hot takes per meeting. [laughs]
00:36:49,706 --> 00:36:54,926 [Andrey]
Yeah. Very much amorphous, very hard to measure.
00:36:56,066 --> 00:36:56,196 [Seb Krier]
Yeah.
00:36:56,196 --> 00:37:00,606 [Andrey]
Obviously, you have a lot of external visibility, but is that
00:37:02,786 --> 00:37:07,846 [Andrey]
a problem? Or is that just it’s understood that that’s how this goes? Yeah.
00:37:07,846 --> 00:37:13,846 [Seb Krier]
I think the external stuff is kind of almost just very separate from the kind of day-to-day work side of things.
00:37:14,986 --> 00:37:23,366 [Seb Krier]
And yeah, internally, we do have KPIs or equivalents or whatever. I think they may be less numerical in nature. But you might still have some, develop a consistent position on
00:37:24,506 --> 00:37:30,819 [Seb Krier]
X issue or something in the next two, three months.And that requires a lot of research work, coordinating.
00:37:30,819 --> 00:37:32,929 [Seth]
Have 10 opinions. [laughs]
00:37:32,930 --> 00:37:38,100 [Seb Krier]
No, ideally they just want one. I think 10 opinions, that’s the issue. There are a lot of opinions out there. You’ve got to find the good ones.
00:37:38,100 --> 00:37:39,530 [Seth]
That’s the main problem with economists.
00:37:39,530 --> 00:37:42,350 [Seb Krier]
But [laughs] yeah. Exactly. Who was that quote?
00:37:43,830 --> 00:37:44,290 [Seth]
Truman.
00:37:44,290 --> 00:37:44,330 [Seb Krier]
Yeah.
00:37:44,330 --> 00:37:46,210 [Seth]
Truman begged for the one-handed economist.
00:37:46,270 --> 00:38:20,990 [Seb Krier]
Yeah, exactly. But, so I think, yeah, I think internally it’s just a kind of analysis or something. Say you’re thinking about, oh, agents and legal liability. How do these things work? What does the existing legal environment say and prescribe? What happens if something goes wrong? What are relevant factors? There’s a lot of that kind of thing. And I guess particularly within the DeepMind side, because when we’re on the frontier side, we’re thinking about the next five years as opposed to what’s going on right now. But yeah, the other side stuff is really just kind of out of personal interest and just me writing stuff, and they seem fine with it so far. [chuckles]
00:38:20,990 --> 00:38:26,510 [Andrey]
What about... So we’ll be at a conference together, the Post-AGI conference-
00:38:26,510 --> 00:38:26,830 [Seb Krier]
Ooh
00:38:26,830 --> 00:38:28,370 [Andrey]
... at Lighthaven, Berkeley.
00:38:28,370 --> 00:38:30,110 [Seth]
Ooh. Prestigious.
00:38:31,130 --> 00:38:32,990 [Andrey]
I don’t know if it’s prestigious.
00:38:34,550 --> 00:38:34,629 [Seth]
[laughs]
00:38:34,630 --> 00:38:45,730 [Andrey]
But you’ve gone to a few of these conferences, like the Curve is another fairly well-known one. What’s your take on these?
00:38:45,730 --> 00:38:54,750 [Seb Krier]
I think some are useful. The majority of conferences I go to, I don’t exactly find that life-transforming, I guess.
00:38:54,750 --> 00:38:57,610 [Andrey]
[laughs] You’re going to the wrong conference. [laughs]
00:38:57,610 --> 00:39:09,290 [Seb Krier]
I know. Can someone show me the... But I think, yeah, they obviously perform a social function to some degree, right? There’s a lot of meeting people, some networking or something, some kind of finding out new ideas. But
00:39:10,390 --> 00:39:20,310 [Seb Krier]
my issue with conferences, very often they’re just very tame. They’re very risk-averse. They’re very the same ideas you’ve-- Already if you can read it online or something, it depends on the conference. But,
00:39:21,510 --> 00:39:24,190 [Seb Krier]
although I have been to really good ones, too. There was this
00:39:25,570 --> 00:39:43,529 [Seb Krier]
IMF conference with Econ Ty, with I think Anton Korinek and others had organized. And that was great because that was a nice one where you had both the technologists and a lot of economists and loads of presentations, and you got to learn lots of new things. But, in general, I don’t see a huge... Beyond maybe showing, again, some hot takes here and there.
00:39:45,370 --> 00:39:49,990 [Seb Krier]
Yeah, some I assume are good conferences. [chuckles]
00:39:49,990 --> 00:40:00,670 [Seth]
I’m just the exception, but you had a great joke on your Twitter the other day about this, which is, Caveman panelist one, “Fire is bad.” Caveman panelist two, “Fire is good.”
00:40:00,670 --> 00:40:00,770 [Seb Krier]
Yeah.
00:40:00,770 --> 00:40:02,100 [Seth]
Caveman panelist three,
00:40:03,450 --> 00:40:07,120 [Seth]
“We need to balance the upsides and downsides of fire and use it wisely.”
00:40:07,120 --> 00:40:07,320 [Seb Krier]
Absolutely.
00:40:07,320 --> 00:40:09,620 [Seth]
Wild applause. [laughs]
00:40:09,620 --> 00:40:09,650 [Andrey]
[laughs]
00:40:09,650 --> 00:40:14,850 [Seb Krier]
Exactly. There’s a lot of that. That’s the energy that I’m getting very tired of because it’s-
00:40:14,850 --> 00:40:15,050 [Seth]
[laughs]
00:40:15,050 --> 00:40:21,700 [Seb Krier]
And I like playing the role of the wise centrist opinion, whatever. But it does get very-
00:40:21,700 --> 00:40:23,150 [Seth]
You do get wild applause.
00:40:23,150 --> 00:40:24,470 [Seb Krier]
Yeah. All the time. [chuckles]
00:40:26,490 --> 00:40:29,770 [Seb Krier]
But yeah, I think there’s a lot of that. I wish there were more
00:40:30,810 --> 00:40:35,090 [Seb Krier]
almost private Chatham House-y conferences, where you had people who highly disagreed with each other-
00:40:35,090 --> 00:40:35,210 [Andrey]
Mm
00:40:35,210 --> 00:40:36,770 [Seb Krier]
... but were polite and didn’t get at
00:40:37,950 --> 00:40:49,370 [Seb Krier]
each other’s throats. And you had more setups that actually allowed ideas to clash a bit more, in a civilized way, of course. But that would be a bit hard, but also much more interesting, I think, than
00:40:51,490 --> 00:40:55,390 [Seb Krier]
everyone broadly agreeing that it’s good to be good and it’s bad to be bad, and yeah. [chuckles]
00:40:55,390 --> 00:41:03,710 [Andrey]
I do feel like the Lighthaven conferences are quite good for this, in that there’s an enormous amount of free time and-
00:41:03,710 --> 00:41:04,130 [Seb Krier]
Mm-hmm
00:41:04,130 --> 00:41:07,770 [Andrey]
... free space that’s not where the talk is happening.
00:41:07,770 --> 00:41:07,940 [Seb Krier]
Yeah.
00:41:07,940 --> 00:41:10,630 [Andrey]
And so you do get a lot of this.
00:41:10,630 --> 00:41:11,040 [Seb Krier]
Well, yeah, I agree.
00:41:11,040 --> 00:41:21,090 [Andrey]
But I agree that many conferences are not like that, where you’re just packed. You have a conference hall, and you don’t have anywhere else to go, and it’s packed with talks. Yeah.
00:41:21,090 --> 00:41:21,710 [Seb Krier]
Yeah. No, totally.
00:41:21,710 --> 00:41:23,550 [Seth]
NBER Summer Institute. [laughs]
00:41:24,750 --> 00:41:28,330 [Andrey]
Seth, there is disagreement. Say what you will. At NBER-
00:41:28,330 --> 00:41:28,540 [Seth]
There is fire
00:41:28,540 --> 00:41:29,430 [Andrey]
... people throw down.
00:41:30,450 --> 00:41:31,430 [Andrey]
[laughs]
00:41:31,430 --> 00:41:37,720 [Seth]
[laughs] I’ve never seen a meaner comment than I have seen from a discussant at NBER Summer Institute. [laughs]
00:41:37,720 --> 00:41:52,570 [Seb Krier]
[laughs] The Progress Conference, for example, last year, was one that I thought was really good. That was at Lighthaven, in fact. I think the setup and the kind of people and the curation and so just made it something that I found quite engaging. [upbeat music]
00:41:52,570 --> 00:41:56,490 [Seth]
So you brought up this idea, as we were talking, about you
00:41:58,330 --> 00:42:21,049 [Seth]
think there are so many meetings in your organization because it’s so hard, yet so critical to transfer information. And there’s this Coasean idea that so much of why the economy works the way it does is just the idea of transaction costs, right? In addition to kind of this Hayekian idea of local information that’s hard to share.
00:42:21,050 --> 00:42:21,810 [Seb Krier]
Mm-hmm.
00:42:21,810 --> 00:42:23,960 [Seth]
You have a very influential essay
00:42:25,130 --> 00:42:30,230 [Seth]
that kind of maybe stole some of Andrey’s thunder, but is still an excellent essay-
00:42:30,230 --> 00:42:31,040 [Seb Krier]
[laughs]
00:42:31,040 --> 00:42:46,210 [Seth]
... about this idea of, well, what happens when AIs go out there and can micro-bargain costlessly with each other at high frequency over very, what might seem to us, small issues.
00:42:47,570 --> 00:42:57,440 [Seth]
Tell us maybe in a few sentences, what’s that vision and what’s the positive vision for why that would be good for society, for us to have AI agents constantly bargaining for us over stuff?
00:42:59,130 --> 00:43:01,810 [Seb Krier]
Yeah. I guess the idea is, as you mentioned, there’s all sorts of
00:43:03,990 --> 00:43:26,350 [Seb Krier]
transaction costs that mean that we don’t get to bargain on things that we would otherwise bargain for. And instead, you get these blunt rules and these solutions that kind of work, but come with all sorts of externalities or aren’t super efficient. And so the idea is, if you can actually do this kind of negotiation at scale for very little, and that’s a big assumption. That’s not a given either,
00:43:27,850 --> 00:43:35,586 [Seb Krier]
then you could solve all sorts of things thatAnd also just kind of problems that would otherwise not be even conceivable in the first place.
00:43:36,726 --> 00:43:41,186 [Seth]
One example you give, just so we can be a little bit more specific, is noise standards, right?
00:43:41,186 --> 00:43:41,456 [Seb Krier]
Right.
00:43:41,456 --> 00:43:57,226 [Seth]
So you can’t throw a loud party after 10:00 PM in such and such a place. But you think that maybe AI agents could come to a less coarse rule that is, get us more to the grand coalition of allocative efficiency than a coarse rule like that.
00:43:57,226 --> 00:44:01,166 [Seb Krier]
Yeah. To be fair, that’s probably a problem that no one really cares about except me because of like- [chuckles]
00:44:01,166 --> 00:44:02,086 [Seth]
No. Dude.
00:44:02,086 --> 00:44:03,645 [Andrey]
I care about it so much.
00:44:03,645 --> 00:44:04,626 [Seb Krier]
Oh, really? Okay, cool.
00:44:04,626 --> 00:44:04,746 [Andrey]
Yes.
00:44:04,746 --> 00:44:07,816 [Seb Krier]
Maybe that’s a good example then. But yeah, the idea here is,
00:44:09,146 --> 00:44:17,006 [Seb Krier]
my neighbor is throwing a party, and instead of there being some sort of rule that says you’re not allowed to throw parties after 11:00, he could maybe just compensate me for the noise or something.
00:44:18,326 --> 00:44:21,686 [Seb Krier]
Or in fact, that’s one of the key crux of the whole Coasean thing is maybe
00:44:24,186 --> 00:44:36,085 [Seb Krier]
I have to compensate him to stop his parties. And it kind of depends where the initial right is. But broadly, you could have these kind of, my whole neighborhood doesn’t want me to party, and they’re just giving me a small payment or the reverse, depending on where the initial allocation is.
00:44:37,226 --> 00:44:44,446 [Seb Krier]
But I think you could have all sorts of micro ways in which these transaction costs at scale help you get much better beneficial outcomes.
00:44:45,486 --> 00:44:48,486 [Seb Krier]
And so that would be the noise one would be like, okay.
00:44:50,406 --> 00:45:18,666 [Seb Krier]
And it’ll probably just also let people kind of regroup into the party people just going into the neighborhood where that’s just generally more party tolerant or something, and the kind of peace and quiet preferring people just... Because I think one of the points with the piece was that AI also helps you coordinate better. You can use this stuff to find people who have the same interests and preferences as you or something, and just then bargain or negotiate or whatnot in that way as well.
00:45:20,626 --> 00:45:27,386 [Seth]
So it’s not just bargaining over externalities that are negative, it’s maybe coordinating over positive externalities, right?
00:45:27,386 --> 00:45:27,526 [Seb Krier]
Yeah.
00:45:28,766 --> 00:45:51,746 [Seth]
What pieces do we need in the economy to make this a reality, and what time horizon are you thinking about? So obviously this is an idea that you could have a small version of, and then like the sci-fi, this is constantly, I’m allowed to speed in my car today because I really need to get to work because I’m late, and it’s bargaining with all the cars on the highway at ultra-high frequency. So what are the time horizons you have in mind, and what pieces do we need?
00:45:51,746 --> 00:46:21,786 [Seb Krier]
Honestly, I haven’t even thought about the timelines really. [laughing] For me, this was mostly kind of an aspirational thing of like, well, it looks like we could unlock some cool things, and because there’s all these-- It’d be nice to have a positive vision of how things might pan out. It certainly doesn’t mean that everything has to be negotiated and bargained over. But I could see a large proportion of things, certainly in everyday life, like I could just tell my aunt, “You don’t have to worry about your parking issues anymore. It’s just sorted now,” whatever. The agents are taking care of that. And so it kind of depends on what scale you’re talking about. Certainly having democracy at scale and
00:46:23,626 --> 00:46:29,086 [Seb Krier]
half automated and half made more efficient through these systems or something is something that I think is going to take a long time.
00:46:30,426 --> 00:46:47,986 [Seb