
AGI is Still 30 Years Away — Ege Erdil & Tamay Besiroglu

The economy will literally double every year afterwards

Ege Erdil and Tamay Besiroglu have 2045+ timelines, think the whole "alignment" framing is wrong, don't think an intelligence explosion is plausible, but are convinced we'll see explosive economic growth.

This discussion offers a totally different scenario than my recent interview with Scott and Daniel.

Ege and Tamay are the co-founders of Mechanize (disclosure - I’m an angel investor), a startup dedicated to fully automating work. Before founding Mechanize, Ege and Tamay worked on AI forecasts at Epoch AI.

Watch on YouTube; listen on Apple Podcasts or Spotify.

Sponsors

  • WorkOS makes it easy to become enterprise-ready. With simple APIs for essential enterprise features like SSO and SCIM, WorkOS helps companies like Vercel, Plaid, and OpenAI meet the requirements of their biggest customers. To learn more about how they can help you do the same, visit workos.com

  • Scale’s Data Foundry gives major AI labs access to high-quality data to fuel post-training, including advanced reasoning capabilities. If you’re an AI researcher or engineer, learn about how Scale’s Data Foundry and research lab, SEAL, can help you go beyond the current frontier at scale.com/dwarkesh

  • Google's Gemini 2.5 Pro is the model we use the most at Dwarkesh Podcast: it helps us generate transcripts, identify interesting clips, and code up new tools. If you want to try it for yourself, it's now available in Preview with higher rate limits! Start building with it today at aistudio.google.com.

To sponsor a future episode, visit dwarkesh.com/advertise.

Timestamps

(00:00:00) - AGI will take another 3 decades

(00:22:27) - Even reasoning models lack animal intelligence

(00:45:04) - Intelligence explosion

(01:00:57) - Ege & Tamay’s story

(01:06:24) - Explosive economic growth

(01:33:00) - Will there be a separate AI economy?

(01:47:08) - Can we predictably influence the future?

(02:19:48) - Arms race dynamic

(02:29:48) - Is superintelligence a real thing?

(02:35:45) - Reasons not to expect explosive growth

(02:49:00) - Fully automated firms

(02:54:43) - Will central planning work after AGI?

(02:58:20) - Career advice

Transcript

AGI will take another 3 decades

Dwarkesh Patel 00:00:00
Today, I’m chatting with Tamay Besiroglu and Ege Erdil. They were previously running Epoch AI and are now launching Mechanize, which is a company dedicated to automating all work. One of the interesting points you made recently, Tamay, is that the whole idea of the intelligence explosion is mistaken or misleading. Why don’t you explain what you’re talking about there?
Tamay Besiroglu 00:00:22
Yeah, I think it’s not a very useful concept. It’s kind of like calling the Industrial Revolution a horsepower explosion. Sure, during the Industrial Revolution, we saw this drastic acceleration in raw physical power, but there are many other things that were maybe equally important in explaining the acceleration of growth and technological change that we saw during the Industrial Revolution.
Dwarkesh Patel 00:00:42
What is a way to characterize the broader set of things that the horsepower perspective would miss about the Industrial Revolution?
Tamay Besiroglu 00:00:50
So I think in the case of the Industrial Revolution, it was a bunch of these complementary changes to many different sectors in the economy. So you had agriculture, you had transportation, you had law and finance, you had urbanization and moving from rural areas into cities. There were just many different innovations that happened simultaneously that gave rise to this change in the way of economically organizing our society.
It wasn’t just that we had more horsepower. I mean, that was part of it, but that’s not the kind of central thing to focus on when thinking about the Industrial Revolution. And I think similarly, for the development of AI, sure, we’ll get a lot of very smart AI systems, but that will be one part among very many different moving parts that explain why we expect to get this transition and this acceleration and growth and technological change.
Dwarkesh Patel 00:01:46
I want to better understand how you think about that broader transformation. Before we do, the other really interesting part of your worldview is that you have longer timelines to get to AGI than most of the people in San Francisco who think about AI. When do you expect a drop-in remote worker replacement?
Ege Erdil 00:02:05
Maybe for me, that would be around 2045.
Dwarkesh Patel 00:02:10
Wow. Wait, and you?
Tamay Besiroglu 00:02:11
Again, I’m a little bit more bullish. I mean, it depends what you mean by “drop-in remote worker” and whether it’s able to do literally everything that can be done remotely, or do most things.
Ege Erdil 00:02:21
I’m saying literally everything.
Tamay Besiroglu 00:02:22
For literally everything. Just shade Ege’s predictions by five years or by 20% or something.
Dwarkesh Patel 00:02:27
Why? Because we’ve seen so much progress over even the last few years. We’ve gone from ChatGPT two years ago to now we have models that can literally do reasoning, are better coders than me, and I studied software engineering in college. I mean, I did become a podcaster, I’m not saying I’m the best coder in the world.
But if you made this much progress in the last two years, why would it take another 30 to get to full automation of remote work?
Ege Erdil 00:03:01
So I think that a lot of people have this intuition that progress has been very fast. They look at the trend lines and just extrapolate; obviously, it’s going to happen in, I don’t know, 2027 or 2030 or whatever. They’re just very bullish. And obviously, that’s not a thing you can literally do.
There isn’t a trend you can literally extrapolate of “when do we get to full automation?”. Because if you look at the fraction of the economy that is actually automated by AI, it’s very small. So if you just extrapolate that trend, which is something, say, Robin Hanson likes to do, you’re going to say, “well, it’s going to take centuries” or something.
Now, we don’t agree with that view. But I think one way of thinking about this is how many big things are there? How many core capabilities, competences are there that the AI systems need to be good at in order to have this very broad economic impact, maybe 10x acceleration and growth or something? How many things have you gotten over the past 10 years, 15 years? And we also have this compute-centric view…
Tamay Besiroglu 00:04:05
So just to double click on that, I think what Ege is referring to is, if you look at the past 10 years of AI progress, we’ve gone through about nine or 10 orders of magnitude of compute, and we got various capabilities that were unlocked. So in the early period, people were solving gameplay on specific games, on very complex games. And that happened from 2015 to 2020, Go and Chess and Dota and other games. And then you had maybe sophisticated language capabilities that were unlocked with these large language models, and maybe advanced abstract reasoning and coding and maybe math. That was maybe another big capability that got unlocked.
And so maybe there are a couple of these big unlocks that happened over the past 10 years, but that happened on the order of once every three years or so, or maybe one every three orders of magnitude of compute scaling. And then you might ask the question, “how many more such competencies might we need to unlock in order to be able to have an AI system that can match the capabilities of humans across the board?” Maybe specifically just on remote work tasks. And so then you might ask, well, maybe you need kind of coherence over very long horizons, or you need agency and autonomy, or maybe you need full multimodal understanding, just like a human would.
And then you ask the question, “okay, how long might that take?” And so you can think about, well, just in terms of calendar years, the previous unlocks took about, you get one every three years or so. But of course, that previous period coincided with this rapid scale-up of the amount of compute that we use for training. So we went through maybe 9 or 10 orders of magnitude since AlexNet compared to the biggest models we have today. And we’re getting to a level where it’s becoming harder and harder to scale up compute. And we’ve done some extrapolations and some analysis looking at specific constraints, like energy or GPU production.
And based on that, it looks like we might have maybe three or four orders of magnitude of scaling left. And then you’re really spending a pretty sizable fraction or a non-trivial fraction of world output on just building up data centers, energy infrastructure, fabs, and so on.
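A rough back-of-envelope sketch of the cadence argument being made here- every number below is an illustrative assumption, not a figure from the conversation:

```python
# If each "big capability unlock" has historically cost roughly three orders of
# magnitude (OOMs) of training-compute scaling, and only a few OOMs of feasible
# scaling remain, the remaining budget covers far fewer unlocks than full
# automation of remote work might require. All numbers are illustrative.

ooms_scaled_so_far = 9.5     # assumed OOMs of training compute since AlexNet
unlocks_so_far = 3           # e.g. game-playing, language, abstract reasoning (assumed)
ooms_per_unlock = ooms_scaled_so_far / unlocks_so_far

ooms_remaining = 3.5         # assumed feasible scaling left (energy, fabs, spending)
unlocks_still_needed = 3     # e.g. long-horizon agency, multimodality, ... (assumed)

print(f"historical cost per unlock: ~{ooms_per_unlock:.1f} OOMs")
print(f"remaining budget covers:    ~{ooms_remaining / ooms_per_unlock:.1f} more unlocks")
print(f"assumed still needed:        {unlocks_still_needed} unlocks")
```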
Dwarkesh Patel 00:06:40
Which is already like 2% of GDP, right?
Tamay Besiroglu 00:06:42
I mean, currently it’s less than 2%.
Ege Erdil 00:06:44
Yeah, but also currently most of it is actually not going towards AI chips. But even most TSMC capacity currently is going towards mobile phone chips or something like that, right?
Dwarkesh Patel 00:06:52
Even leading edge. It’s like 5% of leading edge.
Tamay Besiroglu 00:06:55
Yeah, even leading edge is pretty small. But yeah, so that suggests that we might need a lot more compute scaling to get these additional capabilities to be unlocked. And then there’s a question of do we really have that in us as an economy to be able to sustain that scaling?
Dwarkesh Patel 00:07:14
But it seems like you have this intuition that there’s just a lot left to intelligence. When you play with these models, they’re almost there. You forget you’re often talking to an AI.
Ege Erdil 00:07:26
What do you mean they’re almost there? I don’t know. I can’t ask Claude to pick up this cup and put it over there.
Dwarkesh Patel 00:07:31
Remote work, you know?
Ege Erdil 00:07:32
Okay. But even for remote work, I can’t ask Claude to… I think the current computer use systems can’t even book a flight properly.
Dwarkesh Patel 00:07:38
How much of an update would it be if by the end of 2026, they could book a flight?
Ege Erdil 00:07:43
I probably think by the end of this year, they’re going to be able to do that. But that’s a very simple… Nobody gets a job where they’re paid to book flights. That’s not a task.
Dwarkesh Patel 00:07:54
I think some people do.
Tamay Besiroglu 00:07:56
If it’s literally just a book flight job, and without-
Ege Erdil 00:08:00
But I think that’s an important point, because a lot of people look at jobs in the economy, and then they’re like, “oh, that person, their job is to just do X”. But then that’s not true. That’s something they do in their job. But if you look at the fraction of their time on the job that they spend on doing that, it’s a very small fraction of what they actually do. It’s just this popular conception people have. Or travel agents, they just book hotels and flights. But that’s not actually most of their job. So automating that actually wouldn’t automate their job, and it wouldn’t have that much of an impact on the economy.
So I think this is actually an important worldview difference that separates us from people who are much more bullish, because they think jobs in the economy are much simpler in some sense, and they’re going to take much fewer competences to actually fully automate.
Dwarkesh Patel 00:08:47
So our friend Leopold has this perspective of, quote unquote, ‘unhobblings’, where the way to characterize it might be, they’re basically like baby AGIs already. And then because of the constraints we artificially impose upon them by, for example, only training them on text and not giving them the training data that is necessary for them to understand a Slack environment or a Gmail environment, or previously before inference time scaling, not giving them the chance to meditate upon what they’re saying and really think it through, and not giving them the context about what is actually involved in this job, only giving them this piecemeal, a couple of minutes worth of context in the prompt, we’re holding back what is fundamentally a little intelligence from being as productive as it could be, which implies that unhobblings just seem easier to solve for than entirely new capabilities of intelligence. What do you make of that framework?
Tamay Besiroglu 00:09:46
I mean, I guess you could have made similar points five years ago and say “you look at AlphaZero and there’s this mini AGI there, and if only you unhobbled it by training it on text and giving it all your context” and so on, that just wouldn’t really have worked. I think you do really need to rethink how you train these models in order to get these capabilities.
Dwarkesh Patel 00:10:08
But I think the surprising thing over the last few years has been that you can start off with this pre-trained corpus of the internet, and it’s actually quite easy. ChatGPT is an example of this unhobbling, where 1% of additional compute spent on getting it to talk in a chatbot-like fashion with post training is enough to make it competent- really competent- at that capability.
Reasoning is another example where it seems like the amount of compute that is spent on RL right now in these models is a small fraction of total compute. Again, reasoning seems complicated, and then you just do 1% of compute and it gets you that. Why not think that computer use, or long-term agency on computer use, is a similar thing?
Tamay Besiroglu 00:10:55
So when you say “reasoning is easy” and “it only took this much compute” and “it wasn’t very much”, and maybe “you look at the sheer number of tokens and it wasn’t very much, and so it looks easy”, well, that’s true from our position today. But I think if you ask someone to build a reasoning model in 2015, then it would have looked insurmountable. You would have had to train a model on tens of thousands of GPUs, you would have had to solve that problem, and each order of magnitude of scaling from where they were would pose new challenges that they would need to solve.
You would need to produce internet-scale data, tens of trillions of tokens, in order to actually train a model that has the knowledge that you can then unlock and access by way of training it to be a reasoning model. You need to maybe make the model more efficient at doing inference and maybe distill it, because if it’s very slow then you have a reasoning model that’s not particularly useful. So you also need to make various innovations to get the model to be distilled so that you can train it more quickly, because these rollouts take very long.
It only really becomes a valuable product if it’s faster than a couple of tokens a second; a reasoning model that slow would have been very difficult to work with. So in some sense, it looks easy from our point of view, standing on this huge stack of technology that we’ve built up over the past five years or so, but at the time, it would have been very hard.
And so my claim would be something like: I think the agency part might be easy in a similar sense, that in three or five years’ time we will look back at what unlocked agency and it’ll look fairly simple. But the amount of work- in terms of these complementary innovations that enable the model to learn how to become a competent agent- might have just been very difficult, and taken years of innovation and a bunch of improvements in hardware and scaling and various other things.
Dwarkesh Patel 00:12:54
Yeah, I feel like what’s dissimilar between 2015 and now… in 2015 if you were trying to solve reasoning, you just didn’t have a base to start on. Maybe if you tried formal proof methods or something, but there was no leg to stand on, where now you’d actually have the thing- you have the pre-trained base model, you have these techniques of scaffolding, of post-training, of RL. And so it seems like you think that those will look to the future as, say, AlphaGo looks to us now in terms of the basis of a broader intelligence.
I’m curious if you have intuitions on why not think that language models as we have them now are like, we got the big missing piece right and now we’re just like plugging things on top of it?
Ege Erdil 00:13:51
Well, I mean, I guess what is the reason for believing that? I mean, you could have looked at AlphaGo or AlphaGo Zero, AlphaZero, those seemed very impressive at the time. I mean, you’re just learning to play this game with no human knowledge, you’re just learning to play it from scratch. And I think at the time it did impress a lot of people. But then people tried to apply it to math, they tried to apply it to other domains, and it didn’t work very well, they weren’t able to get competent agents at math.
So it’s very possible that these models, at least the way we have them right now, you’re going to try to do the same thing people did for reasoning, but for agency, it’s not going to work very well. And then you’re not going to-
Dwarkesh Patel 00:14:32
I’m sorry, you’re saying by the end of 2026, we will have agentic computer use.
Tamay Besiroglu 00:14:36
I think Ege said you’d be able to book a flight, which is very different from having full agentic computer use.
Dwarkesh Patel 00:14:44
I mean, the other things you need to do on a computer are just made up of things like booking a flight.
Ege Erdil 00:14:49
I mean, sure, but they are not disconnected tasks. That’s like saying everything you do in the world is just like you just move parts of your body, and then you move your mouth and your tongue, and then you roll your head. Yeah, individually those things are simple, but then how do you put them together, right?
Dwarkesh Patel 00:15:09
Yeah. Okay. So there’s two pieces of evidence that you can have that are quite dissimilar.
One, the METR eval, which we’ve been talking about privately, which shows that the task length over certain kinds of tasks- I can already see you getting ready- AI’s ability to do the kind of thing that takes a human 10 minutes to do, or an hour to do, or four hours to do, the length of the corresponding human task, seems to be doubling every seven months. The idea being that by 2030, if you extrapolate this curve, they could be doing tasks that take humans one month to do, or one year to do. And then this long-term coherency in executing on tasks is fundamentally what intelligence is. So this curve suggests that we’re getting there.
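A minimal sketch of the extrapolation just described- the seven-month doubling time is the figure cited above, while the starting task length and reference date are illustrative assumptions:

```python
# Extrapolate "task length doubles every ~7 months" until tasks take a human
# roughly one working month. Starting point and reference date are assumptions.
from datetime import date, timedelta

doubling_months = 7
start = date(2025, 1, 1)     # assumed reference date
task_hours = 1.0             # assumed current task length in human-hours
target_hours = 167.0         # roughly one working month of human effort

doublings = 0
while task_hours < target_hours:
    task_hours *= 2
    doublings += 1

eta = start + timedelta(days=doublings * doubling_months * 30.4)
print(f"{doublings} doublings (~{doublings * doubling_months} months) -> around {eta.year}")
# ~8 doublings, i.e. month-long tasks around 2029-2030 on this extrapolation.
```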
The other piece of evidence- I kind of feel like my own mind works this way. I get distracted easily, and it’s hard to keep a long-term plan in my head at the same time. And I’m slightly better at it than these models. But they don’t seem that dissimilar to me. I would have guessed reasoning is just a really complicated thing, and then it seems like, “oh, it’s just something like learning 10 tokens’ worth of MCTS-style ‘wait, let’s go back, let’s think about this another way’”.
Chain of thought alone just gets you this boost. And it just seems like intelligence is simpler than we thought. Maybe agency is also simpler in this way.
Ege Erdil 00:16:39
Yeah. I mean, I think there’s a reason to expect complex reasoning to not be as difficult as people might have thought, even in advance, because a lot of the tasks that AI solved very early on were tasks of various kinds of complex reasoning. So it wasn’t the kind of reasoning that goes into when a human solves a math problem.
But if you look at the major AI milestones over, I don’t know, since 1950, a lot of them are for complex reasoning. Like chess is, you can say, a complex reasoning task. Go is, you could say, a complex reasoning task.
Dwarkesh Patel 00:17:14
But I think there are also examples of long-term agency. Like winning at Starcraft is an example of being agentic over a meaningful period of time.
Ege Erdil 00:17:24
That’s right. So the problem in that case is that it’s a very specific, narrow environment. You can say that playing Go or playing chess, that also requires a certain amount of agency. And that’s true. But it’s a very narrow task. So that’s like saying if you construct a software system that is able to react to a very specific, very particular kind of image, or very specific video feeds or whatever, then you’re getting close to general sensorimotor skill automation.
But the general skill is something that’s very different. And I think we’re seeing that. We still are very far, it seems like, from an AI model that can take a generic game off Steam. Let’s say you just download a game released this year. You don’t know how to play this game. And then you just have to play it. And then most games are actually not that difficult for a human.
Dwarkesh Patel 00:18:21
I mean, what about Claude Plays Pokemon? I don’t think it was trained on Pokemon.
Ege Erdil 00:18:25
Right, so that’s an interesting example. First of all, I find the example very interesting, because yeah, it was not trained explicitly. They didn’t do some RL on playing Pokemon Red. But obviously, the model knows how it’s supposed to play Pokemon Red, because there’s tons of material about Pokemon Red on the internet.
In fact, if you were playing Pokemon Red, and you got stuck somewhere, you didn’t know what to do, you could probably go to Claude and ask “I’m stuck in Mount Moon, and what am I supposed to do?” And then it’s probably able to give you a fairly decent answer. But that doesn’t stop it from getting stuck in Mount Moon for 48 hours. So that’s a very interesting thing, where it has explicit knowledge, but then when it’s actually playing the game, it doesn’t behave in a way which reflects that it has that knowledge.
Dwarkesh Patel 00:19:09
All it’s got to do is plug the explicit knowledge to its actions.
Ege Erdil 00:19:13
Yeah, but is that easy?
Dwarkesh Patel 00:19:15
Okay, if you can leverage your knowledge from pre-training about these games in order to be somewhat competent at them, okay, they’re going to be leveraging a different base of skills. But with that same leverage, they’re going to have a similar repertoire of abilities. If you’ve read everything about whatever skill that every human has ever seen.
Ege Erdil 00:19:43
A lot of the skills that people have, they don’t have very good training data for them.
Dwarkesh Patel 00:19:48
That’s right. What would you want to see over the next few years that would make you think, “oh, no, I’m actually wrong and this was the last unlock, and it was now just a matter of ironing out the kinks”. And then we get the thing that will kick off the, dare I say, intelligence explosion.
Tamay Besiroglu 00:20:04
I think something that would reveal its ability to do very long context things, use multimodal capabilities in a meaningful way, and integrate that with reasoning and other types of systems. And also agency: being able to take action over a long horizon and accomplish some task that takes very long for humans to do, not just in specific software environments, but just very broadly; say downloading an arbitrary game from Steam, something that it’s never seen before, where it doesn’t really have much training data- maybe it was released after a training cutoff, so there are no tutorials or earlier versions of the game that have been discussed on the Internet- and then actually playing that game to the end and accomplishing these various milestones that are challenging for humans. That would be a substantial update. I mean, there are other things that would update me, too, like OpenAI making a lot more revenue than it’s currently doing.
Dwarkesh Patel 00:21:11
Is the hundred billion in revenue that would, according to their contract, mark them as AGI enough?
Tamay Besiroglu 00:21:15
I think that’s not a huge update to me if that were to happen. So I think the update would come if it was, in fact, $500 billion in revenue or something like that. But then I would certainly update quite a lot. But a hundred billion, that seems pretty kind of likely to me. I would assign that maybe a 40 percent chance or something.
Dwarkesh Patel 00:21:37
If you’ve got a system that is, in producer surplus terms, worth a hundred billion. And the difference between this and AlphaZero is AlphaZero is never going to make a hundred billion dollars in the marketplace. So just what is intelligence? It’s like something able to usefully accomplish its goals, or your goals. If people are willing to pay a hundred billion dollars for it, that’s pretty good evidence that it’s like accomplishing some goals.
Tamay Besiroglu 00:22:05
I mean, people pay a hundred billion dollars for all sorts of things. That itself is not a very strong piece of evidence that it’s going to be transformative, I think.
Ege Erdil 00:22:13
People pay trillions of dollars for oil. I don’t know, it seems like a very basic point. But the fact that people pay a lot of money for something doesn’t mean it’s going to transform the world economy if only we manage to unhobble it. Like that’s a very different claim.

Even reasoning models lack animal intelligence

Dwarkesh Patel 00:22:27
So then this brings us to the intelligence explosion, because what people will say is, we don’t need to automate literally everything that is needed for automating remote work, let alone all human labor in general. We just need to automate the things which are necessary to fully close the R&D cycle needed to make smarter intelligences.
And if you do this, you get a very rapid intelligence explosion. And the end product of that explosion is not only an AGI, but something that is superhuman potentially. These things are extremely good at coding, and reasoning. It seems like the kinds of things that would be necessary to automate R&D at AI labs. What do you make of that logic?
Ege Erdil 00:24:14
I think if you look at their capability profile, if you compare it to a random job in the economy, I agree they are better at doing coding tasks that will be involved in R&D compared to a random job in the economy. But in absolute terms, I don’t think they’re that good. I think they are good at things that maybe impress us about human coders. If you wanted to see what makes a person a really impressive coder, you might look at their competitive programming performance. In fact, companies often hire people, if they’re relatively junior, based on their performance on these kinds of problems. But that is just impressive in the human distribution.
So if you look in absolute terms at what are the skills you need to actually automate the process of being a researcher, then what fraction of those skills do the AI systems actually have? Even in coding, a lot of coding is, you have a very large code base you have to work with, and the instructions are very vague. For example, you mentioned the METR eval, in which, because they needed to make it an eval, all the tasks had to be compact and closed and have clear evaluation metrics: “here’s a model, get its loss on this data set as low as possible”. Or “here’s another model and its embedding matrix has been scrambled, just fix it to recover most of its original performance”, etc.
Those are not problems that you actually work on in AI R&D. They’re very artificial problems. Now, if a human was good at doing those problems, you would infer, I think logically, that that human is likely to actually be a good researcher. But if an AI is able to do them, the AI lacks so many other competences that a human would have- not just the researcher, just an ordinary human- that we don’t think about in the process of research. So our view would be that automating research is, first of all, more difficult than people give it credit for. I think you need more skills to do it and definitely more than models are displaying right now.
And on top of that, even if you did automate the process of research, we think a lot of the software progress has been driven not by cognitive effort- that has played a part- but by compute scaling. We just have more GPUs, so you can do more experiments, figure out more things, and your experiments can be done at larger scales. And that is just a very important driver. If, 10 or 15 years ago, you had been trying to figure out what software innovations were going to be important in 10 or 15 years, you would have had a very difficult time. In fact, you probably wouldn’t even have conceived of the right kind of innovations to be looking at, because you would be so far removed from the context of that later time, with much more abundant compute and all the things that people would have learned by that point.
So these are two components of our view: Research is harder than people think, and depends a lot on compute scale.
Dwarkesh Patel 00:27:17
Can you put a finer point on what is an example of the kind of task which is very dissimilar from ‘train a classifier’ or ‘debug a classifier’ that is relevant to AI R&D?
Tamay Besiroglu 00:27:30
Examples might be introducing novel innovations that are very useful for unlocking innovations in the future. So that might be introducing some novel way of thinking about a problem. A good example might be in mathematics, where we have these reasoning models that are extremely good at solving math problems.
Ege Erdil 00:27:57
Very short horizon.
Tamay Besiroglu 00:28:00
Sure. Maybe not extremely good, but certainly better than I can and better than maybe most undergrads can. And so they can do that very well, but they’re not very good at coming up with novel conceptual schemes that are useful for making progress in mathematics. So it’s able to solve these problems that you can kind of neatly excise out of some very messy context, and it’s able to make a lot of progress there.
But within some much messier context, it’s not very good at figuring out what directions are especially useful for you to build things or make incremental progress on that enables you to have a big kind of innovation later down the line. So thinking about both this larger context, as well as maybe much longer horizon, much fuzzier things that you’re optimizing for, I think it’s much worse at those types of things.
Ege Erdil 00:28:54
Right. So I think one interesting thing is if you just look at these reasoning models, they know so much, especially the larger ones, because they know in literal terms more than any human does in some sense. And we have unlocked these reasoning capabilities on top of that knowledge, and I think that is actually what’s enabling them to solve a lot of these problems. But if you actually look at the way they approach problems, the reason what they do looks impressive to us is because we have so much less knowledge.
And the model is approaching the problems in a fundamentally different way compared to how a human would. A human would have much more limited knowledge, and they would usually have to be much more creative in solving problems because they have this lack of knowledge, while the model knows so much. But you’d ask it some obscure math question where you need some specific theorem from 1850 or something, and then it would just know that, if it’s a large model. So that makes the difficulty profile very different.
And if you look at the way they approach problems, the reasoning models, they are usually not creative. They are very effectively able to leverage the knowledge they have, which is extremely vast. And that makes them very effective in a bunch of ways. But you might ask the question, has a reasoning model ever come up with a math concept that even seems slightly interesting to a human mathematician? And I’ve never seen that.
Dwarkesh Patel 00:30:19
I mean, they’ve been around for all of six months,
Tamay Besiroglu 00:30:23
I mean, that’s a long time. One mathematician might have been able to do a bunch of work over that time, while producing orders of magnitude fewer tokens on math than these models have.
Ege Erdil 00:30:34
And then I just want to emphasize it, because just think about the sheer scale of knowledge that these models have. It’s enormous from a human point of view. So it is actually quite remarkable that there is no interesting recombination, no interesting, “oh, this thing in this field looks kind of like this thing in this other field”. There’s no innovation that comes out of that. And it doesn’t have to be a big math concept, it could be just a small thing that maybe you could add to a Sunday magazine on math that people used to have. But there isn’t even an example of that.
Tamay Besiroglu 00:31:09
I think it’s useful for us to explain a very important framework for our thinking about what AI is good at and what AI is lagging in, which is this idea of Moravec’s paradox, that things that seem very hard for humans, AI systems tend to make much faster progress on, whereas things that look a bunch easier for us, AI systems totally struggle or are often totally incapable of doing that thing. And so this kind of abstract reasoning, playing chess, playing Go, playing Jeopardy, doing kind of advanced math and solving math problems.
Ege Erdil 00:31:49
There are even stronger examples, like multiplying 100-digit numbers in your head, which is just the one that got solved first out of almost any other problem. Or following very complex symbolic logic arguments, like deduction arguments, which people actually struggle with a lot. Like how conclusions logically follow from premises? People have a very hard time with that. Very easy for formal proof systems.
Tamay Besiroglu 00:32:12
An insight that is related and is quite important here is that the tasks that humans seem to struggle on and AI systems seem to make much faster progress on are things that have emerged fairly recently in evolutionary time. So, advanced language use emerged in humans maybe 100,000 years ago, and certainly playing chess and Go and so on are very recent innovations. And so evolution has had much less time to optimize for them, in part because they’re very new, but also in part because when they emerged, there was a lot less pressure because it conferred kind of small fitness gains to humans and so evolution didn’t optimize for these things very strongly.
And so it’s not surprising that on these specific tasks that humans find very impressive when other humans are able to do it, that AI systems are able to make a lot of fast progress. In humans, these things are often very strongly correlated with other competencies, like being good at achieving your goals, or being a good coder is often very strongly correlated with solving coding problems, or being a good engineer is often correlated with solving competitive coding problems.
But in AI systems, the correlation isn’t quite as strong. And even within AI systems, it’s the case that the strongest systems on competitive programming are not even the ones that are best at actually helping you code. So o3-mini-high seems to be maybe the best at solving competitive coding problems, but it isn’t the best at actually helping you write code.
Ege Erdil 00:33:54
And it isn’t getting most of the enterprise revenue from places like Cursor or whatever, that’s just Claude, right?
Tamay Besiroglu 00:33:59
But an important insight here is that the things that we find very impressive when humans are able to do it, we should expect that AI systems are able to make a lot more progress on that. But we shouldn’t update too strongly about just their general competence or something, because we should recognize that this is a very narrow subset of relevant tasks that humans do in order to be a competent, economically valuable agent.
Dwarkesh Patel 00:34:26
Yeah. First of all, I actually just really appreciate that there is an AI organization out there where- because there’s other people who take the compute perspective seriously, or try to think empirically about scaling laws and data and whatever. And taking that perspective seriously leads people to just be like, “okay, 2027 AGI”, which might be correct, but it is just interesting to get, “no, we’ve also looked at the exact same arguments, the same papers, the same numbers. And we’ve come to a totally different conclusion”.
So I asked Dario this exact question two years ago, when I interviewed him, and it went viral.
Ege Erdil 00:35:11
Didn’t he say AGI in two years?
Dwarkesh Patel 00:35:13
That, but Dario’s always had short timelines.
Ege Erdil 00:35:15
Okay, but we are two years later.
Dwarkesh Patel 00:35:18
Did he say two years? I think he actually did say two years.
Ege Erdil 00:35:20
Did he say three years?
Tamay Besiroglu 00:35:21
So we have one more year.
Dwarkesh Patel 00:35:22
One more year.
Tamay Besiroglu 00:35:23
Better work hard.
Dwarkesh Patel 00:35:27
But he’s, I mean, I think he’s like, he in particular has not been that well calibrated. In 2018, he had like…
Tamay Besiroglu 00:35:33
I remember talking to a very senior person who’s now at Anthropic, in 2017. And he told various people that they shouldn’t do a PhD because by the time they completed it, everything would be automated.
Dwarkesh Patel 00:35:49
So anyways, I asked him this exact same question because he has short timelines, which is that if a human knew the amount of things these models know, they would be finding all these different connections. And in fact, I was asking Scott about this the other day when I interviewed him, Scott Alexander, and he said, “look, humans also don’t have this kind of logical omniscience”.
I’m not saying we’re omniscient, but we have examples of humans finding these kinds of connections. This is not an uncommon thing, right? I think his response was that these things are just not trained in order to find these kinds of connections, but their view is that it would not take that much extra compute in order to build some RL environment in which they’re incentivized to find these connections. Next token prediction just isn’t incentivizing them to do this, but the RL required to do this would not be- that or set up some sort of scaffolds. I think actually Google DeepMind did do some similar scaffold to make new discoveries. And I didn’t look into how impressive the new discovery was, they claim that some new discovery was made by an LLM as a result.
On the Moravec paradox thing, this is actually a super interesting way to think about AI progress. But I would also say that if you compare animals to humans, long term intelligent planning… an animal is not gonna help you book a flight either. An animal is not gonna do remote work for you.
I think what separates humans from other animals is that we can hold a long-term plan in our heads- we can come up with a plan and execute on it. Whereas other animals often have to go by instinct, or operate within the kinds of environments that they have evolutionary knowledge of, rather than, “I’m put in the middle of the savanna, or I’m put in the middle of the desert, or I’m put in the middle of the tundra, and I’ll learn how to make use of the tools and whatever is there”. I actually think there’s a huge discontinuity between humans and animals in their ability to survive in different environments, just based on their knowledge. And so it’s a recently optimized thing as well. And then I’d be like, “okay, well, we’ll get it soon. AIs will optimize for it fast”.
Ege Erdil 00:37:50
Right. So I would say if you’re comparing animals to humans, it’s kind of a different thing.
I think if you could put the competences that animals have into AI systems, that might just already get you to AGI. I think the reason why there is such a big discontinuity between animals and humans is because animals have to rely entirely on natural world data, basically, to train themselves. Imagine that, as a human, nobody talked to you, you didn’t read anything, you just had to learn by experience, maybe to some extent by imitating other people, but with no explicit communication. It would be very inefficient.
What’s actually happening- I think some other people have made this point as well- is that evolution is sort of this outer optimizer that’s improving the software efficiency of the brain in a bunch of ways. There’s some genetic knowledge that you inherit, not that much because there isn’t that much space in the genome. And then you have this lifetime learning, where you don’t actually see that much data, and a lot of it is redundant and so on.
So what seems to have changed with humans compared to other animals is that humans became able to have culture and they have language, which enables them to have a much more efficient training data modality compared to animals. They also have, I think, stronger ways in which they tend to imitate other humans and learn from their skills, so that also enables this knowledge to be passed on. I think animals are pretty bad at that compared to humans. So basically as a human, you’re just being trained on much more efficient data and that creates further insights to be then efficient at learning from it, and then that creates this feedback loop where the selection pressure gets much more intense.
So I think that’s roughly what happened with humans. But a lot of the capabilities that you need to be a good worker in the human economy, animals already have. So they have quite sophisticated sensorimotor skills. I think they are actually able to pursue long-term goals.
Dwarkesh Patel 00:40:03
But ones that have been instilled by evolution. I think a lion will find a gazelle and that is a complicated thing to do and requires stalking and blah, blah, blah-
Ege Erdil 00:40:12
But when you say it’s been instilled by evolution, there isn’t that much information in the genome.
Dwarkesh Patel 00:40:16
But I think if you put the lion in the Sahara and you’re like, “go find lizards instead”.
Ege Erdil 00:40:22
Okay. So suppose you put a human and they haven’t seen the relevant training data.
Dwarkesh Patel 00:40:27
I think they’d be slightly better.
Ege Erdil 00:40:29
Slightly better, but not that much better. Again, didn’t you recently have an interview?
Dwarkesh Patel 00:40:36
Joseph Henrich.
Ege Erdil 00:40:37
Yeah. So he would probably tell you that.
Dwarkesh Patel 00:40:40
I think what you’re making is actually a very interesting and subtle point that has an interesting implication. So often people say that ASI will be this huge discontinuity, because while we have this huge discontinuity in the animal-to-human transition, not that much changed between pre-human primates and humans genetically, but it resulted in this humongous change in capabilities. And so they say, “well, why not expect something similar between human level intelligence and superhuman intelligence?”
And one implication of the point you’re making is that it actually wasn’t that we just gained this incredible intelligence. Because of biological constraints, animals have been held back in this really weird way- unable to communicate with other copies or with other knowledge sources- in a way that no AI system is arbitrarily held back. And so since AIs are not held back artificially in this way, there’s not going to be a point where we take away that hobbling and then they suddenly explode.
Now, actually, I think I would disagree with that. The implication that I made, I would actually disagree with- I’m like a sort of unsteerable chain of thought.
We wrote a blog post together about AI corporations where we discuss how there will actually be a similar unhobbling with future AIs- not about intelligence, but about bandwidth and communication and collaboration with other AIs. Because of their ability to copy all their knowledge exactly, to merge, and to distill themselves, AIs will see a change in social collaboration with each other of a similar magnitude to the change from non-human animals to humans.
Tamay Besiroglu 00:42:28
Maybe before we talk about that, I think there is just a very important point to make here, which I think underlies some of this disagreement that we have with others about this argument from the transition from non-human animals to humans: this focus on intelligence and reasoning, and the R&D that is enabled by that intelligence, as being enormously important. And so if you think that you get this very important difference from this transition from non-human primates to humans, then you think that in some sense you get this enormously important unlock from fairly small scaling in, say, brain size or something.
And so then you might think, “well, if we scale beyond the size of training runs, the amount of training compute that the human brain uses, which is maybe on the order of 1e24 FLOP or whatever, which we’ve recently surpassed, then maybe surpassing it just a little bit more enables us to unlock very sophisticated intelligence in the same way that humans have much more sophisticated intelligence compared to non-human primates”. And I think part of our disagreement is that intelligence is kind of important, but just having a lot more intelligence and reasoning and good reasoning isn’t something that will kind of accelerate technological change and economic growth very substantially.
It isn’t the case that the world today is totally bottlenecked by not having enough good reasoning, that’s not really what’s bottlenecking the world’s ability to grow much more substantially. I think that we might have some disagreement about this particular argument, but I think what’s also really important is just that we have a different view as to how this acceleration happens, that it’s not just having a bunch of really good reasoners that give you this technology that then accelerates things very drastically. Because that alone is not sufficient. You need kind of complementary innovations in other industries. You need the economy as a whole growing and supporting the development of these various technologies. You need the various supply chains to be upgraded. You might need demand for the various products that are being built.
And so we have this view where actually this very broad upgrading of your technology and your economy is important rather than just having very good reasoners and very, very, very good reasoning tokens that gives us this acceleration.

Intelligence explosion

Dwarkesh Patel 00:45:04
All right. So this brings us back to the intelligence explosion. Here’s the argument for the intelligence explosion:
You’re right that certain kinds of things might take longer to come about, but this core loop of software R&D that’s required, if you just look at what kinds of progress is needed to make a more general intelligence, you might be right that it needs more experimental compute, but as you guys have documented, we’re just getting a shit-ton more compute every single year for the next few years. So you can imagine an intelligence explosion in the next few years where in 2027, there’ll be like 10x more compute than there is now for AI.
And you’ll have this effect where the AIs that are doing software R&D are finding ways to make running copies of them more efficient, which has two effects. One, you’re increasing the population of AIs who are doing this research, so more of that in parallel can find these different optimizations. And a subtle point that they’d often make here is software R&D in AI is not just Ilya-type coming up with new transformer-like architectures.
To your point, it actually is a lot of- I mean, I’m not an AI researcher, but I assume there’s, from the lowest level libraries to the kernels, to making RL environments, to finding the best optimizer, to… there’s just so much to do, and in parallel you can be doing all these things or finding optimizations across them. And so you have two effects, going back to this. One is, if you look at the original GPT-4 compared to the current GPT-4o, I think it’s, what, how much cheaper is it to run?
Tamay Besiroglu 00:46:57
It’s like, maybe a hundred times for the same capability or something.
Dwarkesh Patel 00:47:03
Right. So they’re finding ways in which to run more copies of them a hundred times cheaper or something, which means that the population of them is increasing and the higher populations are helping you find more efficiencies.
And not only does that mean you have more researchers, but to the extent that the complementary input is experimental compute, it’s not the compute itself, it’s the experiments.
And the more efficient it is to run a copy or to develop a copy, the more parallel experiments you can run, because now you can do a GPT-4 scale training run for much cheaper than you could do it in 2024 or 2023. And so for that reason, also this software-only singularity sees more researcher copies who can run experiments for cheaper, dot, dot, dot. They initially are maybe handicapped in certain ways that you mentioned, but through this process, they are rapidly becoming much more capable. What is wrong with this logic?
Tamay Besiroglu 00:47:57
So I think the logic seems fine. I think this is like a decent way to think about this problem, but I think that it’s useful to draw on a bunch of work that, say, economists have done for studying the returns to R&D and what happens if you 10X your inputs, the number of researchers, what happens to innovation or the rate of innovation.
And there, they point out these two effects where, as you do more innovation and you get to stand on top of the shoulders of giants and you get the benefit from past discoveries and it makes you as a scientist more productive. But then there’s also kind of diminishing returns, that the low hanging fruit has been picked, and it becomes harder to make progress. And overall, you can summarize those estimates as thinking about the kind of returns to research effort.
And we’ve looked into the returns to research effort in software specifically. And we look at a bunch of domains in traditional software- linear and integer programming solvers, SAT solvers- but also in AI: computer vision and RL and language modeling. And there, if this model is true, that all you need is just cognitive effort, it seems like the estimates are a bit ambiguous about whether this results in this acceleration or whether it results in just merely exponential growth.
And then you might also think about, well, it isn’t just your research effort that you have to scale up to make these innovations, because you might have complementary inputs. So as you mentioned, experiments are the thing that might bottleneck you. And I think there’s a lot of evidence that, in fact, these experiments and scaling up hardware are just very important for getting progress in the algorithms and the architecture and so on. So in AI- and this is true for software in general- if you look at progress in software, it often matches very closely the rate of progress we see in hardware. For traditional software, we see about a 30% increase per year, which basically matches Moore’s law. And in AI, we’ve seen the same until you get to the deep learning era, and then you get this acceleration, which in fact coincides with the acceleration we see in compute scaling, which gives you a hint that the compute scaling might have been very important.
Other pieces of evidence, besides this coincidental rate of progress, are the fact that innovation in algorithms and architectures is often concentrated in GPU-rich labs and not in the GPU-poor parts of the world, like academia or maybe smaller research institutes. That also suggests that having a lot of hardware is very important. If you look at specific innovations that seem very important, the big innovations over the past five years, many of them have some kind of scaling or hardware-related motivation. So you might look at how the transformer itself was about how to harness more parallel compute. Things like FlashAttention were literally about how to implement the attention mechanism more efficiently, or things like the Chinchilla scaling laws.
And so many of these big innovations were just about how to harness your compute more effectively. That also tells you that actually the scaling of compute might be very important. And I think there’s just many pieces of evidence that point towards this complementarity picture.
So I would say that even if you assume that experiments are not particularly important, the evidence we have, both from estimates of AI and other software- although the data is not great- suggests that maybe you don’t get this kind of hyperbolic, faster-than-exponential super-growth in the overall algorithmic efficiency of systems.
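One standard way to formalize the “returns to research effort” question Tamay is describing- a sketch of the usual semi-endogenous-growth setup, not Epoch’s exact model or estimates:

```latex
% Sketch: law of motion for software efficiency A given research input R
% (an illustrative formalization, not the guests' exact model)
\[ \frac{\dot{A}}{A} = \theta \, R^{\lambda} A^{-\beta} \]
% In a software-only singularity, automated researchers make R scale with A itself:
\[ R \propto A \quad\Rightarrow\quad \frac{\dot{A}}{A} \propto A^{\lambda - \beta} \]
% Hyperbolic, faster-than-exponential growth requires \lambda > \beta, i.e. a returns
% ratio r = \lambda / \beta > 1; otherwise growth stays exponential or slows down.
```

Whether that ratio sits above or below one is exactly what the empirical estimates mentioned here are ambiguous about.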
Dwarkesh Patel 00:51:56
I’m not sure I buy the argument that, because these two things- compute and AI progress- have risen so concomitantly, there is a sort of causal relationship.
So broadly, the industry as a whole has been getting more compute and as a result, making more progress. But if you look at the top players, there have been multiple examples of a company with much less compute, but a more coherent vision and more concentrated research effort, being able to beat an incumbent who has much more compute. So OpenAI initially beating Google DeepMind. And if you remember, there were these emails that were released between Elon and Sam and so forth like, “we got to start this company because they’ve got this bottleneck on the compute” and, “look how much more compute Google DeepMind has”. And then OpenAI made a lot of progress. Similarly now with OpenAI versus Anthropic and so forth. And then I think just generally, your argument is just too ‘outside view’. Instead of this very macroeconomic argument, I’m like, well, why don’t we just ask the AI researchers?
Tamay Besiroglu 00:53:01
I mean, AI researchers will often kind of overstate the extent to which just cognitive effort and doing research is important for driving these innovations, because that’s often convenient or useful. They will say the insight was derived from some nice idea about statistical mechanics or some nice equation in physics that says that we should do it this way. But often that’s an ad hoc story that they tell to make it a bit more compelling to reviewers.
Dwarkesh Patel 00:53:35
So Daniel Kokotajlo mentioned this survey he did where he asked a bunch of AI researchers, “if you had one thirtieth the amount of compute”- and he did one thirtieth because AIs will be, they suppose, 30 times faster- “if you had one thirtieth the amount of compute, how much would your progress slow down?” And they say, “I make a third of the amount of progress I normally do”. So that’s just a pretty good substitution effect: you get one thirtieth the compute, and your progress only goes down to a third.
And then I was talking to an AI researcher the other day, one of these cracked people, gets paid tens of millions of dollars a year, probably. And we asked him, how much do the AI models help you in AI research? And he said, “in domains that I’m already quite familiar with, where it’s closer to autocomplete, it saves me four to eight hours a week”. And then he said, “but in domains where I’m actually less familiar, where I need to draw new connections, I need to understand how these different parts relate to each other, and so forth, it saves me close to 24 to 36 hours a week”.
And that’s the current models. And I’m just like, “he didn’t get more compute, but it still saved him a shit ton more time”. Just draw that forward. That’s a crazy implication or crazy trend, right?
Ege Erdil 00:54:58
I mean, I’m skeptical of the claims that we have actually seen that much of an acceleration in the process of R&D. These claims seem to me, like they’re not borne out by the actual data I’m seeing. So I’m not sure how much to trust them.
Dwarkesh Patel 00:55:18
I mean, on the general intuition that cognitive effort alone can give you a lot of AI progress, it seems like a big important thing the labs do is this science of deep learning. Scaling laws… I mean, it ultimately netted out in an experiment, but the experiment is motivated by cognitive effort.
Ege Erdil 00:55:36
So for what it’s worth, when you say that A and B are complementary, you’re saying that just as A can bottleneck you, B can also bottleneck you. So when you say you need compute and experiments and data, but you also need cognitive effort, that doesn’t mean the lab which has the most compute is going to win, right? That’s a very simple point, either one can be the bottleneck.
I mean, if you just have a really dysfunctional culture and you don’t actually prioritize using your compute very well and you just waste it, well then you’re not going to make a lot of progress, right? So it doesn’t contradict the picture that someone with a much better vision, a much better team, much better prioritization can make better use of their compute if someone else was just bottlenecked heavily on that part of the equation. The question here is, once you get these automated AI researchers and you start this software singularity, your software efficiency is going to improve by many orders of magnitude, while your compute stock, at least in the short run, is going to remain fairly fixed. So how many OOMs of improvement can you get before you become bottlenecked by the other part of the equation? And once you actually factor that in, how much progress should you expect?
That’s the kind of question I think people don’t really ask. I think it’s hard for people to have good intuitions about this because people usually don’t run the experiments. So you don’t get to see at a company level, or at an industry level, what would have happened if the entire industry had 30 times less compute. Maybe as an individual, what would happen if you had 30 times less compute? You might have a better idea about that, but that’s a very local experiment and you might be benefiting a lot from spillovers from other people who actually have more compute. So because this experiment was never run, it’s sort of hard to get direct evidence about the strength of complementarity.
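To make the bottleneck question concrete, here is a minimal toy sketch, assuming research output comes from a CES combination of software/cognitive effort and a fixed compute stock with an elasticity of substitution below one. The functional form and numbers are illustrative assumptions, not anyone's actual model:

```python
# Toy CES illustration of the bottleneck question: output from
# "software/cognitive effort" S and a fixed compute stock C, with
# elasticity of substitution sigma < 1 (i.e. the inputs are complements).
def ces_output(S, C, sigma=0.5, share=0.5):
    rho = (sigma - 1) / sigma  # rho < 0 when the inputs are complements
    return (share * S**rho + (1 - share) * C**rho) ** (1 / rho)

C = 1.0
baseline = ces_output(1.0, C)
for ooms in range(7):  # scale the software input by 10^0 .. 10^6
    gain = ces_output(10.0**ooms, C) / baseline
    print(f"software x10^{ooms}: research output x{gain:.2f}")
# With sigma = 0.5 the gain saturates near 2x no matter how many OOMs of
# software improvement you add, because the fixed compute stock becomes the
# bottleneck. With sigma closer to 1 the cap gets much larger, and at
# sigma = 1 (Cobb-Douglas) it disappears.
```

How many OOMs you get before the cap binds is exactly the quantity the never-run "industry with 30x less compute" experiment would help pin down.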
Dwarkesh Patel 00:57:27
Actually, what is your probability of, if we live in the world where we get AGI in 2027, that there is a software-only singularity?
Tamay Besiroglu 00:57:35
Quite high, because you’re conditioning on compute not being very large. So it must be that you get a bunch of software progress.
Dwarkesh Patel 00:57:44
Yeah, right, right. Like you just have a bunch of leverage from algorithmic progress in that world.
Tamay Besiroglu 00:57:50
OK, that’s right.
Dwarkesh Patel 00:57:51
So then maybe, because I was thinking these are independent questions-
Tamay Besiroglu 00:57:54
I think a call that I want to make is, I know that some labs do have multiple pre-training teams and they give people different amounts of resources for doing the training and different amounts of cognitive effort, different size of teams. But none of that, I think, has been published. And I’d love to see the results of some of those experiments.
I think even that won’t update you very strongly just because it is often just very inefficient to do this very imbalanced scaling of your factor inputs. And in order to really get an estimate of how strong these complementarities are, you need to observe these very imbalanced scale-ups. And so that rarely happens.
And so I think the data that bears on this is just really quite poor. And then the intuitions that people have also don’t seem clearly relevant to the thing that matters about what happens if you do this very imbalanced scaling and where does this net out?
Dwarkesh Patel 00:58:53
One question I have, which it would be really interesting if somebody can provide an example of: maybe through history, there was some point at which because of a war or some other kind of supply shock, you had to ramp up production or ramp up some key output that people really cared about, while for some weird historical reason, many of the key inputs were not accessible to a ramp-up, but you could ramp-up one key input.
I’m talking in very abstract terms. You see what I’m saying, right? You need to make more bombers, but you ran out of aluminum and you need to figure out something else to do. And how successful these efforts have been or whether you just keep getting bottlenecked?
Ege Erdil 00:59:35
Well, I think that is not quite the right way to do it. Because I think if you’re talking about materials, then I think there’s a lot of sense in which different materials can be substitutable for one another in different ways. You can use aluminum. I mean, aluminum is a great metal for making aircraft because it’s light and durable and so on. But you can imagine that you make aircraft with worse metals and then it just takes more fuel and it’s less efficient to fly. So there’s a sense in which you can compensate, it just costs more.
I think it’s much harder if you’re talking about something like complementarity between labor and capital, complementarity between remote work and in-person work, or skilled and unskilled work. There are input pairs for which I would expect it to be much more difficult. For example, you’re looking at the complementarity between the quality of leadership of an army and its number of soldiers. There is some effect there, but if you just scale up one of them, say you have excellent leadership but your army only has 100 people, you’re not going to get very far.
Dwarkesh Patel 01:00:40
King Leonidas and Thermopylae?
Ege Erdil 01:00:44
Well, they lost, right?
Dwarkesh Patel 01:00:47
It would be funny if we’re building models and software-only singularity and we’re like, “what exactly happened in Thermopylae?” It’s somehow relevant.
Ege Erdil 01:00:53
I can actually talk about that, but we probably shouldn’t.

Ege & Tamay’s story

Dwarkesh Patel 01:00:57
Okay, sure. So the audience should know, my most popular guest by far is Sarah Paine. Not only is she my most popular guest, she’s my four most popular guests, because all four of those episodes I’ve done with her are at the top on a viewer-minute adjusted basis. Basically I host the Sarah Paine Podcast where I occasionally talk about AI.
Anyways, we did this three-part lecture series where one of them was about India-Pakistan wars through history. One of them was about Japanese culture before World War II. The third one was about the Chinese Civil War. And for all of them, my history tutor was Ege. And, why does he know so much about fucking random 20th century conflicts? But he did, and he suggested a bunch of the good questions I asked her. We’ll get into that in a second. Ege, what’s going on there?
Ege Erdil 01:01:56
I don’t know. I mean, I don’t really have a good answer. I think it’s interesting. I mean, I read a bunch of stuff, but it’s a kind of boring answer. I don’t know. Imagine you ask a top AI researcher, “What’s going on? How are you so good?” And then they will probably give you a boring answer. Like, I don’t know.
Dwarkesh Patel 01:02:13
That itself is interesting that often these kinds of questions elicit boring answers. It tells you about the nature of the skill. How’d you find him?
Tamay Besiroglu 01:02:22
We connected on a Discord for Metaculus, which is this forecasting platform. And I was a graduate student at Cambridge at the time doing research in economics. And I was having conversations with my peers there. And I was occasionally having conversations with Ege. And I was like, “this guy knows a lot more about economics”. And at the time he was a computer science undergrad in Ankara. And he knows more about economics and about these big trends in economic growth and economic history than almost any of my peers at the university. And so like, what the hell is up with that?
And then we started having frequent collaborations and ended up hiring Ege for Epoch because it clearly makes sense for him to work on these types of questions.
Dwarkesh Patel 01:03:17
And it seems like at Epoch, you’ve just collected this group of internet misfits and weirdos.
Tamay Besiroglu 01:03:23
Yeah, that’s right.
Dwarkesh Patel 01:03:24
How did you start Epoch? And then how did you accomplish this?
Tamay Besiroglu 01:03:27
So I was at MIT doing more research, and I was pretty unhappy with the bureaucracy there where it was very hard for me to scale projects up, hire people. And I was pretty excited about a bunch of work that my PI wasn’t excited about because it’s maybe hard to publish or it doesn’t confer the same prestige. And so I was chatting with Jaime Sevilla, one of the co-founders, and we just collaborated on projects and then thought we should just start our own org, because we can just hire people and work on the projects we were excited about. And then I just hired a bunch of the insightful misfits that like…
Dwarkesh Patel 01:04:12
But was the thesis like, “oh, there’s a bunch of underutilized internet misfits and therefore this org was successful”? Or you started the org and then you were like…
Tamay Besiroglu 01:04:20
I think it’s more of the latter. So it was more like, we can make a bunch of progress because clearly academia and industry are kind of dropping the ball on a bunch of important questions. Academia is unable to publish interesting papers on them, and industry is not really focused on producing useful insights. And so it seemed very good for us to just do that. And also the timing was very good. So we started just before ChatGPT and we wanted to have much more grounded discussions of the future of AI.
And I was frustrated with the quality of discussion that was happening on the internet about the future of AI. And to some extent or to a very large extent, I still am. And that’s a large part of what motivates me to do this. It’s just born out of frustration with bad thinking and arguments about where AI is going to go.

Explosive economic growth

Dwarkesh Patel 01:06:24
OK, so let me ask you about this. Just to set the scene for the audience, we’re going to talk about the possibility of this explosive economic growth and greater than 30 percent economic growth rates. So I want to poke you both, first from the perspective of “maybe this isn’t aggressive enough in the right kind of way, because maybe it’s too broad”, and then I’ll poke you from the more normal perspective of, “hey, this is fucking crazy”.
Ege Erdil 01:06:54
I imagine it would be difficult for you to do the second thing.
Dwarkesh Patel 01:06:57
No, I mean, I think it might be fucking crazy, let’s see. The big question I have about this broad automation, I get what you’re saying about the Industrial Revolution, but in this case, we can just make this argument that you get this intelligence and then what you do next is you go to the desert and you build this Shenzhen of robot factories which are building more robot factories, which are building… If you need to do experiments then you build bio labs and you build chemistry labs and whatever.
Ege Erdil 01:07:30
Or you can build Shenzhen in the desert. I agree that looks much more plausible than a software-only singularity.
Dwarkesh Patel 01:07:35
But the way you’re framing it, it sounds like McDonald’s and Home Depot and fucking whatever are growing at 30 percent a year as well. The aliens’ level view of the economy is that there’s a robot economy in the desert that’s growing at 10,000 percent a year and everything else is the same-old-same-old, or is it like-
Ege Erdil 01:07:57
No, I mean, there is a question about what would be possible, or physically possible, and what would be the thing that would actually be efficient. So it might be the case, and again, once you’re scaling up the hardware part of the equation as well as the software part, then I think the case for this feedback loop gets a lot stronger. If you scale up data collection as well, I think it gets even stronger, real world data collection by deployment and so on.
But building Shenzhen in a desert… if you think about the pipeline, so far we have relied first on the entire semiconductor supply chain. That industry depends on tons of inputs and materials, and it gets those from probably tons of random places in the world. And creating that infrastructure, doubling or tripling that infrastructure, the entire thing, that’s very hard work. So probably you couldn’t even do it with just a Shenzhen in a desert; it would be even more expensive than that.
And on top of that, so far, we have been drawing heavily on the fact that we have built up this huge stock of data, over the past 30 years or something, on the internet. Imagine you were trying to train a state-of-the-art model, but you only have 100 billion tokens to train on. That would be very difficult. So in a certain sense, our entire economy has produced this huge amount of data on the internet that we are now using to train the models. It’s plausible that in the future, when you need to get new competencies added to these systems, the most efficient way to do that will be to try to leverage similar kinds of modalities of data, which will also require this… you would want to deploy the systems broadly because that’s going to give you more data. And maybe you can get where you want to be without that, but it would just be less efficient if you’re starting from scratch compared to if you’re collecting a lot of data.
I think this is actually a motivation for why labs want their LLMs to be deployed widely, because sometimes when you talk to ChatGPT, it’s going to give you two responses and it’s going to say, well, which one was good? Or it’s going to give you one response and it’s going to ask you, was this good or not? Well, why are they doing that, right? That’s a way in which they are getting user data through this extremely broad deployment. So I think you should just imagine that thing to continue to be efficient and continue to increase in the future because it just makes sense.
And then there’s a separate question of, well, suppose you didn’t do any of that. Suppose you just tried to imagine the most rudimentary, the narrowest possible kind of infrastructure build-out and deployment that would be sufficient to get this positive feedback loop that leads to much more efficient AIs. I agree that loop could, in principle, be much smaller than the entire world. I think it probably couldn’t be as small as Shenzhen in the desert, but it could be much smaller than the entire world. But then there’s a separate question of, would you actually do that? Would that be efficient? I think some people have the intuition that there are just these extremely strong constraints, maybe regulatory constraints, maybe social political constraints, to doing this broad deployment. They just think it’s going to be very hard.
So I think that’s part of the reason why they imagine these narrower scenarios where they think it’s going to be easier. But I think that’s overstated. I think people’s intuitions for how hard this kind of deployment is come from cases where the deployment of the technology wouldn’t be that valuable. So it might come from housing. We have a lot of regulations on housing. Maybe it comes from nuclear power. Maybe it comes from supersonic flights. I mean, those are all technologies that would be useful if they were maybe less regulated. But they wouldn’t double the economy.
Tamay Besiroglu 01:11:52
I think the core point here is the value of AI automation and deployment is just extremely large, even just for workers. There might be some kind of displacement and there might be some transition that you need to do in order to find a job that works for you, but otherwise the wages could still be very high for a while at least.
And on top of that, the gains from owning capital might be very enormous. And in fact, a large share of the US population would benefit… They benefit, they own housing, they have 401ks. Those would do enormously better when you have this process of broad automation and AI deployment. And so I think there could just be a very deep support for some of this, even when it’s totally changing the nature of labor markets and the skills and occupations that are in demand.
Ege Erdil 01:12:55
So I would just say it’s complicated. I think what the political reaction to it will be when this starts actually happening, I think the easy thing to say is that, yeah, this will become a big issue and then it will be maybe controversial or something. But what is the actual nature of the reaction in different countries? I think that’s kind of hard to forecast. I think the default view is like, “well, people are going to become unemployed, so it will just be very unpopular”. I think that’s very far from obvious.
And I just expect heterogeneity in how different countries respond. And some of them are going to be more liberal about this and going to have a much broader deployment. And those countries probably end up doing better. So just like during the Industrial Revolution, some countries were just ahead of others. I mean, eventually almost the entire world adopted the sort of norms and culture and values of the Industrial Revolution in various ways.
Tamay Besiroglu 01:13:44
And actually, you say they might be more liberal about it, but they might actually be less liberal in many ways. In fact, that might be more functional in this world in which you have broad AI deployment. We might adopt the kind of values and norms that get developed in, say, the UAE or something, which is maybe focused a lot more on making an environment that is very conducive for AI deployment. And we might start emulating and adopting various norms like that. And they might not be classical liberal norms, but norms that are just more conducive to AI being functional and producing a lot of value.
Ege Erdil 01:14:27
This is not meant to be a strong prediction, this is just illustrative. It might just be the freedom to deploy AI in the economy and build out lots of physical things at scale, maybe that ends up being more important in the future. Maybe that is still missing something, maybe there are some other things that are also important. The generic prediction that you should expect variance and that some countries do better than others, I think that’s much easier to make than predicting the specific countries that end up doing better.
Dwarkesh Patel 01:14:55
Yeah. Or the norms that that country wants.
Tamay Besiroglu 01:14:56
That’s right.
Dwarkesh Patel 01:14:57
One thing I’m confused about is, if you look at the world of today versus the world of 1750, the big difference is just we’ve got crazy tech that they didn’t have back then. We’ve got these cameras, we’ve got these screens, and we’ve got rockets and so forth. And that just seems like the result of technological growth and R&D and so forth.
Ege Erdil 01:15:22
It’s capital accumulation.
Dwarkesh Patel 01:15:23
Well, explain that to me because you’re just talking about this infrastructure build out and blah, blah, blah. I’m like, but why won’t they just fucking invent the kinds of shit that humans would have invented by 2050?
Ege Erdil 01:15:37
Producing this stuff takes a lot of infrastructure build-out.
Dwarkesh Patel 01:15:40
But that infrastructure is built out once you make the technology, right?
Tamay Besiroglu 01:15:45
I don’t think that’s right. There isn’t this temporal difference where it’s first you do the invention… often there’s this interplay between the actual capital buildup and the innovation.
Ege Erdil 01:15:57
Learning curves are about this, right, fundamentally? What has driven the increase in the efficiency of solar panels over the past 20, 30 years?
Tamay Besiroglu 01:16:05
It isn’t just like people had the idea of 2025 solar panels. Nobody 20 years ago had the sketch for the 2025 solar panel. It’s this kind of interplay between having ideas, building, learning, producing, and-
Ege Erdil 01:16:24
Other complementary inputs also becoming more efficient at the same time, like you might get better materials. For example, the fact that smelting processes got a lot better towards the end of the 19th century, so it became a lot easier to work with metal, maybe that was a crucial reason why aircraft technology later became more popular.
It’s not like someone came up with the idea of, “oh, you can just use something that just has wings and has a lot of thrust, and then that might be able to fly”. That basic idea is not that difficult, but then, well, how do you make it actually a viable thing? Well, that’s much more difficult.
Dwarkesh Patel 01:17:04
Have you seen the meme where two beavers are talking to each other and they’re looking at the Hoover Dam? One of them’s like, “well, I didn’t build that, but it’s based on an idea of mine”. The point you’re making is that this invention-focused view of tech history underplays the work that goes into making specific innovations practicable and deploying them widely.
Ege Erdil 01:17:33
It’s just hard, I think. Suppose you want to write a history of this, you want to write the history of how the light bulb was developed or something. It’s just really hard. Because to understand why specific things happen at specific times, you probably need to understand so much about the economic conditions of the time.
For example, Edison spent a ton of time experimenting with different filaments to use in the light bulb. The basic idea is very simple. You make something hot and it glows, but then what filament actually works well for that in a product? What is durable? What has the highest ratio of light output to heat, so that you have less waste and it’s more efficient? And even after you have the product, then you’re facing the problem, well, it’s 1880 or something and US homes don’t have electricity, so then nobody can use it. So now you have to build power plants and build power lines to the houses so that people have electricity in their homes and can actually use this new light bulb that you created. So he did that, but then people present it as if it’s like, “okay, he just came up with the idea”, like “it’s a light bulb”.
Dwarkesh Patel 01:18:46
I guess the thing people would say is, you’re right about how technology would progress if we were humans deploying for the human world. But what you’re not counting is there’s going to be this AI economy where maybe they need to do this kind of innovation and learning by doing when they’re figuring out how to, “I want to make more robots because they’re helpful and so we’re going to build more robot factories, we’ll learn and then we’ll make better robots” or whatever. But geographically, that is a small part of the world that’s happening in. You understand what I’m saying? It’s not like, “and then they walk in your building and then you do a business transaction with Lunar Society podcast LLC and then”, you know what I mean?
Ege Erdil 01:19:30
For what it’s worth, if you look at the total surface area of the world, it might well be the case that the place that initially experiences this very fast growth is a small percentage of the surface area of the world. And I think that was the same for the Industrial Revolution, it was not different.
Dwarkesh Patel 01:19:49
What concretely does this explosive growth look like? If I look at this heat map of growth rates on the globe, is there just going to be one area that is blinding hot and that’s the desert factories with all these experiments and like…
Ege Erdil 01:20:03
I would say our idea is that it’s going to be broader than that, but probably initially… So eventually it would probably be most of the world. But as I said, because of this heterogeneity, because I think some countries are going to be faster in adoption than others, maybe some cities will have faster adoption than others, that will mean that there are differentials and some countries might have much faster growth than other countries.
But I would expect that at a jurisdiction level, it will be more homogenous. So, for example, I expect the primary obstacles to come from things like regulation. And so I would just imagine it’s being more delineated by regulatory jurisdiction boundaries than anything else.
Dwarkesh Patel 01:20:48
Got it. So you may be right that this infrastructure build-out and capital deepening and whatever is necessary for a technology to become practical, but…
Ege Erdil 01:20:57
Or even to be discovered. There’s an aspect of it where you discover certain things by scaling up, learning by doing, that’s the [?] learning curve. And there’s this separate aspect where, suppose you become wealthier, well, you can use that increased wealth to accumulate more capital, but you can also invest it in R&D and other ways.
Tamay Besiroglu 01:21:21
You get Einstein out of the patent office. You need some amount of resources for that to make sense. And you need the economy to be of a certain scale. You also need demand for the product you’re building. So you could have the idea, but if the economy is just too small and there isn’t enough demand for you to specialize in producing the semiconductor or whatever, then it doesn’t make sense.
A much larger scale of an economy is useful in many ways in delivering complementary innovations and discoveries happening through serendipity, producing, having there be consumers that would actually pay enough for you to recover your fixed costs of doing all the experimentation and the invention. You need the supply chains to exist to deliver the germanium crystals that you need to grow in order to come up with the semiconductor. You need a large labor force to be able to help you do all the experiments and so on.
Dwarkesh Patel 01:22:20
I think the point you’re illustrating is, “look, could you have just figured out that there was a Big Bang by first principles reasoning?” Maybe. But what actually happened is we had World War II and we discovered radio communications in order to fight and effectively communicate during the war.
And then that technology helped us build radio telescopes. And then we discovered the cosmic microwave background. And then we had to come up with an explanation for the cosmic microwave background. And then we discovered the Big Bang as a result of World War II.
Tamay Besiroglu 01:22:46
People underemphasize the giant effort that goes into this build-up of all the relevant capital and all the relevant supply chains and the technology. I mean, earlier you were making a similar comment when you were saying, “oh, reasoning models actually in hindsight look pretty simple”, but then you’re ignoring this giant upgrading of the technology stack that happened, that took five to 10 years prior to that. And so I think people just underemphasize the support that comes from the overall upgrading of your technology, of the supply chains, of various sectors that are important for that.
And people focus on just specific individuals of like, Einstein had this genius insight and he was the very pivotal thing in the causal chain that resulted in these discoveries. Or Newton was just extremely important for discovering calculus without thinking about, well, there were all these other factors that produced lenses, that produced telescopes, that got the right data and that made people ask questions about dynamics and so on that motivated some of these questions. And those are also extremely important for scientific and technological innovation.
Dwarkesh Patel 01:24:06
And then, as you were saying, one of Conquest’s laws is, the more you understand about a topic, the more conservative you become about that topic. And so there may be a similar law here, where the more you understand about an industry, the more- obviously, I’m just a commentator, or a podcaster, but I understand AI better than any other industry I understand. And I have the sense from talking to people like you that, “oh, so much went into getting AI to the point where it is today”. Whereas when I talk to journalists about AI, they’re like, “okay, who is a crucial person we need to cover?” And they’re like, “should we get in touch with Geoffrey Hinton? Should we get in touch with Ilya?” And I just have this feeling of, “you’re kind of missing the picture”.
But then you should have that same attitude towards things you… Or maybe it’s a similar phenomenon to Gell-Mann amnesia, we should have a similar attitude towards other industries.
Ege Erdil 01:24:59
Robin Hanson has this abstraction of seeing things in near mode versus far mode. And I think if you don’t know a lot about a topic, then you see it in far mode and you simplify things; when you know it well, you see it in near mode and you see a lot more detail. In general, the reason I also believe that abstract reasoning and deductive reasoning, or even Bayesian reasoning, by itself is not sufficient, or is not as powerful as many other people think, is because I think there’s just this enormous amount of richness and detail in the real world that you just can’t reason about. You need to see it. And obviously that is not an obstacle to AI being incredibly transformative because, as I said, you can scale your data collection, you can scale the experiments you do, both in the AI industry itself and just more broadly in the economy, so you just discover more things. More economic activity means we have more exposed surface area to have more discoveries.
All of these are things that have happened in our past, so there’s no reason that they couldn’t speed up. The fundamental thing is that there’s no reason fundamentally why economic growth can’t be much faster than it is today. It’s probably only as fast as it is right now because humans are such an important bottleneck. They both supply the labor and play crucial roles in the process of discovery of various kinds of productivity growth. There’s just strong complementarity to some extent with capital, such that you can’t substitute machines and so on for humans very well. So the growth of the economy and the growth of productivity just end up being bottlenecked by the growth of the human population.
Dwarkesh Patel 01:27:39
So let me ask you a tangential question. What’s been happening in China over the last 50 years, would you describe that as, in principle, the same kind of explosive growth that you expect from AI? Because there’s a lot of labor that makes the marginal product of capital really high, which allows you to have 10% plus economic growth rates. Is that basically, in principle, what you expect from AI?
Ege Erdil 01:28:01
So I would say in some ways it’s similar, in some ways it’s not. Probably the most important way in which it’s not similar is that in China, you see a massive amount of capital accumulation, a substantial amount of adoption of new technologies and probably also human capital accumulation to some extent. But you’re not seeing a huge scale up in the labor force. While for AI, you should expect to see a scale up in the labor force as well, not in the human workforce, but in the AI workforce.
Dwarkesh Patel 01:28:34
And I think you did, maybe not consecutive increases in the labor force…
Tamay Besiroglu 01:28:38
The key thing here is just the simultaneous scaling of both these things. And so you might ask the question of “isn’t it basically half of what’s going to happen with AI that you scale up capital accumulation in China?” But actually if you get both of these things to scale, that gives you just much faster growth and a very different picture.
Ege Erdil 01:29:04
But at the same time, if you’re just asking what 30 percent growth per year would look like, if you just want to have an intuition for how transformative that would be in concrete terms, then I think looking at China is not such a bad case. Especially in the 2000s or maybe late 90s, that seems slower than what we’re forecasting.
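For a concrete sense of scale, here is a quick doubling-time calculation. The growth rates are illustrative reference points of mine, not figures either guest commits to:

```python
import math

def doubling_time(annual_growth):
    """Years for output to double at a constant annual growth rate."""
    return math.log(2) / math.log(1 + annual_growth)

# Illustrative reference rates, not precise historical figures.
scenarios = {
    "rich economies today (~2%/yr)": 0.02,
    "China's fastest catch-up growth (~10%/yr)": 0.10,
    "explosive growth discussed here (~30%/yr)": 0.30,
}
for name, rate in scenarios.items():
    print(f"{name}: doubles roughly every {doubling_time(rate):.1f} years")
# ~35 years at 2%, ~7.3 years at 10%, ~2.6 years at 30%: so 30% per year is
# roughly China's peak catch-up growth compressed about three-fold.
```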
Tamay Besiroglu 01:29:24
Right. I think also looking at the Industrial Revolution is pretty good.
Ege Erdil 01:29:26
Well, the Industrial Revolution is very slow.
Tamay Besiroglu 01:29:28
But just in terms of the margins along which we made progress in terms of products. So the thing that didn’t happen during the Industrial Revolution is that we just produced a lot more of the things that people were producing prior to the Industrial Revolution, like a lot more crops on farms and maybe a lot more pre-Industrial Revolution style houses or whatever. Instead, what we got is, along pretty much every main sector of the economy, many different products that are totally different from what was being consumed prior to that. So in transportation, in food.
Ege Erdil 01:30:13
I mean, health care is a very big deal and antibiotics.
Dwarkesh Patel 01:30:16
So another question, because I’m not sure I understand how you’re defining learning by doing versus explicit R&D, because there’s the way companies classify what they call R&D for tax purposes, and then there’s the intuitive understanding of R&D. So if you think about how AI is boosting TFP, you could say that right now, if you just replaced the TSMC process engineers with AIs and they’re finding different ways to improve that process and improve efficiencies, improve yield, I would kind of call that R&D. On the other hand, you emphasize this other part of TFP, which is like better management and that kind of stuff.
Ege Erdil 01:30:59
The learning by doing could be, you could-
Dwarkesh Patel 01:31:00
But how much “umph” are you… Like you’re going to get to the fucking Dyson Sphere by better management?
Ege Erdil 01:31:05
But that’s not the argument, right? The point is that there are all these different things, some of them are maybe more complementary than others. The point is not that you can get to a Dyson sphere by just scaling labor and capital. That’s not the point. You need to scale everything at once. So just as you can’t get to a Dyson sphere by just scaling labor and capital, you also can’t get to it by just scaling TFP. That doesn’t work.
Tamay Besiroglu 01:31:30
I think there’s a very important distinction between what is necessary to scale, to get to this Dyson sphere world and what is important. Like in some sense, producing food is necessary. But of course, producing food doesn’t get you to a Dyson sphere, right? So I think R&D is necessary, but on its own isn’t sufficient. And scaling up the economy is also necessary. On its own, it’s not sufficient. And then you can ask the question, what is the relative importance of each?
Ege Erdil 01:32:00
So I think our view here is very much the same. It is very connected to our view about the software R&D thing where we’re just saying there are these bottlenecks, so you need to scale everything at once. This is just a general view.
But I think people misunderstand us sometimes as saying that R&D is not important. No, that’s not what we’re saying. We’re saying it is important. It is less important in relative terms than some other things, none of which are by themselves sufficient to enable this growth. So the question is, how do you do the credit attribution? One of the standard things in economics is to look at the elasticities of output with respect to the different factors. Capital is less important than labor, because the labor elasticity of output is like 0.6, while for capital it’s like 0.3. But neither is by itself sufficient. If you just scaled one of them and the other remained fixed, then neither would be sufficient to indefinitely scale output.
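As a minimal sketch of the elasticity point, assuming a Cobb-Douglas form with the rough 0.6 and 0.3 figures cited above (an illustration, not their actual estimate):

```python
# Cobb-Douglas illustration of factor elasticities: Y = A * L**0.6 * K**0.3.
# Scaling one factor alone only buys you that factor's elasticity worth of
# growth, with diminishing returns; scaling everything together, including
# TFP (A), is what moves output a lot.
def output(A=1.0, L=1.0, K=1.0, alpha_l=0.6, alpha_k=0.3):
    return A * (L**alpha_l) * (K**alpha_k)

base = output()
print(f"10x labor only:          {output(L=10) / base:.1f}x output")  # ~4.0x
print(f"10x capital only:        {output(K=10) / base:.1f}x output")  # ~2.0x
print(f"10x labor, capital, TFP: {output(A=10, L=10, K=10) / base:.0f}x output")  # ~79x
```

Note that a Cobb-Douglas form actually understates the bottleneck argument made elsewhere in the conversation, since it lets a single factor raise output without bound; it is only meant to illustrate the relative-importance reading of the elasticities.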

Will there be a separate AI economy?

Dwarkesh Patel 01:33:00
One question that Daniel posed to me is, because I made this perspective about everything being interconnected when you were talking about… another example people often bring up is what would it take to build the iPhone in the year 1000? And it’s unclear how you could actually do that without just replicating every intermediate technology or most intermediate technologies.
And then he made the point like, OK, fine, whatever. Nanobots are not a crux here. The crux, at least to the thing he cares about, which is human control, is just by when can the robot economy, or the AI economy, whether it’s a result of capital deepening or whether it’s a result of R&D, by when will they have the robots? And they have more cumulative physical power?
Ege Erdil 01:33:50
Right. But he’s imagining a separate thing called the AI economy. Well, why would you imagine that? I think it’s probably downstream of his views about the software-only singularity. But again, those are views that we don’t share.
Tamay Besiroglu 01:34:01
So it’s just much more efficient for AI to operate in our economy and benefit from the existing supply chains and existing markets rather than set up shop on some island somewhere and do its own thing.
Ege Erdil 01:34:16
And then there’s a distinction that people aren’t clear about. For example, people might have the intuition- I brought this up before- there’s the distinction between what is the minimum possible amount of build-out that would be necessary to get this feedback loop up and running, and what would be the most efficient way to do it. Which are not the same question. But then people have this view that, oh, the most efficient thing in principle, we can’t do that because…
Dwarkesh Patel 01:34:36
I think the example he might give is when the conquistadors arrived in the New World, or when the East India Trading Company arrived in India, they did integrate into the existing economy. In many cases, it depends on how you define ‘integrate’, but the Spanish relied heavily on New World labor in order to do silver mining and whatever. With the East India Trading Company, the ratio of British people to Indian people was not that high, so they just had to rely on the existing labor force. But they were still able to take over because of… I don’t know what the analogous thing here is, but you see what I’m saying.
And so he’s concerned about, by when will they, even if they’re ordering components off of Alibaba or whatever- and sorry, I’m being trite, but you see what I’m saying. Even if they’re going to get into the supply chains, by when are they in a position where, because this part of the economy has been growing much faster, they could take over the government or…
Ege Erdil 01:35:40
If they wanted to?
Dwarkesh Patel 01:35:41
That’s right, yeah.
Ege Erdil 01:35:42
Okay. So I think that eventually you expect the AI systems to be driving most of the economy. And unless there are some very strange coincidences where humans are able to somehow uplift themselves and able to become competitive with the AIs by stopping being biological humans or whatever, which seems very unlikely early on, then AI is just going to be much more powerful. And I agree that in that world, if the AI is just somehow coordinated and decides, “okay, we should just like take over” or something, they just somehow coordinated to have that goal, then they could probably do it.
But, that’s also probably true in our world. In our world, if the US wanted to invade Sentinel Island, then probably they could do it. I don’t think anyone could stop them. But what does it actually mean? There’s this dramatic power imbalance, but that doesn’t mean… that doesn’t tell you what’s going to happen, right? Why doesn’t the US just invade Guatemala or something? Why don’t they do that? Seems like they could easily do it.
Dwarkesh Patel 01:36:53
Because the value to the US of…
Ege Erdil 01:36:56
Not that high, right?
Dwarkesh Patel 01:36:58
Yeah. So I agree that might be true for AIs because most of the shit is in space. And you want to do the capital deepening on Mars and the surface of the sun instead of like New York City.
Ege Erdil 01:37:13
I think it’s deeper than that. So it’s deeper than that. There’s also the fact that if the AIs are going to be integrated into our economy…
So basically they start out as a small part of our economy or our workforce, and over time they grow, and over time they become the vast majority of the actual work power in the economy. But they are growing in this existing framework where we have norms and rules for better coordination, and undermining those things has a cost. So yes, the things that are making the humans wealthier and more comfortable than they used to be, you would probably be better off if you could just take them from the humans. But the benefit to you, if you are already getting almost all of the income in the economy, will be fairly small.
Dwarkesh Patel 01:38:03
I feel like the Sentinel Islands thing, there’s one reference class that includes that. But historically, there’s a huge reference class that includes; East India Trading Company could have just kept trading with the Mughals, they just took over, right? They could have kept trading with the 50 different nation states in pre-colonial India. But yeah.
Ege Erdil 01:38:21
That’s right. I mean, that’s what they were initially doing. And then whatever. I’m not going to go into that subject.
Dwarkesh Patel 01:38:27
But that is the reference class…
Ege Erdil 01:38:30
I agree. I agree. So if the question is, if they have some totally different values and then they represent most of the economy, then would they take over? I still don’t know, because I’m not sure to what extent the class of all AI is a natural class. It’s sort of like, why don’t the young people in the economy coordinate?
Dwarkesh Patel 01:38:54
I agree that sometimes these kinds of class arguments are misused. For example, when Marxists are like, “why doesn’t this class rise up against the others?”
Daniel made the interesting argument that if you look at the history of the conquistadors, when Cortes was making his way through the New World, he had to actually go back and fight off a Spanish fleet that had been sent to arrest him and then go back. So you can have fights within the conquering AIs and it still nets out to the Native Americans getting disempowered.
But with AIs in particular, they’re just copies of each other. And in many other ways, they have lower transaction costs when they trade with each other or interact with each other. There are other reasons to expect them to be better at coordinating with each other than coordinating with the human world.
Ege Erdil 01:39:48
Sure. If the question is just that, “is it possible for that to happen?”, which is a weaker claim, then yeah, it seems possible. But there are, I think, a lot of arguments pushing back against it. Probably actually the biggest one is the fact that AI preferences are just not… Just look at the AIs we have today. Can you imagine them doing that? I think people just don’t put a lot of weight on that, because they think once we have enough optimization pressure and once they become super intelligent, they’re just going to become misaligned. But I just don’t see the evidence for that.
Dwarkesh Patel 01:40:24
I agree there’s some evidence that they’re good boys.
Ege Erdil 01:40:28
No, there’s more than some evidence.
Dwarkesh Patel 01:40:30
No, but there’s also some evidence… There’s a new OpenAI paper where, in chain of thought, reward hacking is such a strong basin that if you were like, “hey, let’s go solve this coding problem”, in the chain of thought they’ll just be like, “okay, let’s hack this and then figure out how to hack it.”
Ege Erdil 01:40:48
So imagine that you gave students at a school a test and then the answer key was like on the back.
Dwarkesh Patel 01:40:52
Right, but the reference class of humans does include Cortes and the East India Trading Company.
Ege Erdil 01:40:57
Sure.
Tamay Besiroglu 01:40:58
So I think one issue here is that I think people are doing this very kind of partial equilibrium analysis or something, where they’re thinking about the raw abilities of AI systems in a world where AI systems are dominant and human civilization has done very little in terms of integrating itself with the AI, and the AI integrating itself into the human world. Insofar as human civilization is poor at communicating and coordinating with AI, it has an incentive to address those deficiencies and improve that. Insofar as that’s posing a risk, or creating inefficiencies, because it’s unable to benefit from coordinating and trading, then it should have this enormous incentive to address that.
Even insofar as there is a lot of value to be gained from dominating and taking over humans, what you might get is a negotiated settlement instead. If that’s indeed the case, then a war would just be inefficient, and so you would want to negotiate some settlement that results in outcomes that are mutually beneficial.
Dwarkesh Patel 01:42:05
Compared to the counterfactual, not compared to… There was a mutually beneficial trade that was made between the Qing dynasty and the British in the Opium Wars, right? Maybe it was better than pre-industrial China going to war with the British Empire, but it wasn’t better than never having interacted with the British Empire in the first place.
Tamay Besiroglu 01:42:28
So I think one mistake that I feel people make is they have this very naive analysis of what creates conflict. And I think Matthew has written a bit about this, a colleague of ours, where they say there’s misalignment. And so that then creates conflict.
But that’s actually not what the literature on what causes conflict says creates conflict. It’s not just misalignment, it’s also other issues, like having a bad understanding of the relative strengths of your armies versus theirs, or having these very strong commitments where you think some ground is sacred, so you’re not willing to do any trade to give up some of it in order to gain something else. And so then you have to posit some additional things other than just the base value misalignment part.
Dwarkesh Patel 01:43:27
I think you’re making a good argument against, like, “humans take up the spears and the machetes and go to war against the AI data centers”, because maybe there’s not this asymmetric information that often leads to conflicts in history. But this argument does not address at all the risk of takeover, which can be the result of a peaceful negotiation, or of human society being like, “look, we’re totally outmatched, and we’ll just take these meager concessions rather than go to war”.
Tamay Besiroglu 01:43:57
But insofar as it’s more peaceful, then I think it’s like much less of a thing to worry about. I think there could be this trend where we indeed have this gradual process where AI is much more important in the world economy and actually deciding and determining what happens in the world. But this could be beneficial for humans where we’re getting access to this vast, much, much larger economy and much more advanced technological stock.
Ege Erdil 01:44:30
Yeah. So I think it’s important to be clear about what is the thing that you’re actually worried about. Because I think some people just say, “oh, humans are going to lose control of the future, we’re not going to be the ones making the important decisions”. And “we”, however you conceive of it, is also kind of nebulous.

But is that something to worry about? If you just think biological humans should remain in charge of all important decisions forever, then I agree, the development of AI seems like a problem for that. But in fact, other things also seem like a problem for that; I just don’t expect it to generically be true. Even if you don’t develop AI, if a million years from now biological humans, the way we recognize them today, are still making all the important decisions and they have something like the culture that we would recognize from ourselves today, I would be pretty surprised by that.
I think Robin Hanson has again talked about this, where he said a bunch of the things that people fear about AI are just things they fear about change and fast change. So the thing that’s different is that AI has a prospect of accelerating much of this change so that it happens in a narrower period.
Dwarkesh Patel 01:45:36
I think the worry isn’t just that the kind of change that would have happened anyway, from, let’s say, genetically modifying humans and blah, blah, blah, is instead happening in a compressed amount of time. I think the worry comes more from the fact that it’s not just that change compressed. It’s a very different vector of change.
Ege Erdil 01:45:53
Yeah, but what is the argument for that? I have never seen a good argument for this.
Tamay Besiroglu 01:45:58
You should expect a bunch of change if you accelerate just human change as well. You might expect different values to become much more dominant. You might expect people that don’t discount the future as much to be much more influential, because they save more and they make good investments that give them more control.
Ege Erdil 01:46:17
People who are higher risk tolerance.
Tamay Besiroglu 01:46:18
Higher risk tolerance. Because they are more willing to make bets that maximize expected value and so get much more influence. So just generically, accelerating human change would also result in a lot of things being lost that you might care about.
Dwarkesh Patel 01:46:34
I think the argument is that maybe the speed of the change determines what fraction of the existing population or stakeholders or whatever, have some causal influence on the future. And maybe the thing you care about is, look, there’s going to be change, but it’s not just going to be like one guy presses a button. That’s like the software singularity extreme. It’s more like over time norms change and so forth.

Can we predictably influence the future?

Ege Erdil 01:47:08
So if you’re looking at the software singularity picture, I agree that picture looks different. And again, I’m coming back to this because obviously Daniel, and maybe Scott to some extent, probably have this view that the software-only singularity is more plausible. And then we could end up in a situation where one person’s idiosyncratic preferences or something end up being more influential.
I agree that makes the situation look different from if you just have this broader process of automation. But even in that world, I think a lot of people have this view about things like value lock-in, where they think this moment is a pivotal moment in history. And then someone is going to get this AI, which is very powerful because of the software-only singularity. And then they’re just going to lock in some values. And then those values are going to be stable for millions of years.
And I think that just looks very unlike anything that has happened in the past. So I’m kind of confused why people think it’s very plausible. I think people have the argument that they see the future, again, in my view, in sort of ‘far mode’. They think there’s going to be one AI. It’s going to have some kind of utility function. That utility function is going to be very stable over time, so it’s not going to change, there won’t be this messiness of a lack of coordination between different AIs, or over time values drifting for various reasons, maybe because they become less functional in an environment, maybe because of other reasons. And so they just don’t imagine that. They say, “well, utility functions, we can preserve them forever. We have the technology to do that. So it’s just going to happen”. And I’m like, “well, that seems like such a weak argument to me”.
Tamay Besiroglu 01:48:50
Often the idea is, because this is digital you can preserve the information better and copy it with higher fidelity and so on. But actually, even if you look just at information on the internet, you have this thing called link rot, which happens very quickly. And actually, information that’s digital isn’t preserved for very long at all.
Dwarkesh Patel 01:49:15
And the point that Matthew was making is that the fact that the information is digital has led to- not maybe led to, but at least been associated with- faster cultural change.
Tamay Besiroglu 01:49:25
Cultural change, exactly.
Ege Erdil 01:49:26
I mean, basically technological changes can create incentives for cultural change just as they make preserving…
Dwarkesh Patel 01:49:32
I think there’s two key arguments that I’ve heard. One is that we will soon reach something called technological maturity. And one of the key ways in which society has been changing recently is- maybe actually its culture would have changed even more. Actually, no, I think this argument that you’re making is wrong, because we do know that language used to change a lot more. We can read everything that was written after the 1800s, when literacy became more common. But just go back a couple hundred years before that and you’re reading old English and it’s hard to understand. And that is a result of literacy and the codification of language.
Ege Erdil 01:50:09
Well, that information was better preserved. What about other kinds of cultural practices?
Dwarkesh Patel 01:50:12
But I think the argument would be that change was a result of technological change in general, not the result of information being digitized. And maybe that culture would have actually changed more if information wasn’t as well preserved or technology had continued to proceed. And the argument is, in the future we’re going to reach some point at which you’ve done all the tech, ideas have just gotten way too hard to find and you need to make a CERN that’s the size of a galaxy to progress physics an inch forward.
And at that point, this churn from technological growth that keeps turning over civilization goes away. And then you just have the digital thing, which does mean that lock-in is more plausible.
Tamay Besiroglu 01:51:00
So the technological maturity thing, I agree that results in this slowdown in change and growth and so on, and certain things might get more locked in relative to what preceded it. But then what do we do today about that? Well, what could you do to have a positive impact by our lights?
Robin Hanson had this question of what could someone do in the 1500s to have a positive impact on the world today from their point of view, knowing all they knew back then? I think this question is even worse than that, because I think the amount of change that happens between today and technological maturity is just orders of magnitude greater than whatever change happened between the 1500s and today.
So it’s an even worse position than someone in the 1500s thinking about what they could do to have a positive impact in expectation, like predictably positive today. And so I think it’s just pretty hopeless. I don’t know if we could do anything or find any candidate set of actions that would make things better post lock-in.
Ege Erdil 01:52:05
I mean, that’s assuming lock-in is going to happen, which is not…
Dwarkesh Patel 01:52:08
In the 1700s, a bunch of British abolitionists were making the case against slavery. And I don’t think there’s any in-principle reason why we couldn’t have been a slave society to this day, or more of the world couldn’t have slavery. I think what happened is just the convincing of British people that slavery is wrong, the British Empire put all its might into abolishing slavery and making that a norm.
I think another example is Christianity and the fact that Jesus has these ideals, you could talk about these ideals. I think the world is a more Christian place.
Ege Erdil 01:52:45
It is a more Christian place, sure.
Dwarkesh Patel 01:52:57
And also is like more of the kind of place- I’m not saying Jesus Christ would endorse every single thing that happens in the world today. I’m just saying he endorses this timeline more than one in which he doesn’t exist and doesn’t preach at all.
Ege Erdil 01:53:00
I don’t know, actually. I’m not sure if that’s true. It seems like a hard question.
Dwarkesh Patel 01:53:03
But I think, summed up from the Christian perspective, it was a favorable cultural development for the West.
Ege Erdil 01:53:07
I mean, you don’t know the counterfactual.
Dwarkesh Patel 01:53:09
I agree that is always true. I just think the world does have people who read the Bible and are like, “I’m inspired by these ideals to do certain things”. And it just seems like that’s more likely to lead to…
Ege Erdil 01:53:20
So that is what I would call a ‘legacy effect’ or something. You can say the same thing about languages, some cultures might just become more prominent and their languages might be spoken more, or some symbols might become more prominent. But then there are things like how do cities look, and how do cars look, and what do people spend most of their time doing in their day, and what do they spend their money on? And those questions seem much more determined by how your values change as circumstances change.
Dwarkesh Patel 01:53:49
That might be true, but I’m in the position with regards to the future where I expect a lot of things to be different and I’m okay with them being different. I care much more about the equivalent of slavery, which in this case is literally slavery.
Just to put a final point on it, the thing I really care about is there’s going to be trillions of digital beings. I want it to be the case that they’re not tortured and put into conditions in which they don’t want to work and whatever. I don’t want galaxies worth of suffering. That seems closer to British abolitionists being like, “let’s put our empire’s might against fighting slavery”.
Ege Erdil 01:54:25
I agree. But I would distinguish between the case of Christianity and the case of the end of slavery, because I think the end of slavery… I agree you can imagine a society, technologically it’s feasible to have slavery. But I think that’s not the relevant thing which brought it to an end.
The relevant thing is that the change in values associated with the Industrial Revolution made it so that slavery just became an inefficient thing to sustain in a bunch of ways. And a lot of countries at different times phased out different things you could call slavery. For example, Russia abolished serfdom in the 1860s. They were not under British pressure to do so. Britain couldn’t force Russia to do that, they just did that on their own. There were various ways in which people in Europe were tied to their land and they couldn’t move, they couldn’t go somewhere else. Those movement restrictions were lifted because they were inefficient.
There were ways in which the kind of labor that needed to be done in the colonies to grow sugar or various other crops was very hard labor. It was probably not the kind of thing you could have paid people to do, because they just wouldn’t want to do it; the health hazards and so on were very great, which is why people had to be forced to do it. And that kind of work became less prevalent in the economy over time.
So, again, that reduces the economic incentives to do it. I agree you could still do it.
Dwarkesh Patel 01:55:58
I would emphasize that the way you’re painting the counterfactual is like, “oh, but then in that world, they would have just phased out the remnants of slavery”. But there are a lot of historical examples where slavery wasn’t just hard labor, like Roman slavery.
Ege Erdil 01:56:14
Yes. It was different.
Dwarkesh Patel 01:56:16
And I interviewed a historian about it recently- the episode hasn’t come out yet- but he wrote a book about its scale. I think something like 20 percent of people under Roman control were slaves.
And this was not just agricultural slavery. His point was that the maturity of the Roman economy is what led to this level of slavery, because the reason slavery collapsed in Europe after the fall of the Roman Empire was because the economy just lost a lot of complexity.
Ege Erdil 01:56:50
Well, I’m not sure if I would say that slavery collapsed. I think this depends on what you mean by slavery. I mean in a lot of ways people in feudal Europe were…
Dwarkesh Patel 01:56:58
But his point is that serfdom was not the descendant institution of Roman slavery.
Ege Erdil 01:57:02
No, I agree, it was not a descendant. But the point I’m trying to make is that the values that exist at a given time- the values we will have in 300 years, or, from the perspective of someone a thousand years ago, the values people were going to have a thousand years later- are much more determined by the technological, economic, and social environment of that later time: which values are going to be functional, which values end up being more competitive and more influential so that other people adopt them. And it depends much less on the individual actions taken by people a thousand years earlier.
So I would say the abolitionist movement is not the cause of why slavery came to an end. Slavery comes to an end partly because people have natural preferences that I think were suppressed in various ways during the agricultural era, when it was more efficient to have settled, fairly authoritarian societies in cities that didn’t allow much freedom, and when you were in this Malthusian world where people had very low wages, perhaps compared to what they enjoyed in the hunter-gatherer era. It was just a different economic period, and I think people didn’t evolve to have the values that would be functional in that era.
So what happened is that there had to be a lot of cultural assimilation where people had to adopt different values. And with the Industrial Revolution people also became very wealthy compared to what they used to be, and that, I think, leads to different aspects of people’s values being expressed. People put a huge amount of value on equality; that has always been the case. But when it is sufficiently functional for that value to be suppressed, they are capable of suppressing it.
Dwarkesh Patel 01:59:01
I mean if that’s the story then this makes value alignment all the more important, because then you’re like “oh, if the AIs become wealthy enough they actually will make a concerted effort to make sure the future looks more like the utility function you put into them”, which I think you have been under-emphasizing.
Ege Erdil 01:59:18
No, I’m not under-emphasizing that. What I would say is there are certain things that are path-dependent in history, such that if someone had done something different, something had gone differently a thousand years ago, then today in some respects would look different. I think for example, which languages are spoken across which boundaries, or which religions people have, or fashion maybe to some extent, though not entirely.
Those things are more path-dependent, but then there are things that are not as path-dependent. So for example if some empire, like if the Mongols had been more successful and they somehow- I don’t know how realistic it is- but they became very authoritarian and had slavery everywhere, would that have actually led to slavery being a much more enduring institution a thousand years later? That seems not true to me.
The forces that led to the end of slavery don’t seem like contingent forces, they seem like deeper forces than that. And if you’re saying “well, if we align the AI today to some bad set of values then that could affect the future in some ways which are more fragile”, that seems plausible. But I’m not sure how much leverage you actually have at the present moment over the things you care about in the future, or over the ways in which you expect the future to get worse.
Dwarkesh Patel 02:00:40
I mean another example here might be factory farming, where you could say “oh, it’s not like us having better values over time led to suffering going down; in fact total suffering might have gone up, because the incentives that led to factory farming emerging are…”
Ege Erdil 02:00:56
And probably when factory farming comes to an end it will be because the incentives start going away, right?
Dwarkesh Patel 02:01:01
So suppose I care about making sure the digital equivalent of factory farming doesn’t happen. Maybe, all else being equal, it’s just more economically efficient to have suffering minds doing labor for you than non-suffering minds because of the intermediary benefits of suffering or something like that, right?
What would you say to somebody like me where I’m like “I really want that not to happen, I don’t want the lightcone filled with suffering workers” or whatever. Is it just like “we’ll give up because this is the way economic history is”?
Ege Erdil 02:01:40
No, I don’t think you should give up. It’s hard to anticipate the consequences of your actions in the very distant future. So I would just recommend that you should just discount the future. Not for a moral reason, not because the future is worthless or something, but because it’s just very hard to anticipate the effects of your actions. In the near-term I think there are things you can do that seem like they would be beneficial. For example, you could try to align your present AI systems to value the things that you’re talking about, like they should value happiness and they should dislike suffering or something.
You might want to support political solutions that would… Basically you might want to build up the capacity so that in the future if you notice something like this happening then we might have some ability to intervene. Maybe you would think about the prospect of “well eventually we’re gonna maybe colonize other stars and civilization might become very large and communication delays might be very long between different places”. And in that case competitive pressures between different local cultures might become much stronger because it’s harder to centrally coordinate.
And so in that world you might expect competition to take over in a stronger way, and if you think the result of that is going to be a lot of suffering, maybe you would try to stop that. Again, I think at this point it’s very far from obvious that trying to limit competition is actually a good idea- I would probably think it’s a bad idea- but maybe in the future we will receive some information and we’ll be like “oh, we were wrong, actually we should stop this”, and then maybe you want to have the capacity so that you can make that decision.
But that’s a nebulous thing. How do you build that up? Well I don’t know. That’s the kind of thing I would be trying to do.
Tamay Besiroglu 02:03:28
Yeah, I think the overall takeaway from the way that I think about it- and I guess we think about it- is: be more humble in what you think you can achieve, and just focus on the nearer term, not because it’s more morally important than the longer term, but because it’s much easier to have a predictably positive impact on it.
Dwarkesh Patel 02:03:49
One thing I’ve noticed over the last few weeks of thinking about these bigger future topics and interviewing Daniel and Scott and then you two, is how often I’ve changed my mind about everything from the smallest questions about when AI will arrive- it’s funny that that’s the small question in the grand scheme of things- to whether there will be an intelligence explosion, or whether it’ll be an R&D explosion, to whether there’ll be explosive growth, or how to think about that.
And if you’re in a position where you are incredibly epistemically uncertain about what’s going to happen, I think it’s important, instead of becoming super certain about your next conclusion, to just be like “well, let me take a step back, I’m not sure what’s going on here”. And I think a lot more people should operate from that perspective, unless you’ve had the same opinion about AI for many years, in which case I have other questions for you about why that’s the case. And generally, how we as a society deal with topics on which we are this uncertain is to have freedom and decentralization- both decentralized knowledge and decentralized decision making- take the reins, and not to make super high volatility centralized moves like “hey, let’s nationalize so we can make sure that the software-only singularity is aligned”, or moves that are incredibly contingent on one world view and brittle under other considerations.
And that’s become a much more salient part of my world view. I think classical liberalism is just the way we deal with being this epistemically uncertain, and I think we should be more uncertain than we’ve ever been in history, as opposed to many other people who seem more certain about this than they are about more mundane topics.
Tamay Besiroglu 02:05:44
Yeah, I think it’s very hard to predict what happens, because this acceleration basically means you find it much harder to predict what the world might be in 10 years’ time. These questions are also just very difficult, we don’t have very strong empirical evidence, and there’s a lot of this kind of disagreement.
Ege Erdil 02:06:10
I would say that it’s much more important in a lot of cases and a lot of situations to maintain flexibility and ability to adapt to new circumstances, new information, than it is to get a specific plan that’s going to be correct and that’s very detailed and has a lot of specific policy recommendations and things that you should do.
That’s actually also the thing that I would recommend if I want to make the transition to AI in this period of explosive growth go better. I would just prefer it if we in general had higher quality institutions, but I am much less bullish on someone sitting down today and working out “okay what will this intelligence explosion or explosive growth be like? What should we do?”
I think plans that you work out today are not going to be that useful when the events are actually occurring, because you’re going to learn so much stuff that you’re going to update on so many questions that these plans are just going to become obsolete.
Tamay Besiroglu 02:07:12
One thing you could do is look at, say, the history of war planning and how successful war planning has been at anticipating what actually happens once the war comes.
Ege Erdil 02:07:22
So for one example- I think I might have mentioned this off the record at some point- before the Second World War happened, obviously people saw that there were all these new technologies like tanks and airplanes and so on, which existed in World War I, but in a much more primitive form. So they were wondering, what is going to be the impact of these technologies now that we have them at much greater scale? And the British government had estimates of how many casualties there would be from aerial bombardment in the first few weeks of the Second World War. They expected hundreds of thousands of casualties within two or three weeks after the war begins.
So the idea was that air bombing is basically this unstoppable force, all the major urban centers are going to get bombed, tons of people will die, so basically we can’t have a war, because if there’s a war it will be a disaster because of this aerial bombardment. But later it turned out that that was totally wrong. In fact, in all of Britain there were fewer casualties from air bombing in the entire six years of the Second World War than the British government expected in the first few weeks of the war. They had fewer casualties in six years than they expected in three weeks.
So why did they get it wrong? Well, there are lots of boring practical reasons. For example, it turned out to be really infeasible, especially early on, to bomb cities in daytime because your aircraft would just get shot down, but if you tried to bomb at night, your bombing was really imprecise and only a very small fraction of it actually hit. People also underestimated the extent to which people on the ground, like firefighters and so on, could just go around the city and put out fires from bombs that were falling on structures. They overestimated the amount of economic damage it would do and underestimated how economically costly it would be for the attacker; you’re sending these aircraft and they’re getting shot down, and an aircraft is very expensive.
So in the end, how it turned out is that when the Allies started bombing Germany, for each dollar of capital they were destroying in Germany they were spending something like four to five dollars on the aircraft, fuel, and training of the pilots and so on that they were sending on missions, and the casualty rate was very high, which later got covered up by the government because they didn’t want people to worry about, you know…
So that is the kind of situation where all the planning you would have done in advance was predicated on this assumption of air bombing being a kind of “nuclear weapons-lite”- basically extremely destructive, there’s going to be some aspect to which…
Dwarkesh Patel 02:09:57
I mean it was though, right? 84,000 people died in one night of firebombing in Tokyo. Germany, large fractions of their…
Ege Erdil 02:10:07
But that was over the period of six years of war.
Dwarkesh Patel 02:10:11
Right, but there were single firebombing attacks. I mean it was the case that at the end of World War II, when they were looking for places to drop the atomic bombs, they had to go through like a dozen cities because it just wouldn’t be worth nuking them, since they were already destroyed by the firebombing.
Ege Erdil 02:10:28
That’s right, but the level of destruction that was expected within the space of a few weeks actually took many years, so there was a two order of magnitude mismatch or something like that, which is pretty huge. So that affected the way people thought about it.
Tamay Besiroglu 02:10:45
An important underlying theme of much of what we have discussed is how powerful pure reasoning about things is for making progress on the specific plans you want to make to prepare for this transition to advanced AI and make it go well.
And our view is, well it’s actually quite hard and you need to make contact with the actual world in order to inform most of your beliefs about what actually happens and so it’s somewhat futile to do a lot of wargaming and figure out how AI might go, and what we can do today to make that go a lot better, because a lot of the policies you might come up with might just look fairly silly.
And in thinking about how AI actually has this impact, again, people think “oh, AI reasoning about science and R&D just has this drastic impact on the overall economy or technology”, and our view is, well, actually, making contact with the real world and getting a lot of data from experiments and from deployment and so on is just very important.
So I think there is this underlying kind of latent variable which explains some of this disagreement, both on the policy prescriptions and about the extent to which we should be humble versus ambitious about what we ought to do today, as well as for thinking about the mechanism through which AI has this impact. And this underlying latent thing is, what is the power of reason? How much can we reason about what might happen? How much can reasoning in general figure things out about the world and about technology? And so that is a core underlying disagreement here.
Dwarkesh Patel 02:12:27
I do want to ask: you say in your announcement that you want to accelerate this broad automation of labor as fast as possible. As you know, many people think it’s a bad idea to accelerate this broad automation of labor and AGI and everything that’s involved there. Why do you think this is good?
Ege Erdil 02:12:49
So the argument for why it’s good is that we’re going to have this enormous increase in economic growth, which is going to mean enormous amounts of wealth, and incredible new products that you can’t even imagine, in health care or whatever. And like the quality of life of the typical person is probably going to go up a lot.
Early on, probably also their wages are going to go up, because the AI systems are going to be automating things that are complementary to their work. Or it’s going to be automating part of their work, and then you’ll be doing the rest and then you’ll be getting paid much more on that. And in the long term, eventually we do expect wages to fall just because of arbitrage with the AIs. But by that point, we think humans will own enormous amounts of capital and there will also be ways in which even the people who don’t own capital, we think are just going to be much better off than they are today.
I think it’s just hard to express in words the amount of wealth and increased variety of products that we would get in this world. It will probably be more than the difference between 1800 and today. So if you imagine that difference, it’s such a huge difference. And then we imagine two times, three times, whatever.
Dwarkesh Patel 02:13:58
The standard argument against this is why does the speed to get there matter so much? Especially if the trade-off against the speed is the probability that this transition is achieved successfully in a way that benefits humans?
Tamay Besiroglu 02:14:12
I mean, it’s unclear that this trades off against the probability of it being achieved successfully or something.
Dwarkesh Patel 02:14:17
There might be an alignment tax.
Tamay Besiroglu 02:14:20
I mean, maybe. You can also just do the calculation of how much a year’s worth of delay costs for current people. There is this enormous amount of utility that people are able to enjoy, and that gets brought forward by a year or pushed back by a year if you delay things by a year. And how much is this worth? Well, you can look at simple models of how concave people’s utility functions are and do some calculations, and maybe that’s worth on the order of tens of trillions of dollars per year in consumption.
That is roughly the amount of consumption people might be willing to give up in order to bring the date of automation forward by one year.
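For readers who want to see the shape of this back-of-envelope, here is a minimal sketch of the delay-cost calculation Tamay is gesturing at. The specific inputs (world consumption, the post-automation consumption multiple, log utility) are illustrative assumptions, not figures from the conversation.

```python
import math

# Toy sketch of the "cost of a year's delay" calculation.
# All inputs are illustrative assumptions, not figures from the conversation.
world_consumption = 80e12       # assumed current world consumption, ~$80 trillion/year
post_automation_multiple = 10   # assumed multiple of today's consumption after broad automation

# With log (very concave) utility, moving from consumption c to m*c is worth
# log(m) "log-consumption units" per year it arrives earlier.
welfare_gain_logs = math.log(post_automation_multiple)

# Crudely valuing one log-unit at today's consumption gives a dollar figure
# for one year of earlier arrival.
value_of_one_year = welfare_gain_logs * world_consumption
print(f"~${value_of_one_year / 1e12:.0f} trillion per year of delay (toy numbers)")
# log(10) * $80T is roughly $184 trillion; more conservative assumptions about
# concavity and the consumption multiple land in the tens of trillions.
```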
Dwarkesh Patel 02:15:03
In absolute terms, it’s high. In relative terms- if you did think it was going to nudge the probability one way or another of building systems that are aligned and so forth- it’s small compared to all of the future.
Ege Erdil 02:15:18
I agree. So there are a couple of things here.
First of all, I think the way you think about this matters. We don’t actually think it’s clear whether speeding things up or slowing things down makes a doomy outcome more or less likely. That’s a question that doesn’t seem obvious to us, partly because of our views on the software R&D side. We don’t really believe that if you just pause and then do research for 20 years at a fixed level of compute, you’re actually going to make that much progress on the relevant questions in alignment.
Imagine you were trying to make progress on alignment in 2016 with the compute budgets of 2016. Well, you would have gotten basically nowhere. You would have discovered none of the things that people have discovered today and that turned out to be useful. And I think if you pause today, then we will be in a very similar position in 10 years: we would not have made a bunch of discoveries. So the scaling is just really important for making progress in alignment, in our view. And then there’s a separate question of how longtermist you should be, in various different senses.
So there’s the moral sense of how much you should actually care about people who are alive today as opposed to people who are not yet born, as just a moral question. And there’s also the practical question, as we discussed, of how certain you can be about the impact your present actions are actually going to have on the future.
Dwarkesh Patel 02:16:43
OK, maybe you think it really doesn’t matter whether you slow things down right now or speed things up right now. But is there some story about why speeding things up actually helps from the alignment perspective, why it’s good to have that extra progress right now rather than later on?
Or is it just that, well, if it doesn’t make a difference either way, then it’s better to just get that extra year of people not dying and having cancer cures and so forth?
Ege Erdil 02:17:06
I think I would say the second. But it’s just important to understand the value of that. Even in purely economic terms, imagine that each year of delay might cause maybe 100 million people- maybe more, maybe 150, 200 million people- who are alive today to end up dying. So even in purely economic terms, the value of a statistical life is pretty enormous, especially in Western countries. Sometimes people use numbers as high as $10 million for a single life. So imagine you do $10 million times 100 million people. That’s a huge number, right?
That is so enormous that I think for you to think that speeding things up is a bad idea, you have to first have this long-termist view where you look at the long run future. You think your actions today have high enough leverage that you can predictably affect the direction of the long run future.
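To make the arithmetic explicit, the value-of-statistical-life multiplication Ege is doing above works out as follows; the sketch only restates the numbers he cites.

```python
# Restating Ege's value-of-statistical-life arithmetic from above.
value_of_statistical_life = 10e6   # $10 million per life, the high-end figure he cites
lives_affected_per_year = 100e6    # ~100 million people per year of delay, his illustrative number

cost_per_year_of_delay = value_of_statistical_life * lives_affected_per_year
print(f"${cost_per_year_of_delay:,.0f}")  # $1,000,000,000,000,000, i.e. about $1 quadrillion
```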
Dwarkesh Patel 02:18:10
Well, in this case, it’s kind of different because you’re not saying “I’m going to affect what some emperor a thousand years from now does” like somebody in the year zero would have to do to be a long-termist. In this case, you just think there’s this incredibly important inflection point that’s coming up and you just need to have influence over that crucial period of explosive growth of intelligence explosion or something. So I think it is a much more practicable prospect than…
Ege Erdil 02:18:36
So I agree in relative terms. In relative terms, I agree the present moment is a moment of higher leverage and you can expect to have more influence. I just think in absolute terms, the amount of influence you can have is still quite low. So it might be orders of magnitude greater than it would have been 2000 years ago and still be quite low.
Tamay Besiroglu 02:18:54
And again, I think there’s this difference in opinion about how broad and diffuse this transformation ends up being, versus how concentrated within specific labs where the very idiosyncratic decisions made by that lab will end up having a very large impact.
If you think those developments will be very concentrated, then you think the leverage is especially great, and so you might be especially excited about having the ability to influence how that transition goes. But our view is very much that this transition happens very diffusely, by way of many, many organizations and companies doing things, and that those actions are determined largely by economic forces rather than by idiosyncratic preferences on the part of labs, or by decisions that have founder effects that last for a very long time.

Arms race dynamic

Dwarkesh Patel 02:19:48
Okay, let’s go through some of the objections to explosive growth, given that most people are actually more conservative, not more aggressive, than your forecasts. Obviously one of the people who has articulated their disagreements with your view is Tyler Cowen. He made an interesting point when we did the podcast together. He said, “most of Sub-Saharan Africa still does not have reliable clean water. The intelligence required for that is not scarce. We cannot so readily do it. We are more in that position than we might like to think along other variables.”
Tamay Besiroglu 02:20:22
I mean we agree with this. I think intelligence isn’t the bottleneck that’s holding back technological progress or economic growth. It’s like many other things. And so this is very much consistent with our view that scaling up your overall economy, accumulating capital, accumulating human capital, having all these factors scale…
Ege Erdil 02:20:45
In fact this is even consistent with what I was saying earlier, when I pointed out that good management and good policies just contribute to TFP and can be bottlenecks.
Dwarkesh Patel 02:20:55
Like right now we could just plug-and-play our better management into Sub-Saharan Africa.
Ege Erdil 02:21:02
No we can’t.
Tamay Besiroglu 02:21:03
It’s hard. I don’t think we can.
Dwarkesh Patel 02:21:05
Okay so maybe I should have said, one could theoretically imagine plugging and playing…
Ege Erdil 02:21:10
I agree.
Tamay Besiroglu 02:21:12
I can imagine many things.
Dwarkesh Patel 02:21:14
But we cannot so readily do it because of… it’s hard to articulate why, and it wouldn’t be so easy to fix with just capital or labor. Why not think that the rest of the world will be in this position with regard to the advances that AI will make possible?
Tamay Besiroglu 02:21:32
I mean if the AI advances are like the kind of geniuses in a data center, then I agree that that might be bottlenecked by the rest of the economy not scaling up and being able to accumulate the relevant capital to make those changes feasible. So I kind of agree with this picture and I think this is an objection to the “geniuses in a data center” type view, and I buy basically this.
Ege Erdil 02:21:57
And also, it’s plausible you’re going to have the technology, but then some people are not going to want to deploy it, or some people are going to have norms and laws and cultural things that make it so that AI cannot be widely deployed in their economy- or not as widely deployed as it otherwise might be. And that is going to make those countries or societies just slower. Some countries will be growing faster, just like Britain and the Netherlands were the leaders in the Industrial Revolution- they were the first countries to start experiencing rapid growth- and then other countries, even in Europe, had to come from behind.
Well, again, we expect the same thing to be true for AI. And the reason that happened was exactly because of these kinds of reasons: those countries had cultures or governance systems or whatever that were just worse, and that bottlenecked the deployment and scaling of the new technologies and ideas. It seems very plausible.
Dwarkesh Patel 02:22:53
But you’re saying as long as there’s one jurisdiction?
Ege Erdil 02:22:55
Yeah.
Dwarkesh Patel 02:22:56
But then again you also previously emphasized the need to integrate with the rest of the global economy and the human economy. So doesn’t that contradict…?
Tamay Besiroglu 02:23:05
That doesn’t necessarily require cultural homogeneity. We trade with countries- the US trades with China quite a lot, actually. And there’s a bunch of disagreement…
Dwarkesh Patel 02:23:15
But what if the US is like “I don’t like that the UAE is doing explosive growth with AI, we’re just going to embargo them”.
Tamay Besiroglu 02:23:22
That seems plausible.
Dwarkesh Patel 02:23:24
And then would that not prevent explosive growth?
Tamay Besiroglu 02:23:26
I think that would be plausible at the point at which it’s revealing a lot about the capabilities and the power of AI, yeah. And you should also think that that creates not just an incentive to embargo, but also an incentive to adopt the similar styles of governance that enable AI to produce a lot of value.
Dwarkesh Patel 02:23:48
What do you make of this: I think people interpret explosive growth from an arms race perspective, and that’s often why people think in terms of public-private partnerships for the labs themselves. But there’s this idea that you have the geniuses in the data center and you can have them come up with the mosquito drone swarms. And then those drone swarms will, like if China gets to the swarms earlier… Even within your perspective, is this a result of your whole economy being advanced enough that you can produce mosquito drone swarms?
Does being six months ahead mean that you could decisively win? I don’t know. Maybe being a year ahead in explosive growth means you could decisively win a war against China, or China could win a war against you. So would that lead to an arms race-like dynamic?
Ege Erdil 02:24:33
I mean, I think it would to some extent, but I’m not sure I would expect a year of lead to be enough to take that risk, because if you go to war with China… For example, replace China today with China from 1990, or replace Russia today with Russia from 1970 or 1980. It’s possible that their ICBMs and whatever technology are already enough to provide a very strong deterrent.
So maybe even that technological lead is not sufficient so that you would feel comfortable going to war. So that seems possible.
Dwarkesh Patel 02:25:13
Yeah. And actually this relates to a point that Gwern was making which is that this is going to be a much more unstable period than the Industrial Revolution, even though the Industrial Revolution saw many countries gain rapid increases in their capabilities, because within this span, if you’ve got a century’s worth of progress compressed within a decade, one country gets to ballistic missiles first, then the other country gets to railroads first, and so forth.
But if you have this more integrated perspective about what it takes to get to ballistic missiles and to railroads, then you might think “no, basically this isn’t some orthogonal vector. You’re just churning on the tech tree further and further”.
Ege Erdil 02:26:01
I mean, for what it’s worth, I do think it’s possible. If it just happens in a few countries which are relatively large and have enough land or something, those countries would be starting from a lower base compared to the rest of the world, so they would need to catch up to some extent if they’re just going to grow internally and not depend on external supply chains. But that doesn’t seem impossible to me. Some countries could do it; it would just be more difficult.
But in this setting, if some countries have a significant policy advantage over the rest of the world and they start growing first, they won’t necessarily have a way to get other countries to adopt their norms and culture. So in that case it might be more efficient for them to do the growth locally. That’s why I was saying the growth differentials will probably be determined by regulatory jurisdiction boundaries more than anything else. Say the U.S. by itself had AI but couldn’t get the rest of the world to adopt it- I think that would still be sufficient for explosive growth.
Dwarkesh Patel 02:27:03
How worried should we be about the fact that China today, because it industrialized relatively recently, has more industrial capacity and know-how and all the other things that come from learning by doing and so forth? If we buy your model of how technology progresses, with or without AI, are we just underestimating China because we have this perspective that what fraction of your GDP you spend on research is what matters, when in fact it’s the kind of thing where I’ve got all the factories in my backyard, I know how they work, and I can go buy a component whenever I want?
Tamay Besiroglu 02:27:41
I don’t think people are necessarily underestimating China, it depends on who you’re looking at, but it seems like the discussion of China is just this very big discussion in these AI circles, right?
And so people are very much appreciating the power and the potential threat that China poses. But I think the key thing is not just the scale in terms of pure number of people or like number of firms or something, but the scale of the overall economy, which is just measured in how much is being produced in terms of dollars. There, the U.S. is ahead.
Dwarkesh Patel 02:28:14
But we’re not expecting all this explosive growth to come from financial services. We’re expecting it to start from a base of industrial technology and industrial capacity.
Ege Erdil 02:28:25
No, financial services can be important if you want to scale very big projects.
Tamay Besiroglu 02:28:29
Financial services are very important for raising funding and getting investments in data centers.
Dwarkesh Patel 02:28:35
If I understood you correctly, it just seems like, man, China knows how to build the robot factories and so forth. That know-how which, in your view, is so crucial to technological growth and general economic growth is what we’re lacking. We might have more advanced financial services, but the more you take your view seriously, the more it seems like having Shenzhen locally matters a lot.
Tamay Besiroglu 02:29:00
I mean relative to what starting point? I think people already appreciate that China is very important. And then I agree that there are some domains where China is leading, but then there are very many domains in which the U.S. is leading, or the U.S. and its allies, where countries that are producing relevant inputs for AI that the U.S. has access to, but China doesn’t.
So I think the U.S. is just ahead on many dimensions, and there are some where China is ahead or at least very close. So I don’t think this should cause you to update very strongly in favor of China being a much bigger deal, at least depending on where you start.
Ege Erdil 02:29:40
I think people already think China is a big deal like this is the big underlying thing here. Like if we were just very dismissive of China, then maybe this would be a reason to update.

Is superintelligence a real thing?

Dwarkesh Patel 02:29:48
I get your argument that thinking about the economy-wide acceleration is more important than focusing on the IQ of the smartest AI. But at the same time, do you believe in the idea of superhuman intelligence? Is that a coherent concept, in the way that you don’t necessarily stop at human-level Go play and you just go way beyond it in Elo score?
Will we get to systems that are like that with respect to the broader range of human abilities? And maybe that doesn’t mean they become God, because there’s other ASIs in the world. But you know what I mean, will there be systems with such superhuman capabilities?
Tamay Besiroglu 02:30:27
Yeah I mean I do expect that. I think there’s a question of how useful is this concept for thinking about this transition to a world with much more advanced AI. And I don’t find this a particularly meaningful or helpful concept.
I think people introduce some of these notions that on the surface seem useful, but then actually when you delve into them it’s very vague and kind of unclear what you’re supposed to make of this. And you have this notion of AGI which is distinguished from narrow AI in the sense that it’s much more general and maybe can do everything that a human can do on average. AI systems have these very jagged profiles of capability. So you have to somehow take some notion of average capabilities and what exactly does that mean, it just feels really unclear.
And then you have this notion of ASI, which is AGI in the sense that it’s very general but then it’s also better than humans on every task. And is this a meaningful concept? I guess it’s coherent. I think this is not a super useful concept, because I prefer just thinking about what actually happens in the world. And you could have a drastic acceleration without having an AI system that can do everything better than humans can do. I guess you could have no acceleration when you have an ASI that is better than humans at everything, but it’s just very expensive or very slow or something. So I don’t find that particularly meaningful or useful. I just prefer thinking about the overall effects on the world and what AI systems are capable of producing those types of effects.
Dwarkesh Patel 02:32:06
Yeah, I mean one intuition pump here is: compare John von Neumann with a human plucked from the standard distribution. If you added a million John von Neumanns to the world, what would the impact on growth be compared to just adding a million people from the normal distribution?
Ege Erdil 02:32:25
Well I agree it would be much greater.
Dwarkesh Patel 02:32:27
Right. But then there are the Moravec’s paradox-type arguments you made earlier, that evolution has not necessarily optimized us for very long along the kind of spectrum on which John von Neumann is distinguished from the average human. And given that already within this deviation you get this much greater economic impact, why not focus on further optimizing this thing that evolution has not optimized that hard?
Ege Erdil 02:32:51
I don’t think we shouldn’t focus on that. But what I would say is, for example if you’re thinking about the capabilities of Go-playing AIs, then the concept of a superhuman Go AI, yeah, you can say that is a meaningful concept. But if you’re developing the AI, it’s not a very useful concept. If you just look at the scaling curve, it just goes up and there is some human level somewhere. But the human level is not privileged in any sense.
So the question is, is it a useful thing to be thinking about? And the answer is probably not. Depends on what you care about. So I’m not saying we shouldn’t focus on trying to make the system smarter than humans are, I think that’s a good thing to focus on.
Dwarkesh Patel 02:33:31
Yeah, I guess I’m trying to understand whether we will stand in relation to the AIs of 2100 the way other primates stand in relation to humans. Is that the right mental model, or are we going to have much greater familiarity with their cognitive horizons?
Tamay Besiroglu 02:33:49
I think AI systems will be very diverse, and so it’s not super meaningful to ask something about this very diverse range of systems and where we stand in relation to them.
Dwarkesh Patel 02:33:59
I mean, will we be able to cognitively access the kinds of considerations they can take on board? Humans are diverse, but no chimp is going to be able to understand this argument the way another human might, right? So if I’m trying to think about my place, or a human’s place, in the world of the future, is the relevant concept just that the economy has grown a lot and there’s much more labor, or are there beings who are, in this crucial sense, superintelligent?
Tamay Besiroglu 02:34:28
I mean there will be many things that we just will fail to understand, and to some extent there are many things today that people don’t understand about how the world works and how certain things are made. And then how important is it for us to have access or in principle be able to access those considerations?
And I think it’s not clear to me that that’s particularly important that any individual human should be able to access all the relevant considerations that produce some outcome. That just seems like overkill. Why do you need that to happen? I think it would be nice in some sense. But I think if you want to have a very sophisticated world where you have very advanced technology, those things will just not be accessible to you.
So you have this trade-off between accessibility and maybe how advanced the world is. And from my point of view I’d much rather live in a world which has very advanced technology, has a lot of products that I’m able to enjoy, and a lot of inventions that I can improve my life with, if that means that I just don’t understand them. I think this is a very simple trade that I’m very willing to make.

Reasons not to expect explosive growth

Dwarkesh Patel 02:35:45
Okay so let’s get back to objections to explosive growth. We discussed a couple already. Here’s another which is more a question than an objection: Where is all this extra output going? Who is consuming it? If the economy is 100X bigger in a matter of a decade or something, to what end?
Ege Erdil 02:36:05
So first of all I think even if you view that along what you might call the intensive margin in the sense that you just have more of the products you have today, I think there will be a lot of appetite for that. Maybe not quite 100X, that might start hitting some diminishing returns.
Tamay Besiroglu 02:36:23
Current GDP per capita on average in the world is $10K a year or something, right? And there are people who enjoy millions of dollars a year and don’t seem to be super diminished in terms of marginal utility. So there’s a lot of room on purely the intensive margin of consuming the things we consume today, just more of them. And then there is this maybe much more important dimension along which we will expand, which is…
Ege Erdil 02:36:52
Product variety.
Tamay Besiroglu 02:36:53
Yeah, extensive margin of what is the scope of things that you’re consuming. And if you look at something like the Industrial Revolution, that seemed to have been the main dimension along which we expanded to consume more. In any kind of sector that you care about, transportation, medicine, entertainment, and food, there’s just this massive expansion in terms of variety of things that we’re able to consume that is enabled by new technology or new trade routes or new methods of producing things. So that I think is really the key thing that we will see come along with this kind of expansion in consumption.
Dwarkesh Patel 02:37:35
Another point that Tyler makes is that there will be some mixture of Baumol cost disease, where you’re bottlenecked by the slowest growing thing. The fastest productivity things basically diminish their own…
Ege Erdil 02:37:56
Share in output.
Dwarkesh Patel 02:37:57
That’s right, yeah.
Tamay Besiroglu 02:37:59
I mean, we totally agree with that. But that’s just a qualitative consideration. It isn’t by itself sufficient to make a prediction about what growth rates are permitted given these effects; you need to make additional assumptions to turn it into a quantitative prediction. So I think it’s a little bit…
Ege Erdil 02:38:24
So the convincing version of this argument would be if you did the same thing we were doing earlier with the software-only singularity argument, where we were pointing to essentially the same objection, that there are multiple things that can bottleneck progress. I would be much more convinced if someone pointed to an explicit thing: here, health care is this very important thing, and why should we expect AI to make that better? That doesn’t seem like it would get better because of AI. So maybe health care just becomes a big part of the economy and then that’s the bottleneck. So if there were some specific sector…
Dwarkesh Patel 02:38:58
Maybe the argument is that if there is even one…
Ege Erdil 02:39:00
No, if there’s one though, if that’s a small part of the economy then you could just still get a lot of growth. You just automate everything else and that is going to produce a lot of growth.
Tamay Besiroglu 02:39:09
So it has to quantitatively work out. And so you actually have to be quantitatively specific about what this objection is supposed to be.
Ege Erdil 02:39:15
Right. So first of all you have to be specific about what these tasks are. What is the current share in economic output?
The second thing is you have to be specific about how bad you think the complementarities are. In numerical terms, economists use the concept of elasticity of substitution to quantify this. That gives you a numerical estimate of, if you have much more output on some dimensions but not that much on others, how much that increases economic output overall.
And then there’s a third question. You can also imagine you automate a bunch of the economy. Well, a lot of humans were working on those jobs. So now, well they don’t need to do that anymore because those got automated. So they could work on the jobs that haven’t yet been automated. So as I gave the example earlier, you might imagine a world in which remote work tasks get automated first, and then sensory-motor skills lag behind. So you might have a world in which software engineers become physical workers instead.
Of course, in that world the wages of physical workers will be much higher than their wages are today. So that reallocation also produces a lot of extra growth, even if bottlenecks are maximally powerful, even if you just look at all the tasks in the economy and literally take the worst one for productivity growth, you would still get a lot of increase in output because of this reallocation.
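To see how this elasticity-of-substitution point can be made quantitative, here is a minimal CES toy model in the spirit of what Ege describes: one automated task bundle scales a lot, a bottleneck bundle scales only a little, and the elasticity controls how hard the bottleneck bites. The parameter values are illustrative assumptions, not estimates from their paper or from this conversation.

```python
# Toy CES exercise: how much does a non-automated bottleneck limit output growth?
# Parameter values are illustrative assumptions, not estimates from the conversation.

def ces_output(x_auto: float, x_bottleneck: float,
               share_auto: float = 0.7, sigma: float = 0.5) -> float:
    """CES aggregate of an automated task bundle and a bottleneck bundle.

    sigma < 1 means the bundles are complements, so bottlenecks bite.
    """
    rho = (sigma - 1) / sigma
    return (share_auto * x_auto**rho + (1 - share_auto) * x_bottleneck**rho) ** (1 / rho)

baseline = ces_output(1.0, 1.0)

# Automated tasks scale 100x; the bottleneck sector only 2x,
# e.g. via displaced workers reallocating into it, as described above.
scaled = ces_output(100.0, 2.0)

print(f"output multiple: {scaled / baseline:.1f}x")
# With these numbers output still rises roughly sixfold despite strong
# complementarity; the bottleneck's share and sigma determine exactly how much.
```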
Tamay Besiroglu 02:40:35
So one point that I think is useful to make: our experience talking to economists about this is that they will bring up these more qualitative considerations, whereas the arguments that we make give specific quantitative predictions about growth rates. So for example you might ask “how fast will the economy double?” And then we can think about it: there are estimates of how much computation the human brain does per second- it’s about 1e15 FLOP or so, it’s a bit unclear- and it turns out that an H100 does roughly that order of computation. And so you can ask the question, “how long does it take for an H100 to pay itself back?”
Ege Erdil 02:41:21
If you run the software of the human brain.
Tamay Besiroglu 02:41:22
If you run the software of the human brain, you can then deploy that in the economy and earn, say, human wages on the order of $50 to $100K a year in the US. And then it pays itself back, because an H100 costs on the order of $30K. So you get a doubling time of maybe on the order of a year.
And so this is a very quantitatively specific prediction. And then there’s the response, “well, you have Baumol effects”. Well, what does this mean? Does this predict it doubles every two years, or every five years? You need more assumptions in order to make this a coherent objection.
And so I think a thing that’s a little bit confusing is just that there are these qualitative objections that I agree with, like bottlenecks are indeed important, which is part of the reason I’m more skeptical of this ‘software singularity’ story. But I think this is not sufficient for blocking explosive growth.
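For completeness, the H100 payback arithmetic above can be written out directly; only the figures Tamay mentions are used, and the premise that such a chip could earn a human wage is his hypothetical, not an established fact.

```python
# Tamay's H100 payback arithmetic, using only the figures mentioned above.
h100_cost = 30e3    # ~$30K per H100
human_wage = 75e3   # midpoint of the $50-100K/year US wage range he cites
# Assumption (his hypothetical): an H100 running "the software of the human brain"
# earns roughly a human wage when deployed in the economy.

payback_years = h100_cost / human_wage
print(f"payback time: {payback_years:.2f} years")  # ~0.4 years

# If the earnings are reinvested in more H100s, the payback time is roughly the
# doubling time of the capital stock; adding energy, data-center, and other costs
# stretches this toward the "on the order of a year" figure quoted above.
```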
Dwarkesh Patel 02:42:23
The other objection that I’ve heard often- and it might have a similar response from you- is this idea that a lot of the economy is comprised of O-ring-type activities. And this refers to, I think, the Challenger space shuttle explosion. There is just one component- I forgot what the exact problem with the O-ring was- but because of that being faulty the whole thing collapsed.
Tamay Besiroglu 02:42:48
I mean I think it’s quite funny actually because the O-ring model is taking the product of many, many inputs, and then the overall output is the product of very many things. But actually this is pretty optimistic from the point of view of having fewer bottlenecks.
Ege Erdil 02:43:08
I think we pointed this out before, which again, talking about software only singularity, I said if it’s the product of computer experiments with research…
Dwarkesh Patel 02:43:14
But if one of those products …
Ege Erdil 02:43:15
Is zero.
Dwarkesh Patel 02:43:16
Because of human…
Tamay Besiroglu 02:43:17
But you have constant marginal product there, right?
Ege Erdil 02:43:19
Yeah, but if one of those products doesn’t scale that doesn’t limit- like yeah, it means you’re less efficient at scaling than you otherwise would be, but you can still get a lot of…
Tamay Besiroglu 02:43:30
You can just have unbounded scaling in the O-ring world. So actually I disagree with Tyler, that he’s not conservative enough, that he should take his bottlenecks view more seriously than he actually is. And yet I disagree with him about the conclusion. And I think that we’re going to get explosive growth once we have AI that can flexibly substitute.
Dwarkesh Patel 02:43:50
I’m not sure I understand. There will be entirely new organizations that AIs come up with- we’ve written a blog post together about one such, the AI firms. And you might be a productive worker or a productive contributor in an existing organization as it exists today, but in the AI world many humans might just be zero or even minus…
Ege Erdil 02:44:11
I agree.
Dwarkesh Patel 02:44:13
Why won’t that… put that in the multiplication.
Tamay Besiroglu 02:44:18
But why would humans be in the loop there?
Ege Erdil 02:44:21
You’re both saying that humans would be negatively contributing to output. But then you’re also saying that we should put them into the…
Dwarkesh Patel 02:44:31
Okay, fair, fair, fair. The main objection often is regulation. I think we’ve addressed it implicitly at different points, but we might as well address it explicitly: why won’t regulation stop this?
Ege Erdil 02:44:43
Yeah. So for what it’s worth, we do have a paper where we go over all the arguments for and against explosive growth. And regulation, I think, is the one that seems strongest as ‘against’.
The reason it seems strong is this: we have made arguments about international competition, about variation of policies among jurisdictions, and about the strong incentives to adopt this technology for both economic and national security reasons. I think those are pretty compelling taken together. But even still, the world does have a surprising ability to coordinate on just not pursuing certain technologies.
Dwarkesh Patel 02:45:18
Right. Human cloning…
Ege Erdil 02:45:20
That’s right. So I think it’s hard to be extremely confident that this is not going to happen. I think it’s less likely that we’re going to do this for AI than it is for human cloning, because I think human cloning touches on some other taboos and so on.
Tamay Besiroglu 02:45:38
And also is less valuable.
Ege Erdil 02:45:39
Also less valuable. And probably less important also for national security in an immediate sense. But at the same time, as I said, it’s just hard to rule this out.
So if someone said “well I think there’s a 10 percent or 15 percent, whatever, 20 percent chance that there will be some kind of global coordination of regulation and that’s going to just be very effective. Maybe it will be enforced through sanctions on countries that defect or you know.
And then maybe it doesn’t prevent AI from being deployed, but maybe just slows things down enough that you never quite get explosive growth”. I don’t think that’s an unreasonable view. It’s like 10 percent chance it could be.
Dwarkesh Patel 02:46:17
I don’t know if there’s any… I don’t know. Do you encounter any other…
Ege Erdil 02:46:24
Any other objections?
Dwarkesh Patel 02:46:25
What should I be hassling you about?
Ege Erdil 02:46:27
I mean, some things we’ve heard from economists… People sometimes respond to our argument about explosive growth, which is an argument about growth rates- we’re saying “we’re going to see 30 percent growth per year, instead of 3 percent”- with an objection about levels. They say “well, how much more efficient, how much more valuable can you make hairdressing, or taking flights, or going to a restaurant?” And that is just fundamentally the wrong kind of objection.
We’re talking about the rate of change, and you’re objecting to it by making an argument about the absolute level of productivity. And as I said before, it is not an argument that economists themselves would endorse if it was made about a slower rate of growth continuing for a longer time. So it seems more like special pleading…
Dwarkesh Patel 02:47:20
I mean, why not just the deployment thing? The same argument you made about AI, where you learn a lot just by deploying to the world and seeing what people find useful- ChatGPT was an example of this. Why won’t a similar thing happen with AI products and services, where one of the components is that you put it out into the marketplace, people play with it, you find out what they need, and it gets woven into the existing supply chain and so forth? Doesn’t that take time?
Tamay Besiroglu 02:47:49
I mean it takes time but it is often quite fast. In fact, ChatGPT grew extremely fast.
Dwarkesh Patel 02:47:55
Right, but that was just purely digital service.
Ege Erdil 02:47:57
One reason to be optimistic: if you think the AIs will literally be drop-in remote workers- or drop-in physical workers in some cases, once you have robotics- then companies are already experienced at onboarding humans, and onboarding humans doesn’t take very long. Maybe it takes six months, even in a particularly difficult job, for a new worker to start being productive. Well, that’s not that long.
So I don’t think that would rule out companies being able to onboard AI workers, assuming that they don’t need to make a ton of new complementary innovations and discoveries to take advantage. I think one way in which current AI systems are being inhibited and the reason we’re seeing the growth maybe be slower than you might otherwise expect, is because companies in the economy are not used to working with this new technology, they have to rearrange the way they work in order to take advantage of it.
But if AI systems were literally able to substitute for human workers then, well, the complementary innovations might not be as necessary.

Fully automated firms

Dwarkesh Patel 02:49:00
Actually this is a good excuse to go to the final topic, which is AI firms. We wrote a blog post together about what it would be like to have a firm that is fully automated, and the crucial point we were making was that people tend to think of AI too much from the perspective of how smart individual copies will be.
And if you actually want to understand the ways in which they are superhuman, you want to focus on their collective advantages, which biology precludes us from having: the fact that they can be copied with all their tacit knowledge. You can copy a Jeff Dean or an Ilya Sutskever, or whoever the relevant person is in a given domain. You can even copy Elon Musk, and he can be every single engineer at SpaceX. And if that’s not an efficient way to…
Tamay Besiroglu 02:49:49
The AI equivalent of them.
Dwarkesh Patel 02:49:50
And if it’s not best to have Elon Musk or anything, you just copy the relevant team or whatever. And we have this problem with human firms, where there can be very effective teams or groups, but over time their culture dilutes, or the people leave, or die, or get old. And this is one of the many problems that can be solved with these digital firms.
Firms right now have two of the three relevant criteria for evolution: they have selection and they have variation, but they don’t have high-fidelity replication. And you could imagine a much more fast-paced and intense process of evolution for firms once this final piece clicks into place.
And that relates to the onboarding thing, where right now they just aren’t smart enough to be onboarded as full workers. But once they are, I just imagine the kinds of things I try to hire for- it would be such an unlock. The salaries are totally secondary. The fact that I could say “this is the skill, or the set of skills, I need” and have a thousand workers in parallel, if it’s something with high elasticity of demand. I think it’s probably, along with transformative AI itself, the most underrated tangible thing you need to understand about what the future AI society will look like.
Ege Erdil 02:51:22
I think there’s a first point about this very macroeconomic picture, where you just expect a ton of scaling of all the relevant inputs. I think that is the first-order thing. But then you might have more micro questions about, “okay, what does this world actually look like? How is it different from a world in which we just have a lot more people and a lot more capital and a lot more…?” Because it should be different. And then I think these considerations become important.
I think another important thing is just that AIs can be aligned. You get to control the preferences of your AI systems in a way that you don’t really get to control the preferences of your workers. Your workers, you can just select; you don’t really have any other option. But your AIs, you can fine-tune. You can build AI systems which have the kind of preferences that you want. And you can imagine that dramatically changing basic problems that determine the structure of human firms.
For example, the principal agent problem might go away. This is a problem where you as a worker have incentives that are either different from those of your manager, or those of the entire firm, or those of the shareholders of the firm.
Dwarkesh Patel 02:52:29
I actually think the incentives are a smaller piece of the puzzle. It’s more about bandwidth and information sharing where, with a large organization, it’s very hard to have a single coherent vision, and the most successful firms we see today are ones where, for an unusual amount of time, a founder is able to keep their vision instilled in the organization; SpaceX or Tesla are examples of this. People talk about Nvidia this way.
But just imagine a future version where there’s this hyper inference scale mega-Jensen, who you’re spending $100 billion a year on inference, and copies of him are constantly writing every single press release and reviewing every pull request, and answering every customer service request, and so forth, and monitoring the whole organization, making sure it’s proceeding along a coherent vision and getting merged back into the hyper-Jensen, mega-Jensen, whatever.
Ege Erdil 02:53:30
Yeah, I agree that’s a bigger deal. At the same time, I would point out that part of the reason why it’s important to have a coherent vision and culture and so on in human companies might be that incentive problems exist otherwise. I wouldn’t rule that out, but I agree that, aside from the overall macroeconomic thing, I think the fact that they can be replicated is probably the biggest deal.
That also enables additional sources of economies of scale where if you have twice the number of GPUs, you can run not only twice the number of copies of your old model, but then you can train a model that’s even better. So you double your training compute and your inference compute, and that means you don’t get just twice the number of workers you would have had otherwise, you get more than that, because they are also smarter, because you spend more training compute. So that is an additional source of economies of scale.
And then there’s this benefit that, for humans, every human has to learn things from scratch, basically. They are born and then they have a certain amount of lifetime learning that they have to do. So in human learning, there’s a ton of duplication, while for an AI system, it could just learn once. It could just have one huge training run with tons of data. And then that run could be deployed everywhere. So that’s another massive advantage that the AIs have over humans.
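To make that compute arithmetic concrete, here is a toy sketch; it is not from the conversation, and the functional form and every number are assumptions. It just illustrates how doubling a compute budget split between training and inference can yield more than twice the effective AI labor, because the extra training compute also makes each copy better.

```python
# Toy model, all assumptions: model quality grows as a power law in training
# compute, and the number of parallel copies grows linearly in inference compute.

def effective_labor(total_compute, train_share=0.5, quality_exponent=0.2,
                    compute_per_copy=1.0):
    train_compute = total_compute * train_share
    inference_compute = total_compute * (1 - train_share)
    quality = train_compute ** quality_exponent         # smarter model
    num_copies = inference_compute / compute_per_copy   # more copies running
    return quality * num_copies

base = effective_labor(1_000)
doubled = effective_labor(2_000)
print(doubled / base)  # ~2.3 under these assumed numbers, i.e. more than 2x
```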

Will central planning work after AGI?

Dwarkesh Patel 02:54:43
Maybe we’ll close up with this one debate we’ve often had offline, which is: will central planning work with these economies of scale?
Ege Erdil 02:54:52
So I would say that, I mean, again, the question of, “will it work?”
Dwarkesh Patel 02:54:56
Will it be optimal?
Ege Erdil 02:54:58
Right. So my guess is probably not optimal. But I don’t think anyone has thought this question through in a lot of detail.
Tamay Besiroglu 02:55:10
So it is worth thinking about why one might expect central planning to be slightly better in this world. So one consideration is just communication bandwidth being potentially much, much greater than it is today. In the current world, the information gathering and the information processing are co-located; humans observe and also process what they observe. In an AI world, you can disaggregate that.
So you can have the sensors and not do much processing, but just collect and then process centrally. And that processing centrally might make sense for a bunch of reasons, and you might get economies of scale from having more GPUs that produce better models, and also be able to think more deeply about what it’s seeing.
Dwarkesh Patel 02:56:06
It’s worth noting that certain things already work like this, for example Tesla FSD. It benefits from data collected at the periphery, from millions of miles of driving, and then the improvements that are made as a result of this are…
Tamay Besiroglu 02:56:19
Centrally directed, it’s coming from HQ being like, “we’re going to push an update”.
And so you do get some of this more centralized…
Dwarkesh Patel 02:56:27
And it can be a much more intelligent form than just whatever gradient averaging they do- I mean, I’m sure it’s more sophisticated than that at Tesla- but it can be a much more deliberate, intelligent update.
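As a rough illustration of what a “gradient averaging” style of fleet update looks like, here is a minimal federated-averaging sketch. Everything in it is an assumption for illustration, not a description of Tesla’s actual pipeline.

```python
# Minimal federated-averaging sketch (illustrative assumptions only):
# edge devices each update a shared model on their local data, then HQ
# averages the results and pushes one global update back out.
import numpy as np

def local_update(weights, local_data, lr=0.01):
    # Each device nudges a toy linear model on its own data.
    X, y = local_data
    grad = X.T @ (X @ weights - y) / len(y)  # least-squares gradient
    return weights - lr * grad

def averaged_update(weights, fleet_data):
    # HQ averages the locally updated weights into one pushed update.
    updated = [local_update(weights, d) for d in fleet_data]
    return np.mean(updated, axis=0)

# Usage: three "cars" with their own driving data (random stand-ins here).
rng = np.random.default_rng(0)
fleet = [(rng.normal(size=(50, 3)), rng.normal(size=50)) for _ in range(3)]
weights = averaged_update(np.zeros(3), fleet)
```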
Tamay Besiroglu 02:56:36
So that’s one reason to expect it. And the other reason, I guess, is that current leaders or CEOs don’t have bigger brains than the workers do. Maybe a little bit…
Dwarkesh Patel 02:56:50
I don’t know if you want to open that…
Tamay Besiroglu 02:56:52
But not by orders of magnitude. And so you could have orders of magnitude more scaling of the size of the models that are doing the planning than the people or the agents or workers doing the actions.
Ege Erdil 02:57:04
And I think a third reason is the incentive thing, where part of the reason you have a market is that it gives people the right kind of incentives. But you might not need that as much if you’re using AI. So I think there’s an argument that if you just list the traditional arguments people have made for why central planning doesn’t work, then you might expect them to become weaker.
Now, I think there’s a danger when you’re doing that kind of analysis to fall into the same kind of partial equilibrium analysis where you’re only considering some factors and then you’re not considering other things. For example…
Tamay Besiroglu 02:57:43
Things get more complex; you just have a much bigger economy. So on the one hand, your ability to collect information and process it improves, but the need for doing that also increases as things become more complex.
Dwarkesh Patel 02:57:59
And one way to illustrate that is: imagine if Apple, the organization today, with all its compute and whatever, was tasked with managing the economy of Uruk. I think it actually could centrally plan the economy. The economy of Uruk might work even better as a result. But Apple as it exists today cannot manage the world economy as it exists today.
Ege Erdil 02:58:18
That’s right. Yeah.

Career advice

Dwarkesh Patel 02:58:20
All right, actually this will be the final question: One of the things that makes AI so fascinating is that there is no domain of human knowledge that is irrelevant to studying it, because what we’re really trying to…
Tamay Besiroglu 02:58:33
I don’t know about that.
Dwarkesh Patel 02:58:36
There’s no serious domain of human knowledge…
Tamay Besiroglu 02:58:40
That’s better.
Dwarkesh Patel 02:58:42
…that is not relevant to studying it, because you’re just fundamentally trying to figure out what a future society will look like. And so obviously computer science is relevant, but also economics- as we’ve been discussing- history, and how to understand history, and many other things we’ve been discussing.
Especially if you have longer timelines and there is enough time for somebody to pursue a meaningful career here, what would you recommend to somebody? Because both of you are quite young. I mean, you especially Ege, but both of you. You would think this is the kind of thing which requires crystallized intelligence or whatever, especially given what we said earlier about… Look, as we get more knowledge, we’re going to have to factor what we’re learning into building a better model of what’s going to happen to the world. And if somebody is interested in this kind of career that you both have, what advice do you have for them?
Ege Erdil 02:59:27
Yeah, that’s a hard question. I mean, I’m not sure. I think there is an extent to which it’s difficult to deliberately pursue the implicit strategy that we would have pursued. It probably works better if it’s spontaneous and more driven by curiosity and interest than: you make a deliberate choice, “okay, I’m just going to learn about a bunch of things so that I can contribute to the discourse on AI”. I would think that strategy is probably less effective. At least I haven’t seen anyone who deliberately used that strategy and then was successful, it seems like.
Dwarkesh Patel 03:00:05
Yeah, I guess not that I’ve contributed to discourse directly, but maybe facilitated other people contributing. I guess it wasn’t a deliberate strategy on my end, but it was a deliberate strategy to do the podcast, which inadvertently gave me the opportunity to learn about multiple fields.
Tamay Besiroglu 03:00:20
Yeah, so given that you’re already interested and curious and reading a bunch of things, and studying a bunch of things, and thinking about these topics, on the margin there are a bunch of things you can do to make you more productive at making some contributions to this.
And I think just speaking to people and writing your thoughts down and finding especially useful people to chat with and collaborate with, I think that’s very useful. So just seek out people that have similar views and you’re able to have very high bandwidth conversations with and make progress on these topics. And I think that’s just pretty useful.
Dwarkesh Patel 03:01:00
But how exactly? Like should they DM you? Like how do they get in?
Ege Erdil 03:01:05
Yeah, sure.
Tamay Besiroglu 03:01:06
And, I don’t know, set up Signal chats with your friends or whatever.
Dwarkesh Patel 03:01:10
Actually, it’s crazy how much alpha I’ve gotten out of that.
Ege Erdil 03:01:14
But yeah, I think one advice I would give to people in general, even if they are not thinking about AI specifically, but I think it’s also helpful for that, is people should be much more aggressive about reaching out. People have an impression that if you reach out to someone who looks really important, they’re not going to respond to you. But if what you send to them is interesting and high quality, then it’s very, very likely that they will respond.
There’s like a lot more edge there that you can get, which is just being more aggressive and less ashamed of looking dumb. That’s the main advice I would give. Because if you want to be productive, then again, there are these complementarities and so you need to be part of some community or some organization.
Dwarkesh Patel 03:02:02
And it goes back to the thing about reasoning alone not being that helpful.
Ege Erdil 03:02:05
Yeah, yeah, yeah.
Dwarkesh Patel 03:02:06
It’s just like other people have thought a long time and have randomly stumbled upon useful ideas that you can take advantage of.
Ege Erdil 03:02:12
That’s right. So you should just try to place yourself in a situation where you can become part of something larger, rather than working on your own; that’s just a more effective way of contributing. And to do that, you have to, well, let people know.
Dwarkesh Patel 03:02:25
That’s right. That’s right. And I think just coming to the Bay Area is especially helpful- for interest in AI in particular.
Ege Erdil 03:02:30
Yeah, going to the Bay Area is nice. Just post, like just writing things and like posting them where people can see them. Just aggressively reaching out to people with interesting comments.
Tamay Besiroglu 03:02:39
Provided your thoughts are interesting and so on.
Dwarkesh Patel 03:02:42
I mean, they probably aren’t. In many cases, I think it’s like, my thoughts still might not be interesting, but people will tolerate my cold emails and will still collaborate with me and so forth.
The other thing I’ve noticed- tell me if this is actually the wrong pattern- with people like you or with Carl Shulman is that, compared to a general person who’s intellectually curious or reading widely, you tend to focus much more on key pieces of literature rather than say, “I’m going to go read the classics or just generally read”. It’s like, “I’m going to put a ton more credence in something like the Romer paper”. And a normal person who’s intellectually curious would not be reading key pieces of literature.
Ege Erdil 03:03:31
Yeah. I think you have to be very mindful of the fact that you have a very limited amount of time, you’re not an AI model. So you have to aggressively prioritize what you’re going to spend your time reading.
Tamay Besiroglu 03:03:44
Even AI models don’t prioritize that heavily. They read Reddit mostly or a large part of their corpuses…
Dwarkesh Patel 03:03:48
Key pieces of empirical literature, at least. At least among you guys. I mean, it might not be the most productive thing in general, but…
Tamay Besiroglu 03:03:54
I think that’s useful. I also think it’s useful to read Twitter. We were having this conversation about how people often say they’re spending too much time reading Twitter and wish they spent more time reading arXiv. But actually, the amount of information per unit time you get reading Twitter is often just much higher, and it’s just much more productive for them to read Twitter.
I think there are key pieces of literature that are important, and I think it’s useful to figure out what people who have spent a lot of time thinking about this find important in their worldview. So in AI, this might be key papers, like the Andy Jones paper about scaling laws for inference, which is a big thing.
And in economics, the Romer paper, or the papers on explaining long-run population growth from Kremer or from David Roodman, and so on. I think if people who you think think very well about this suggest a certain paper and highly recommend it, then you should take that seriously and actually read those papers.
Dwarkesh Patel 03:05:09
And for me, it’s been especially helpful, instead of just skimming a bunch of things, to focus on a key piece of literature. For example, to understand the transformer, there are always the Karpathy lectures, but one resource that was really useful is Anthropic’s original transformer circuits paper. Just spending a day on that paper instead of skimming it, and making a bunch of spaced repetition cards and so forth, was much more useful than just generally reading widely about AI.
Ege Erdil 03:05:42
I think if you want to prioritize things correctly, it’s just much more important here to be, again, part of a community, or to be getting inputs from a community or from people who have thought a lot and have a lot of experience about what is important and what is not.
Dwarkesh Patel 03:05:56
Yeah.
Ege Erdil 03:05:57
This is true even in academic fields. If you want to do math research but you’re not part of a graduate program, you’re not at a university where there are tons of people who do math research all day for many years, then you’re not even going to know: what are the open problems I should be working on? What is reasonable to attack, and what is not? What papers in this field are important and contain important techniques? You’re just going to have no idea. So it’s very important to be plugged into that feed of information somehow.
Dwarkesh Patel 03:06:26
But how did you know all this shit before being plugged in? Because you weren’t talking to anybody in Ankara.
Ege Erdil 03:06:30
You don’t need to talk. The internet is a pretty useful thing in this respect. You don’t necessarily need to talk to people; you can get a lot of benefit from reading. But you just need to identify: who are the people who seem consistently most interesting? And maybe you find one person. And then often that person will know some other people who are interesting. And then you can start tracing the social network.
One example I can give, which I think is actually accurate, is maybe you know about Daniel Ellsberg. So you look for a podcast he appears on. You notice that he’s appeared on the 80,000 Hours podcast, which he has. And then you notice there are some other guests on the 80,000 Hours podcast. So maybe there’s Bryan Caplan, who has also appeared on the podcast. And then maybe Robin Hanson has also appeared on the podcast. And then maybe there are some people those other people know. And then just tracing that kind of social network and figuring out who to listen to like that. I think that can be…
Tamay Besiroglu 03:07:26
And I think you’re doing a very big service to making that possible. I think your selection is often very good.
Dwarkesh Patel 03:07:33
I’m actually curious to hear offline what I got wrong. Well, actually, I think I know the answer to that.
Tamay Besiroglu 03:07:38
And I think that makes it a bunch easier to track who are the people doing the most interesting thinking on various topics.
Dwarkesh Patel 03:07:47
That’s right. Cool. I think that’s a good place to end, with you praising me. Again, I highly recommend people follow Epoch.
There’s a great weekly newsletter, Gradient Updates, which- I mean, people plug newsletters, but this is, I can’t believe this is a thing that comes out on a weekly basis.
And you now have a new podcast, which I will not plug as a competitor, but you can check it out.
Tamay Besiroglu 03:08:18
Thanks for lending your studio.
Ege Erdil 03:08:20
Yeah, that’s very generous.
Dwarkesh Patel 03:08:24
Anyways, cool. Thanks, guys.