Agents Of Tech
The current AI boom is really built on the huge scaling of LLMs, but are they're getting it wrong? Alex LeBrun, Co-founder and CEO of Advanced Machine Intelligence (AMI Labs), argues that if AI is going to understand the world, model cause and effect, and reason across time, it may need something different: world models. His new company has raised more than $1 billion to try to prove it. He isn't making a technical argument – it's a challenge to the whole logic of the current AI boom. So today, our hosts Autria Godfrey, Stephen Horn and Laila Rizvi are asking Alex LeBrun whether the industry is over-invested in scaling just one dominant paradigm, and whether world models are a real alternative or just a better theory in a market that may have already chosen its winner. Autria kicks things off by asking Alex where LLMs fall short. According to Alex, “If you want to understand or manipulate the real world, then LLMs are not good…across the board.” He explains that LLMs are good in language-first tasks: everything that is discreet and recognized, like mathematics, coding, and information retrieval. But that’s “only one class of problems. It's not everything in the world.” Alex defines what world models are, how they are trained, and what they are most useful for. You’ll hear about how AMI chairman Yann LeCun and his team developed a concept called JEPA, Joint Embedding Predictive Architecture, which is a way to train world models through self-supervised learning. We explore how AMI is using data-rich video to train their world models – and why the vast majority of the videos on YouTube just won’t cut it as source material. Laila asks about the possibility of scaling LLMs to the point where they can lead to implicit world models emerging. Alex disagrees. He points to diminishing returns on the core progress of LLMs in spite of spending billions of dollars. Instead, he sees world models as complimentary to LLMs the way physicians need real world training in addition to only reading books. He even compares how LLMs and world models can mirror the way the human brain works, with different areas of the brain responsible for different functions while working together in tandem. We dive into the economics of AI, from whether further investment in scaling LLMs makes sense to whether world models will catch on with investors. Alex shares some of the challenges he’s faced competing for compute and raising capital in an investment climate where Anthropic has a $30 billion run rate. When Stephen wonders how world models will fit in with agentic AI in areas like healthcare, Alex points out that agentic models are very brittle and usually break when it comes to long term planning. He says 99% accuracy isn’t enough if it means killing 1 out of every 100 patients. World models build an internal representation of the world, which is more deterministic, and will allow for longer term planning with more accuracy. Finally, it’s time for our Rapid Fire segment, where our hosts ask our guests a series of three questions. Autria asks Alex where people will draw the line with AI in their personal lives; Laila asks him for something that's universally accepted in his field that he disagrees with; and Stephen asks Alex what will happen in the future that people aren't talking about now. In our post interview discussion, Autria. Stephen and Laila discuss whether Alex made the case for including world models in the AI economy, including diverting some of the capital being invested in LLMs into world models. We also want to know what you think. Are world models the new frontier in the world of AI, or is scaling LLMs still the best bet forward for seeing all the potential that AI has to offer? Tell us in the comments.
49 episoder
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af Agents Of Tech-fællesskabet!