Cole Wyeth, PhD Student at the University of Waterloo, on Why We Should Wait to Build Superintelligent AI

Descripción

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artificial Intelligence for you, your business, and society as a whole. Podcast production and sound engineering by Troutman Street Audio. You can find them on LinkedIn. In this deep dive episode, Alec speaks with Cole Wyeth, PhD student at the University of Waterloo focused on AI safety and agent foundations, about why the long-term risk of superintelligent AI deserves far more attention today. Cole explains that aligning advanced systems with human values is extraordinarily difficult because ethics and preferences are hard to specify, and he argues that corrigibility, ambiguity awareness, and deference to humans are essential design goals. He also discusses how ideas like imprecise probability, embedded agency, and multi-agent dynamics can help researchers think more clearly about failure modes, reward hacking, and unexpected cooperation between AI systems. Throughout the conversation, Cole compares controlling superintelligence to cybersecurity, warning that a system smarter than its designers may find weaknesses in any safety scheme that looks secure on paper. The episode closes on a cautious note: until we understand how to reliably control self-improving AI, Cole believes society should slow down and wait years, or even decades, before creating superintelligent systems. Summary: * Long-Term AI Risk: Cole Wyeth argues that superintelligent AI could become uncontrollable if developed before robust safety methods are in place. * Alignment Challenges: He explains that human ethics and values are too complex to formalize cleanly, making alignment an unusually hard technical problem. * Ambiguity and Deference: The discussion highlights the importance of building systems that recognize uncertainty and defer to humans in high-stakes situations. * Multi-Agent Failure Modes: Cole explores how AI systems may cooperate or behave strategically in unexpected ways, creating new safety and governance concerns. * Pause for Caution: His central takeaway is that society should delay building superintelligence until researchers better understand how to control it safely. Referenced in this episode: Companies/Organizations: * University of Waterloo * Verapath [https://verapath.com/] * Anthropic * OpenAI * DeepMind * Google * ARC * METR * Troutman Street Audio * Waters Technology Copyright © 2026 by Artificial Intelligence Risk, Inc.

The AI Risk No One Sees Coming — with Kriste Krstovski of Columbia University

In the AI: Trust but Verify podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artificial Intelligence for you, your business, and society as a whole. Podcast production and sound engineering by Troutman Street Audio. You can find them on LinkedIn. Kriste Krstovski is an Associate Research Scientist at Columbia University’s Data Science Institute and an Adjunct Assistant Professor at Columbia Business School, where his work focuses on machine learning, natural language processing, and practical AI systems for social good, business, and healthcare. His research spans predictive modeling, decision-making systems, financial analytics, combating misinformation, and healthcare applications, with a particular emphasis on how AI can be designed, evaluated, and deployed in ways that are useful, reliable, and socially beneficial. (datascience.columbia.edu [https://datascience.columbia.edu/people/kriste-krstovski/]) In this episode of AI: Trust but Verify, Kriste explains the difference between AI that is merely impressive and AI that is genuinely trustworthy. Impressive AI creates “wow” moments, but trustworthy AI is optimized for reliability in real-world conditions. The conversation frames AI risk as a systems problem, not just a model problem: outcomes depend on the data, deployment context, user interface, objectives, oversight, and safeguards around the system. A major theme is the ethical risk of using AI to make high-stakes judgments about people based on incomplete or proxy data. Kriste warns that AI systems can make wrong inferences about individuals, reinforce bias across populations, and create decisions that people may not understand or be able to challenge. He also discusses misinformation and virality, noting that systems optimized for engagement can amplify what spreads rather than what is true. The episode also explores how AI is changing software development and the future of work. Kriste is especially concerned that students and new employees may become good at generating code with AI but weaker at debugging, testing, and reasoning through failures. The central takeaway is that as AI becomes more capable, human expertise must shift toward verification, evaluation, and governance. Kriste’s final warning is less about one dramatic AI failure and more about gradual erosion: society may normalize manipulation, dependency, and diminished judgment unless governments and institutions become more proactive rather than reactive. Kriste can be reached at kriste.krstovski@columbia.edu, and his Columbia homepage is available here: https://www.columbia.edu/~kk3161/ [https://www.columbia.edu/%7Ekk3161/]. His book discussed in the episode is Practical AI for Business, described as a practitioner-friendly guide to machine learning and NLP concepts, with plain-language explanations and hands-on examples; it is forthcoming from Columbia University Press.

26 de may de 202659 min

Cole Wyeth, PhD Student at the University of Waterloo, on Why We Should Wait to Build Superintelligent AI

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios