We're making up AI as we go | Saturdata with Joey Yudelson

46 min · 4 de abr de 2026

Descripción

What happens when you train an evil AI and it just lies really confidently? Joey Yudelson (https://www.linkedin.com/in/joseph-yudelson/ [https://www.linkedin.com/in/joseph-yudelson/]), AI safety researcher at Redwood Research, joins Sam and Shifra to break down why 300 people standing between us and a catastrophic AI future might not be enough, and what data folks can actually do about it. We talk about: - Taxonomies of AI risk (silly vs. not silly, yes this is a real framework) - Why the evil AI wrote 30 paragraphs insisting its buggy code was perfect - Global AI regulation and who's actually doing a good job (hint: it's the EU) - How to use Claude agents like a multiplayer cheat code - Why you personally could make a dent in AI safety research Follow Saturdata, your favorite weekend data podcast Spotify: https://open.spotify.com/show/5QolhKm1jDZzVuHO0S9ZBo?si=910efb23833f4fc1 [https://open.spotify.com/show/5QolhKm1jDZzVuHO0S9ZBo?si=910efb23833f4fc1] LinkedIn: https://www.linkedin.com/company/saturdata [https://www.linkedin.com/company/saturdata] Instagram: @SaturdataPod #Saturdata #AISafety #DataScience #MachineLearning Chapters: 0:00 - Intro 0:57 - Joey's origin story: from high school Yudkowsky reader to full-time AI safety researcher 3:42 - A guided tour of the AI safety landscape 6:14 - Where Joey fits in the puzzle: model organisms and misalignment research 7:28 - The evil AI that wrote 30 paragraphs insisting its buggy code was perfect 10:55 - Deep in the lab vs. everyday AI user: how different are they really? 13:42 - The knowledge lag: why comedians are still calling AI "smart autocomplete" 17:24 - Taxonomies of risk: silly vs. not silly (yes, water use is on the table) 22:02 - Being a responsible AI user: what data folks can actually do 28:32 - How LLMs actually work, explained with a very talented dog named Jeeves 33:11 - Joey's lifelong vendetta against SQL (and how he gets away with it) 36:59 - Three rules for getting real value out of AI agents without losing your mind 42:48 - Why you personally could make a dent in AI safety (and the case for Talmudic AI research) 45:20 - Takeaways and outro

Comentarios

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de Saturdata!

Prueba gratis

Todos los episodios

46 episodios

AI regulation isn't just a tech problem, it's a people problem

It's not all firewalls and technical fixes. Here's why shaping the future of AI comes down to soft skills, cultural awareness, and actually showing up for the conversation. Call your rep, use your voice, because any regulation is better than none 🗣️ #shorts #saturdata #data #AIregulation #AIgovernance #TechPolicy #futureofAI

6 de abr de 202657 s

We're making up AI as we go | Saturdata with Joey Yudelson

4 de abr de 202646 min

Stop coding, start directing: the AI shift you can't ignore

Claude Opus is probably a better coder than you, and Joey isn't sugarcoating it. Instead of writing code line by line, the real move could be writing design docs and letting a fleet of AI agents do the heavy lifting. The biggest mistake is assuming AI will always look the way it does right now #shorts #saturdata #data #AI #claudeai #futureofwork #techtrends

3 de abr de 202655 s

Your AI chatbot is basically a very well-trained dog named Jeeves 🐾

Ever wondered how ChatGPT actually works? Joey breaks it down in the most hilarious way possible, and honestly, we'll never think about reinforcement learning the same way again. Train it right and it fetches you a beer. Train it wrong and it wants to pee on your friend Dave's foot. The science checks out. 🍺🤣 #reels #saturdata #data #AI #machinelearning #ChatGPT #techexplained

2 de abr de 20264 min

How ChatGPT actually works: a dog explains | Saturdata with Joey Yudelson

What if you could explain ChatGPT using only a dog, some audiobooks, and a stick? Joey Yudelson joins Sam and Shifra to break down how large language models actually work, no PhD required. From next-word prediction to reinforcement learning, this one will make you feel like you actually get it. We talk about: - How LLMs learn to "speak human" by predicting the next word - What reinforcement learning actually does (and why your model needs a stick sometimes) - Why RLHF is basically dog training at a bazillion scale - Spurious correlations and how models learn the wrong lessons - What it really means when an AI "has a persona" Follow Saturdata, your favorite weekend data podcast: Spotify: https://open.spotify.com/show/5QolhKm1jDZzVuHO0S9ZBo?si=910efb23833f4fc1 [https://open.spotify.com/show/5QolhKm1jDZzVuHO0S9ZBo?si=910efb23833f4fc1] LinkedIn: https://www.linkedin.com/company/saturdata [https://www.linkedin.com/company/saturdata] Instagram: @SaturdataPod #Saturdata #MachineLearning #ChatGPT #LLM

1 de abr de 20264 min

We're making up AI as we go | Saturdata with Joey Yudelson

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios