Beyond CUDA

Beyond CUDA

Automating GPU Kernels: Mako’s CEO on Crossing the CUDA Moat

26 min · 27 de ago de 2025
portada del episodio Automating GPU Kernels: Mako’s CEO on Crossing the CUDA Moat

Descripción

Waleed Atallah from Mako joins us to discuss automating GPU kernel generation with AI. 03:07 – Why GPU kernels bottleneck progress 04:46 – Three current ways to get kernels 06:02 – LLMs: can they write fast kernels? 07:48 – Mako’s “kernel agent” in action 09:12 – Early adopters & best-fit users 12:05 – Data scarcity limits fine-tuning 16:18 – AI-written kernels by 2030? 20:27 – Top technical risks ahead #BeyondCUDA #GPUkernels #mako 🌐 Connect with us: https://www.linkedin.com/in/jtatarchuk https://www.linkedin.com/in/waleedatallah * (00:00) - C1750 P1315 Waleed Atallah Audio * (26:24) - OUTRO SEQUENCE

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y forma parte de la comunidad de Beyond CUDA!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

9 episodios

episode Why the World Needs AMD for Sovereign AI with Keith Strier @ AMD artwork

Why the World Needs AMD for Sovereign AI with Keith Strier @ AMD

Keith Strier explains the origins of sovereign AI, and why he moved from NVIDIA to AMD. He unpacks the GPU divide, silicon diversity, and why nations are now embracing open source and the ROCm ecosystem. 02:10 – Quitting 17-yr consulting for NVIDIA 04:55 – NVIDIA culture shock: flat & fast 07:32 – EY’s first AI practice (2016) 10:08 – Estonia & Malta AI strategies 12:56 – Launching NVIDIA’s “AI Nations” 14:30 – ChatGPT hits; Jensen pivots 17:05 – GPU land-grab by governments 19:40 – Real bottleneck: data centers 23:55 – Move to AMD to scale AI infra #BeyondCUDA #AIInfrastructure #AMD 🌐 Connect with us: https://www.linkedin.com/in/jtatarchuk https://www.linkedin.com/in/keithstrier/

27 de ago de 202542 min
episode How dstack is Cutting GPU Cloud Costs 3–7x with Smarter Orchestration artwork

How dstack is Cutting GPU Cloud Costs 3–7x with Smarter Orchestration

How we cuts cloud GPU costs 3-7x! Andrey Chepstov - CEO @ dstack (https://dstack.ai) chats with Jeff Tatarchuk on open source AI & redefining infrastructure. #AI #OpenSource #GPUs 00:29 - Intro and journey at JetBrains.   09:02 - How was dstack born?   13:14 - What are the challenges with Kubernetes?   18:36 - What problems does dstack solve?   26:21 - dstack roadmap and features.   30:25 - Open source journey and community building.   42:03 - What does Beyond CUDA mean?   🌐 Connect with us: https://www.linkedin.com/in/jtatarchuk/ https://www.linkedin.com/in/andrey-cheptsov * (00:00) - C2004 Andrey Cheptsov - Audio * (00:24) - Guest Introduction: Andre Cheep * (01:12) - Andre's Early Interest in Tech * (02:52) - Journey at JetBrains * (06:31) - Lessons from JetBrains * (09:14) - The Birth of DST Stack * (13:32) - Challenges with Kubernetes * (18:49) - DST Stack's Solutions and Vision * (24:12) - Challenges with Kubernetes and Market Dynamics * (24:53) - The Importance of Kubernetes in Cloud Services * (25:35) - Competing with Kubernetes: A New Approach * (26:54) - DST Stack Roadmap and Exciting Features * (27:20) - GPU Health Checks: A Game Changer * (30:41) - Open Source Journey and Community Building * (34:36) - Technical Challenges and Ecosystem Dynamics * (36:00) - Cost Efficiency and Surprising Use Cases * (39:50) - Future Plans and Community Involvement * (42:48) - Final Thoughts and Beyond CUDA

27 de ago de 202544 min