The Practical AI Digest

AI Hardware: GPUs, TPUs and Beyond

25 min · 28 apr 2026
aflevering AI Hardware: GPUs, TPUs and Beyond cover

Beschrijving

This episode is all about the specialized hardware that makes modern AI possible. We explain how GPUs became the workhorses of deep learning by offering massive parallelism for matrix math, and how companies like Google went further to build TPUs (Tensor Processing Units) optimized for neural network workloads. You’ll hear about the latest AI chips, from NVIDIA’s powerful GPUs driving large model training, to emerging AI accelerators like Graphcore’s IPU, Cerebras’s wafer-scale engine, and even AI on the edge (Apple’s neural engines, etc.). We discuss what each brings in terms of speed, memory, efficiency, and how they’re deployed, giving a peek into the data centers (and devices) where AI calculations run.

Reacties

0

Wees de eerste die een reactie plaatst

Meld je nu aan en word lid van de The Practical AI Digest community!

Begin hier

2 maanden voor € 1

Daarna € 9,99 / maand · Elk moment opzegbaar.

  • Podcasts die je alleen op Podimo hoort
  • 20 uur luisterboeken / maand
  • Gratis podcasts

Alle afleveringen

20 afleveringen

aflevering LLMOps: Operating Large Language Models in Production artwork

LLMOps: Operating Large Language Models in Production

Building an AI model is one thing: keeping a large language model running reliably in the real world is another. In this episode, we discuss LLMOps, the emerging set of practices and tools for deploying, monitoring, and maintaining large language models (LLMs) in production. We cover challenges unique to LLMs (like handling the huge model sizes, long context lengths, unpredictable outputs, and continuous updates with new data). You’ll learn about techniques for versioning and evaluating LLMs, setting up feedback loops (human or automated) to catch issues like drift or toxicity, and infrastructure like model hubs and the new Model Context Protocol (MCP) that connects LLMs with external tools and data. We tie it together with examples of how companies manage AI like GPT-4 as a service, ensuring it stays efficient, safe, and up-to-date post-deployment.

Gisteren28 min
aflevering TinyML & Edge AI: Machine Learning on Devices artwork

TinyML & Edge AI: Machine Learning on Devices

In this episode, we explore how AI is moving from the cloud to tiny devices. TinyML is the field of optimizing models and algorithms to run on microcontrollers, smartphones, and other edge devices with very limited compute and power. We discuss techniques like model compression, quantization, and architecture search that make models small and efficient enough to fit on a $5 microcontroller, bringing capabilities like wake-word detection, sensor analytics, or even vision tasks directly onto devices. You’ll hear about examples like MCUNet, an MIT system that achieved ImageNet-level vision recognition on a microcontroller, and why on-device AI can be beneficial (low latency, no internet needed, data privacy). We also cover real-world applications already using TinyML, from smart appliances to wearable health monitors.

12 mei 202625 min
aflevering Synthetic Data: Artificial Data for Real Insights artwork

Synthetic Data: Artificial Data for Real Insights

In this episode, we explore how synthetic data is created and used to improve AI models. Synthetic data refers to artificial datasets generated by models (like GANs or language models) that mimic real data. We discuss how this can help in situations with little real data or strict privacy requirements for example, generating realistic medical records to train an AI without exposing any patient’s information. You’ll learn about techniques for producing synthetic images, text, and tabular data, and how they are validated to ensure they reflect real-world patterns. We also cover the benefits and challenges of synthetic data, from reducing bias and augmenting rare cases, to ensuring the synthetic data doesn’t inadvertently leak sensitive info.

14 apr 202630 min