AI Extreme Weather and Climate

Ep. 11: Principals of Diffusion Models

17 min · 5 de nov de 2025
Portada del episodio Ep. 11: Principals of Diffusion Models

Descripción

This episode provides a comprehensive monograph on diffusion models, detailing their foundational principles through three unifying perspectives: the Variational View (related to VAEs and DDPMs), the Score-Based View (rooted in EBMs and Score SDEs), and the Flow-Based View (connecting to Normalizing Flows and Flow Matching). The core concept involves defining a continuous forward process that adds noise and then learning a corresponding reverse process—a Stochastic Differential Equation (SDE) or Probability Flow Ordinary Differential Equation (PF-ODE)—to transform noise back into data. Much of the discussion focuses on the mathematical equivalence of these different formulations, the tractable training objectives (like Denoising Score Matching), and advanced techniques for accelerating the slow sampling process, including sophisticated numerical ODE solvers (like DPM-Solver) and distillation methods (such as Consistency Models). Finally, the monograph explores the theoretical connection between diffusion models and Optimal Transport (OT), suggesting that diffusion is related to, but not generally equivalent to, solving the optimal transport problem. Reference: Lai, C. H., Song, Y., Kim, D., Mitsufuji, Y., & Ermon, S. (2025). The Principles of Diffusion Models. arXiv preprint arXiv:2510.21890.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de AI Extreme Weather and Climate!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

14 episodios

episode Target Concept Tuning: Solving the AI Blindspot in Extreme Weather Forecasting artwork

Target Concept Tuning: Solving the AI Blindspot in Extreme Weather Forecasting

In this episode of AI Extreme Weather and Climate, Allen and Sydney explore a major breakthrough in meteorological AI: predicting rare but high-impact events like typhoons. While foundation models like Pangu-Weather excel at everyday forecasts, they often stumble during extreme anomalies due to severe data imbalance. We dive into a newly proposed framework called Target Concept Tuning (TaCT) which acts like a "specialized meteorologist" inside the neural network. By using Sparse Autoencoders to untangle superposed features, TaCT automatically identifies the exact internal concepts that cause the model to fail during extreme weather. It then selectively fine-tunes only those specific concepts, dramatically improving typhoon forecasting accuracy without causing the model to "forget" how to predict normal weather patterns. Tune in to learn how making AI more interpretable is making our early warning systems safer and more reliable! Paper Discussed in this Episode: Ren, S., Gu, X., Peng, Z., Zhang, H., Niu, P., Wu, B., Wang, X., Sun, L., & Wen, J. (2026). Target Concept Tuning Improves Extreme Weather Forecasting. arXiv preprint arXiv:2603.19325. https://doi.org/10.48550/arXiv.2603.19325 [https://doi.org/10.48550/arXiv.2603.19325]

24 de mar de 202618 s
episode NeuralGCM: Observation-Based Hybrid Modeling for Global Precipitation Forecasting artwork

NeuralGCM: Observation-Based Hybrid Modeling for Global Precipitation Forecasting

This paper introduces NeuralGCM, a hybrid atmospheric model that integrates machine learning with traditional differentiable physics to improve global precipitation simulations. Unlike older models that rely on high-resolution simulations for training, this framework is trained directly on satellite observations, specifically the IMERG dataset. By leveraging this observational data, the model effectively corrects common biases in extreme weather events and the diurnal cycle of rainfall. In comparative tests, the model outperformed the ECMWF ensemble in mid-range forecasting and showed superior accuracy over CMIP6 climate models. Additionally, the architecture is exceptionally efficient, running simulations at speeds orders of magnitude faster than conventional general circulation models. These findings suggest that hybrid neural models offer a more reliable and computationally accessible path for predicting future climate impacts.

15 de ene de 202615 min
episode Flow-Matched Neural Operators for Continuous PDE Dynamics artwork

Flow-Matched Neural Operators for Continuous PDE Dynamics

The episode describes the Continuous Flow Operator (CFO), a novel neural framework for learning the continuous-time dynamics of Partial Differential Equations (PDEs), aimed at overcoming limitations found in conventional models like autoregressive schemes and Neural Ordinary Differential Equations (ODEs). CFO's key innovation is the use of a flow matching objective to directly learn the right-hand side of the PDE dynamics, utilizing the analytic velocity derived from spline-based interpolants fit to trajectory data. This approach uniquely allows for training on irregular and subsampled time grids while enabling arbitrary temporal resolution during inference through standard ODE integration. Across four benchmarks (Lorenz, 1D Burgers, 2D diffusion-reaction, and 2D shallow water equations), the quintic CFO variant demonstrates superior long-horizon stability and significant data efficiency, often outperforming autoregressive baselines trained on complete datasets even when trained on only 25% of irregularly sampled data.

9 de dic de 202512 min
episode Ep. 11: Principals of Diffusion Models artwork

Ep. 11: Principals of Diffusion Models

This episode provides a comprehensive monograph on diffusion models, detailing their foundational principles through three unifying perspectives: the Variational View (related to VAEs and DDPMs), the Score-Based View (rooted in EBMs and Score SDEs), and the Flow-Based View (connecting to Normalizing Flows and Flow Matching). The core concept involves defining a continuous forward process that adds noise and then learning a corresponding reverse process—a Stochastic Differential Equation (SDE) or Probability Flow Ordinary Differential Equation (PF-ODE)—to transform noise back into data. Much of the discussion focuses on the mathematical equivalence of these different formulations, the tractable training objectives (like Denoising Score Matching), and advanced techniques for accelerating the slow sampling process, including sophisticated numerical ODE solvers (like DPM-Solver) and distillation methods (such as Consistency Models). Finally, the monograph explores the theoretical connection between diffusion models and Optimal Transport (OT), suggesting that diffusion is related to, but not generally equivalent to, solving the optimal transport problem. Reference: Lai, C. H., Song, Y., Kim, D., Mitsufuji, Y., & Ermon, S. (2025). The Principles of Diffusion Models. arXiv preprint arXiv:2510.21890.

5 de nov de 202517 min
episode Ep 10. RainSeer: Physics-Guided Fine-Grained Rainfall Reconstruction artwork

Ep 10. RainSeer: Physics-Guided Fine-Grained Rainfall Reconstruction

This episode introduces RainSeer, a novel, structure-aware framework for reconstructing high-resolution rainfall fields by treating radar reflectivity as a physically grounded structural prior. The authors argue that existing interpolation methods fail to capture localized extremes and sharp transitions crucial for applications like flood forecasting. RainSeer addresses two main challenges: the spatial resolution mismatch between volumetric radar scans and sparse ground-level station measurements (AWS), and the semantic misalignment caused by microphysical processes like melting and evaporation between the radar's view aloft and the rain that reaches the ground. The framework employs a Structure-to-Point Mapper for spatial alignment and a Geo-Aware Rain Decoder with a Causal Spatiotemporal Attention mechanism to model the physical transformation of hydrometeors during descent, demonstrating significant performance improvements over state-of-the-art baselines on two public datasets.

9 de oct de 202519 min