Cover image of show Artificial Intelligence : Papers & Concepts

Artificial Intelligence : Papers & Concepts

Podcast by Dr. Satya Mallick

English

Technology & science

Limited Offer

2 months for 19 kr.

Then 99 kr. / monthCancel anytime.

  • 20 hours of audiobooks / month
  • Podcasts only on Podimo
  • All free podcasts
Get Started

About Artificial Intelligence : Papers & Concepts

This podcast is for AI engineers and researchers. We utilize AI to explain papers and concepts in AI.

All episodes

52 episodes

episode Vision Banana: Rethinking How AI Models See and Generalize artwork

Vision Banana: Rethinking How AI Models See and Generalize

In this episode of Artificial Intelligence: Papers and Concepts, we explore Vision Banana, a concept that challenges how vision models learn and generalize from visual data. Instead of focusing purely on performance metrics, Vision Banana highlights how models can latch onto shortcuts and fail to truly understand the underlying structure of images. We break down why modern vision systems can misinterpret simple variations, how dataset biases influence model behavior, and what this reveals about the gap between recognition and real understanding. If you're interested in computer vision, model robustness, or the limitations of current AI systems, this episode explains why Vision Banana offers an important perspective on building more reliable and generalizable visual intelligence. Resources: Paper Link: https://arxiv.org/pdf/2604.20329v1 [https://arxiv.org/pdf/2604.20329v1] Interested in Computer Vision and AI consulting and product development services? Email us at contact@bigvision.ai or visit us at https://bigvision.ai

23 Apr 2026 - 14 min
episode Position Encoding: How Transformers Understand Order in Data artwork

Position Encoding: How Transformers Understand Order in Data

In this episode of Artificial Intelligence: Papers and Concepts, we explore Position Encoding, a fundamental concept that enables transformer models to understand the order of information. Since transformers process data in parallel rather than sequentially, position encoding provides the missing sense of sequence helping models distinguish between "what came first" and "what comes next." We break down why order matters in language and sequence-based tasks, how different encoding techniques inject positional information into models, and what this means for performance in applications like text generation, translation, and beyond. If you're interested in transformer architecture, sequence modeling, or the building blocks behind modern AI systems, this episode explains why position encoding is essential for making sense of structured data. Interested in Computer Vision and AI consulting and product development services? Email us at contact@bigvision.ai or visit us at https://bigvision.ai [https://bigvision.ai]

22 Apr 2026 - 21 min
episode V-JEPA 2.1: Learning Video Understanding Without Labels artwork

V-JEPA 2.1: Learning Video Understanding Without Labels

In this episode of Artificial Intelligence: Papers and Concepts, we explore V-JEPA 2.1, a next-generation video learning model that shifts away from traditional supervised training. Instead of relying on labeled datasets, the model learns by predicting missing information in a latent space - focusing on understanding motion, structure, and context rather than memorizing frames. We break down how joint-embedding predictive architectures extend into video, why learning from raw temporal data is critical for real-world intelligence, and what this means for building systems that can understand events as they unfold. If you're interested in self-supervised learning, video intelligence, or the future of AI that learns through observation, this episode explains why V-JEPA 2.1 represents a major step toward more general and efficient video understanding. Resources: Paper Link: https://arxiv.org/pdf/2603.14482v2 [https://arxiv.org/pdf/2603.14482v2] Interested in Computer Vision and AI consulting and product development services? Email us at contact@bigvision.ai or visit us at https://bigvision.ai

21 Apr 2026 - 20 min
episode Agentic AI Cost: The Hidden Economics of Autonomous Systems artwork

Agentic AI Cost: The Hidden Economics of Autonomous Systems

In this episode of Artificial Intelligence: Papers and Concepts, we explore Agentic AI Cost, a deep dive into the often-overlooked economics of autonomous AI systems. As AI agents become more capable- planning, reasoning, and executing tasks - the cost of running them goes far beyond a single model call, involving multiple steps, tools, and feedback loops. We break down why agent-based systems can quickly become expensive, how iterative reasoning and tool usage impact compute and latency, and what this means for building scalable AI products. If you're interested in AI agents, cost optimization, or the business realities of deploying autonomous systems, this episode explains why understanding agentic cost structures is critical for the future of practical AI. Interested in Computer Vision and AI consulting and product development services? Email us at contact@bigvision.ai or visit us at https://bigvision.ai

20 Apr 2026 - 18 min
episode ChopGrad: Making Training More Efficient by Cutting Gradient Complexity artwork

ChopGrad: Making Training More Efficient by Cutting Gradient Complexity

In this episode of Artificial Intelligence: Papers and Concepts, we explore ChopGrad, a novel technique aimed at improving the efficiency of training deep learning models by selectively simplifying gradient computations. Instead of processing full gradient updates at every step, ChopGrad strategically reduces complexity helping models train faster while maintaining performance. We break down why gradient computation is one of the most resource-intensive parts of training, how approaches like ChopGrad balance efficiency with accuracy, and what this means for scaling models without proportionally increasing compute costs. If you're interested in optimization techniques, efficient deep learning, or the future of scalable AI training, this episode explains why ChopGrad represents a promising direction in making model training more practical and cost-effective. Resources: Paper Link: https://princeton-computational-imaging.github.io/ChopGrad/ [https://princeton-computational-imaging.github.io/ChopGrad/] Interested in Computer Vision and AI consulting and product development services? Email us at contact@bigvision.ai or visit us at https://bigvision.ai

17 Apr 2026 - 10 min
En fantastisk app med et enormt stort udvalg af spændende podcasts. Podimo formår virkelig at lave godt indhold, der takler de lidt mere svære emner. At der så også er lydbøger oveni til en billig pris, gør at det er blevet min favorit app.
En fantastisk app med et enormt stort udvalg af spændende podcasts. Podimo formår virkelig at lave godt indhold, der takler de lidt mere svære emner. At der så også er lydbøger oveni til en billig pris, gør at det er blevet min favorit app.
Rigtig god tjeneste med gode eksklusive podcasts og derudover et kæmpe udvalg af podcasts og lydbøger. Kan varmt anbefales, om ikke andet så udelukkende pga Dårligdommerne, Klovn podcast, Hakkedrengene og Han duo 😁 👍
Podimo er blevet uundværlig! Til lange bilture, hverdagen, rengøringen og i det hele taget, når man trænger til lidt adspredelse.

Choose your subscription

Most popular

Limited Offer

Premium

20 hours of audiobooks

  • Podcasts only on Podimo

  • No ads in Podimo shows

  • Cancel anytime

2 months for 19 kr.
Then 99 kr. / month

Get Started

Premium Plus

Unlimited audiobooks

  • Podcasts only on Podimo

  • No ads in Podimo shows

  • Cancel anytime

Start 7 days free trial
Then 129 kr. / month

Start for free

Only on Podimo

Popular audiobooks

Get Started

2 months for 19 kr. Then 99 kr. / month. Cancel anytime.