What is vLLM? | Agentic AI Podcast by lowtouch.ai

16 min · 14. feb. 2026

Beskrivelse

In this episode, we introduce vLLM, an open-source library designed to dramatically improve the speed and efficiency of large language model (LLM) inference. We break down how vLLM uses techniques like PagedAttention to optimize memory usage, increase throughput, and reduce latency—making it ideal for serving LLMs in production environments. Whether you're building AI-powered applications or scaling agentic systems, this episode explains why vLLM is becoming a go-to solution for cost-effective, high-performance model deployment.

Kommentarer

Vær den første til at kommentere

Tilmeld dig nu og bliv en del af Agentic AI Podcast-fællesskabet!

Kom i gang

Alle episoder

69 episoder

What is vLLM? | Agentic AI Podcast by lowtouch.ai

14. feb. 202616 min

Stanford 3D Microchip AI Hardware Breaks Barriers | Agentic AI Podcast by lowtouch.ai

In this episode, we explore how 3D AI chips are overcoming the long-standing “memory wall” that limits AI performance. By stacking memory and compute vertically, these next-generation architectures dramatically reduce latency, boost bandwidth, and improve energy efficiency. We break down why traditional chip designs struggle to keep up with modern AI workloads and how 3D integration is unlocking faster training, real-time inference, and scalable agentic systems. Tune in to understand why breakthroughs in hardware, not just software are shaping the next era of AI innovation.

13. feb. 202615 min

Shift from Vibe Coding to Agentic Engineering | Agentic AI Podcast by lowtouch.ai

In this episode, we explore Andrej Karpathy’s evolving vision for software development—from “vibe coding,” where developers collaborate intuitively with AI, to full-scale agentic engineering, where AI agents take on autonomous building, debugging, and optimization tasks. We break down how this shift changes the role of developers—from writing every line of code to orchestrating intelligent systems. Tune in to understand how AI-native workflows are redefining productivity, creativity, and the very craft of software engineering.

12. feb. 202615 min

AI's Role in B2B Marketing | Agentic AI podcast by lowtouch.ai

In this episode, we explore how AI is transforming B2B marketing, from smarter lead scoring and personalized account-based marketing to content generation and performance optimization. We discuss how AI helps marketers move beyond broad campaigns to data-driven, intent-led strategies that align sales and marketing more closely. Tune in to learn how enterprises are using AI not just to automate marketing tasks, but to drive better pipeline quality, higher conversion rates, and measurable revenue impact.

10. feb. 202615 min

What is OpenAI's Frontier | Agentic AI podcast by lowtouch.ai

In this episode, we unpack OpenAI’s “Frontier” , the idea and direction behind the most advanced, high-capability AI systems being developed today. We explore what makes a model “frontier,” how these systems differ from everyday AI tools, and why they raise new questions around safety, governance, and real-world impact. From reasoning and autonomy to enterprise and societal implications, this episode helps you understand why frontier models are shaping the next phase of AI and what that means for businesses preparing for agentic AI.

9. feb. 202611 min

What is vLLM? | Agentic AI Podcast by lowtouch.ai

Beskrivelse

Kommentarer

1 måned kun 9 kr.

Alle episoder