Imagen de portada del programa The Deep Dives

The Deep Dives

Podcast de Rajat Gupta

inglés

Tecnología y ciencia

Empieza 7 días de prueba

$99 / mes después de la prueba.Cancela cuando quieras.

  • 20 horas de audiolibros al mes
  • Podcasts solo en Podimo
  • Podcast gratuitos
Prueba gratis

Acerca de The Deep Dives

Where engineering leadership meets production reality. Hosted by an engineering manager leading Platform, SRE, and Security, The Merge Window dives into the trade-offs, failures, and patterns behind high-scale systems and the teams who run them. Each episode brings practical conversations for engineering leaders, SREs, DevOps pros, and C-level execs navigating the edge of reliability, velocity, and security. Expect deep dives into postmortems, platform strategies, infrastructure evolution, and what it really takes to lead technical teams in a modern org—minus the fluff.

Todos los episodios

18 episodios

episode The Most Important SRE Metrics & How to Track Them artwork

The Most Important SRE Metrics & How to Track Them

Are you constantly caught between your team's desire to ship new features and the C-suite's demand for unwavering stability? In this episode, we demystify Site Reliability Engineering (SRE) metrics and transform them from abstract concepts into a practical management toolkit. Join us as we break down the crucial hierarchy of SLAs, SLOs, and SLIs, and explain why your choice of metrics can make or break your reliability efforts. We'll explore the Four Golden Signals—Latency, Traffic, Errors, and Saturation—that provide a real-time pulse on your user experience. Most importantly, we'll dive deep into the most transformative SRE concept: the error budget. Learn how to use it as a data-driven framework to eliminate subjective debates and empower your team to balance innovation and reliability with confidence. Whether you're just starting your SRE journey or looking to refine your approach, this episode provides actionable advice, real-world case studies from Google and Netflix, and a clear path to fostering a culture of shared reliability.

1 de oct de 2025 - 6 min
episode The Velocity Paradox: Turning Tech Debt into Your Innovation Engine artwork

The Velocity Paradox: Turning Tech Debt into Your Innovation Engine

Is your team constantly firefighting instead of innovating? The culprit is likely technical debt, a silent drag on productivity that costs the industry over $85 billion a year. But what if we've been thinking about it all wrong? In this episode, we dismantle the myth that you have to choose between speed and quality. Drawing on insights from industry leaders like Google, Shopify, and others, we provide a practical playbook for engineering managers, SREs, and security leaders to transform debt management from a costly chore into a strategic advantage. Join us as we cover: * The true cost of unmanaged debt and how to articulate it to business leaders. * Moving beyond "messy code" with Martin Fowler's Technical Debt Quadrant. * How Google's DORA metrics prove that stability and speed are two sides of the same coin. * Actionable strategies you can implement tomorrow, including Shopify's "25% Rule" and the "Boy Scout Rule." * Advanced plays for SRE and Security teams using SLOs, Error Budgets, and "Shift Left" automation. Stop being a debt collector and start being an innovation enabler. This episode shows you how.

29 de sep de 2025 - 6 min
episode Why Your Next Business Strategy Should Be Site Reliability Engineering (SRE) artwork

Why Your Next Business Strategy Should Be Site Reliability Engineering (SRE)

Is your company treating system reliability like a technical chore for the IT department? You could be overlooking a $400 billion blind spot. In today's digital economy, uptime isn't just about "keeping the lights on"; it's the bedrock of customer trust, revenue, and your competitive edge. In this episode, we're moving SRE out of the server room and into the boardroom. We'll dismantle the myth that you have to choose between innovation speed and system stability. Using the core principles of Site Reliability Engineering, we reveal how you can achieve both. Join us as we explore: * The Real Cost of Downtime: We break down the staggering financial and reputational damage of unreliability. * SLOs & Error Budgets: How to use these SRE tools to create a data-driven language that aligns your entire organization, from engineering to the C-suite. * The Leader's Playbook: A practical guide for engineering managers and executives on how to champion, structure, and scale an SRE function that delivers real business ROI, with case studies from finance, e-commerce, and more. This isn't just another technical discussion. This is a strategic conversation about how to build a more resilient, innovative, and profitable business.

8 de sep de 2025 - 6 min
episode Titans of Reliability: Lessons from Google, Netflix, and Meta artwork

Titans of Reliability: Lessons from Google, Netflix, and Meta

In the relentless pursuit of uptime, who do you turn to for inspiration? The titans of tech who operate at a planetary scale. While Google pioneered Site Reliability Engineering (SRE), Netflix mastered proactive resilience with Chaos Engineering, and Meta championed hyper-automation, their principles aren't just for the giants. In this episode, we go beyond the buzzwords and dive deep into the practical, actionable lessons every engineering manager, SRE, and DevOps leader can learn from them. We'll dissect their core philosophies—from error budgets and blameless postmortems to chaos monkeys and self-healing infrastructure. Forget blindly copying their org charts. Join us as we build a pragmatic playbook for synthesizing the best of these approaches to foster a powerful culture of reliability in your own organization. We’ll discuss how to start a "toil hunt," run your first "game day," and use SLOs to transform the conversation between development and operations.

30 de jul de 2025 - 6 min
episode Startup SRE: Building Reliability from Day One artwork

Startup SRE: Building Reliability from Day One

In the chaotic world of startups, the pressure to ship features often pushes reliability to the back burner, a debt to be paid "later." But what if this approach is fundamentally flawed? In this episode, we argue that reliability isn't a brake on development, but the very engine of sustainable growth. Join us for a deep-dive into the pragmatic principles of Site Reliability Engineering, tailored for the resource-constrained reality of a startup. We'll move beyond theory and provide an actionable blueprint for engineering managers, SREs, and founders. You will learn: * The Cultural Foundation: How to implement data-driven tools like Error Budgets and user-centric SLOs to balance speed with stability. * Pragmatic Tech Stacks: Why a "monolith first" approach and a cost-effective, open-source observability stack built on Prometheus, Grafana, and OpenTelemetry are strategic assets that prevent vendor lock-in. * A Phased 6-Month Plan: How to evolve your reliability practices in lockstep with your business—from validating an MVP to surviving hypergrowth. * Learning from Failure: The anatomy of a blameless postmortem and how to create a culture of psychological safety that turns every incident into an investment in resilience. Stop treating reliability as a luxury and start building your most durable competitive advantage.

28 de jul de 2025 - 8 min
Muy buenos Podcasts , entretenido y con historias educativas y divertidas depende de lo que cada uno busque. Yo lo suelo usar en el trabajo ya que estoy muchas horas y necesito cancelar el ruido de al rededor , Auriculares y a disfrutar ..!!
Muy buenos Podcasts , entretenido y con historias educativas y divertidas depende de lo que cada uno busque. Yo lo suelo usar en el trabajo ya que estoy muchas horas y necesito cancelar el ruido de al rededor , Auriculares y a disfrutar ..!!
Fantástica aplicación. Yo solo uso los podcast. Por un precio módico los tienes variados y cada vez más.
Me encanta la app, concentra los mejores podcast y bueno ya era ora de pagarles a todos estos creadores de contenido

Elige tu suscripción

Más populares

Premium

20 horas de audiolibros

  • Podcasts solo en Podimo

  • Disfruta los shows de Podimo sin anuncios

  • Cancela cuando quieras

Empieza 7 días de prueba
Después $99 / mes

Prueba gratis

Sólo en Podimo

Audiolibros populares

Prueba gratis

Empieza 7 días de prueba. $99 / mes después de la prueba. Cancela cuando quieras.