Smooth Scaling: System Design for High Traffic
In this episode, José Quaresma sits down with two Queue-it engineers — Zaigham Sarfaraz, Engineering Manager, and Šimon Bučko, Senior Software Engineer — to talk autoscaling in production. They cover the fundamentals of horizontal and vertical scaling, why stateless architecture matters for scaling out, and what happens when the metrics you're scaling on don't match your actual bottleneck. The conversation gets real when Zaigham shares a war story of autoscaling failing during an iPhone launch — one million users in one second — and how that experience reshaped how the team thinks about pre-scaling for extreme traffic. Šimon challenges the temptation to rely on default configurations and explains why the days you most need autoscaling to work are exactly the days it might not. Episode page [https://www.queue-it.com/smooth-scaling-podcast/ep023-autoscaling-in-production/] --- * (00:00) - Introduction (00:46) - What is autoscaling under the hood? (03:25) - Why scaling down matters too (03:53) - Horizontal vs. vertical scaling (05:43) - When vertical scaling is the better choice (07:56) - Stateful vs. stateless applications (10:42) - Solving state for horizontal scaling (12:14) - The role of load balancers (14:31) - Choosing the right scaling metrics (16:46) - Is serverless the silver bullet? (21:34) - The cost paradox of autoscaling (23:40) - iPhone launch: when the whole world wants to buy a product (25:56) - Why autoscaling isn't enough for non-linear traffic (30:37) - The fallacy of the rule of thumb (32:48) - Rapid fire questions Šimon Bučko is a Senior Software Engineer at Queue-it, working across full-stack development. He is an AWS Certified Solutions Architect Professional with strong experience in software architecture and bridging the gap between business needs and technical execution. Zaigham Sarfaraz is an Engineering Manager at Queue-it with over 15 years of experience across frontend, backend, infrastructure, and people leadership. He is an AWS Certified Cloud Practitioner and plays a key role in ensuring stable system operations while contributing to the continuous improvement of Queue-it's backend architecture. This podcast is hosted by José Quaresma, researched by Joseph Thwaites and produced by Perseu Mandillo. © Queue-it, 2026
25 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de Smooth Scaling: System Design for High Traffic!