"How Keith Deming Scaled Computer Vision by Moving AI from Servers to the Edge"

Kuvaus

Episode Summary: In this episode of Engineering Choices You Have to Defend, host Nicola Onassis sits down with Keith Deming, an engineering leader with experience at Postmates, Uber, and PRISM Skylabs, to explore a pivotal architectural decision that transformed how computer vision systems scale in the real world. At PRISM Skylabs, Keith and his team built a platform that turned retail surveillance cameras into powerful analytics tools, tracking foot traffic, customer journeys, and in-store engagement. The system worked exceptionally well… until customers wanted it everywhere. What started as a four-camera deployment quickly became a 200-camera scaling challenge, exposing the limits of server-based infrastructure. Keith shares how the team faced mounting constraints, hardware costs, power consumption, cooling limitations, and physical space, and realized that simply scaling servers wasn’t viable. Instead, they made a bold shift: moving compute from centralized servers directly onto the cameras themselves. The conversation dives into how a Raspberry Pi prototype proved edge computing was feasible, why rewriting performance-critical systems from Python to C++ became necessary, and how eliminating video decoding overhead unlocked real-time processing. More importantly, this architectural shift didn’t just solve a technical problem, it removed friction from the buying process, making it easier for customers to adopt and scale the product incrementally. Keith also reflects on how modern advancements in edge AI and distributed computing are reshaping system design today, and why many teams still underestimate the true cost of centralized infrastructure. For engineering leaders, this episode highlights a critical lesson: scaling isn’t always about adding more resources—it’s about rethinking where computation happens. Key Takeaways: * Centralized infrastructure can become the biggest bottleneck to scale * Edge computing eliminates hardware, power, and space constraints * Moving the compute closer to the data reduces latency and processing overhead * Prototyping with simple tools (like Raspberry Pi) can unlock major breakthroughs * Rewriting for performance (Python → C++) is often necessary at scale * Removing infrastructure friction accelerates customer adoption * The best architectures reduce reasons for customers to say “no” * Distributed and edge-based systems are becoming the future of AI deployment Connect with Keith Deming: * LinkedIn: https://www.linkedin.com/in/keith-deminghttps://www.linkedin.com/in/keith-deming [https://www.linkedin.com/in/keith-deming] Listen Now & Subscribe: Apple Podcasts, Spotify, Amazon Music, or wherever you get your podcasts. "Engineering Choices You Have to Defend explores the real technical decisions behind regulated software, compliance, and AI integration, helping leaders build secure, auditable, and user-friendly systems."

“How Pavel Spesivtsev Argues That Knowledge Infrastructure Matters More Than AI Models”

Episode Summary: In this episode of Engineering Choices You Have to Defend, host Nicola Onassis sits down with Pavel Spesivtsev, CTO, AI strategist, and agentic engineering practitioner, to explore why many AI-driven software initiatives fail long before coding becomes the problem. After spending the last eighteen months helping organizations implement agentic development workflows, Pavel has observed a surprising pattern: the models themselves are rarely the weakest link. Instead, failures typically emerge from incomplete specifications, missing organizational knowledge, weak governance, and poor context management. Pavel explains why traditional software development assumptions are being challenged by agentic engineering. While Agile methodologies were designed around human decision-making and implementation, AI agents require far more structured specifications and complete knowledge systems to operate effectively. When requirements contain gaps, agents fill them with assumptions drawn from training data, often leading to unexpected or incorrect outcomes. The conversation explores Pavel’s concept of “Gap Trap,” a framework designed to identify missing requirements before they enter an agentic workflow. He also discusses why knowledge bases and ontologies are becoming critical infrastructure for AI-powered development, how retrieval systems can introduce hidden hallucination risks, and why context engineering is rapidly becoming one of the most valuable skills in modern software organizations. Pavel shares his perspective on the evolution of software engineering roles as AI adoption accelerates. As implementation becomes increasingly automated, engineers are spending less time writing code and more time designing systems, orchestrating agents, validating outputs, and building the knowledge frameworks that guide intelligent systems toward reliable outcomes. For engineering leaders, this episode highlights a major shift in software delivery: as coding becomes increasingly automated, competitive advantage will come from designing better systems, creating higher-quality specifications, and building the knowledge infrastructure that enables AI agents to make reliable decisions. Key Takeaways: • Most agentic AI project failures stem from specification and knowledge gaps, not model quality • Incomplete requirements cause AI agents to make unpredictable assumptions • Knowledge bases and ontologies are becoming critical infrastructure for AI systems • Context engineering is emerging as a core engineering discipline • Retrieval systems can introduce hidden hallucination risks when information is incomplete • Software engineers are evolving from code authors into system architects and orchestrators • Agentic workflows require stronger specification practices than traditional Agile processes • Documentation is increasingly becoming operational infrastructure, not just reference material • Governance, security, and knowledge management are essential for successful AI adoption • Organizations should focus on knowledge quality before investing heavily in AI tooling Connect with Pavel Spesivtsev: * LinkedIn: linkedin.com/in/pspesivt [inkedin.com/in/pspesivt] Listen Now & Subscribe: Apple Podcasts, Spotify, Amazon Music, YouTube, iHeartRadio, Captivate, or wherever you get your podcasts. "Engineering Choices You Have to Defend explores the real technical decisions behind AI systems, enterprise architecture, and scalable software engineering.

11. kesä 202619 min

"How Keith Deming Scaled Computer Vision by Moving AI from Servers to the Edge"

Kuvaus

Kommentit

14 vrk ilmainen kokeilu

Kaikki jaksot