The Rise of Cloud Prem: Data Ownership in the Age of AI with Galileo's Sam Dhar
Sam Dhar has spent 14 years building infrastructure at Cisco, Amazon Alexa, and Adobe, and now works as Senior Staff Engineer and AI infrastructure leader at Galileo, the enterprise AI evaluation platform. In this episode of the Smooth Scaling Podcast, Sam walks host Jose Quaresma through Cloud Prem: deploying your full product stack inside the customer's own cloud environment instead of running it as SaaS. They get into why the model is resurging, and it mostly comes down to data. Enterprises want ownership and control, plus a heavy compliance load (SOC 2, HIPAA, fully air-gapped government workloads), and they do not want a vendor sitting in the read path of their most sensitive data. Sam is candid about the hard parts. Cloud Prem can be a losing game on margins, deployment is the slowest thing in the pipeline, and every customer environment is different enough to reset the work. The conversation closes on AI: why it makes Cloud Prem urgent, the brutal GPU shortage, and why self-hosting an Opus-class model is still out of reach for most companies. A direct, practitioner-level look at where enterprise AI infrastructure is actually heading.
Episode page [https://queue-it.com/smooth-scaling-podcast/ep025-cloud-prem-and-ai/]
---
* (00:00) - Intro
* (01:08) - What Cloud Prem actually is
* (06:05) - Why Cloud Prem is resurging now
* (09:37) - Provider, vendor, customer: who owns what
* (11:10) - "Data is paramount": the compliance driver
* (14:29) - Shipping software into someone else's environment
* (19:57) - When Cloud Prem becomes a losing game
* (26:48) - Quality, and the control plane / data plane split
* (28:50) - Monitoring without seeing the customer's data
* (30:52) - Why Sam moved to AI evals
* (34:56) - Self-hosting LLMs and the GPU bottleneck
* (38:01) - Smaller runtimes, frontier-level intelligence
* (41:46) - Why AI makes Cloud Prem urgent
* (46:59) - Rapid fire: the one book to read
* (49:01) - "Business equals scalability"
Satyam “Sam” Dhar is a senior Staff Engineer and AI infrastructure leader at Galileo, where he designs systems that support real-time LLM workflows at enterprise scale. Prior to Galileo, he spent over six years at Adobe, contributing to AI-powered product development, evaluation platforms, and large-scale data systems. Earlier in his career at Amazon, he worked on high-throughput distributed services supporting Alexa’s device orchestration. Based in San Francisco, Sam’s insights and commentary have been featured in Newsweek, CNET, InfoQ, The New Stack, The Deep View, and others. He is also a Senior Member of the Institute of Electrical and Electronics Engineers.
🔗 Connect
Sam Dhar: https://www.linkedin.com/in/satyamdhar/
Host José Quaresma: https://www.linkedin.com/in/jose-quaresma/
This podcast is researched by Joseph Thwaites, produced by Perseu Mandillo, and brought to you by Queue-it, your virtual waiting room partner.
© Queue-it, 2026