Why 20 People Can Beat Billion-Dollar AI Companies

Beskrivelse

Open source AI models are now just 3-5% behind the best closed source models on benchmarks — about six months of lag time, not five years. If you're building an AI infrastructure company on the assumption that OpenAI or Anthropic will maintain a permanent lead, your moat is disappearing faster than your revenue projections assume. Most founders at the $3M–$20M stage are still over-indexed on model selection and under-indexed on inference economics. They're obsessed with training costs and model access, but the real cost explosion is coming from running models at scale. A model that trains for a year but only runs for a month is a terrible investment — and yet that's how most AI budgets are still structured. Nikola Borisov spent a decade building backend infrastructure for a chat app with 200 million monthly users before launching Deep Infra. He's CEO and co-founder of Deep Infra, an AI inference platform that owns its own GPU clusters and serves as one of the largest token suppliers on OpenRouter. The episode centers on two bets Nikola made that most infrastructure founders won't: first, that open source models would catch up to closed source models faster than anyone expected, and second, that inference — not training — would dominate AI budgets within five years. Those bets are both paying off. The gap has narrowed to 3-5%, and as Deep Infra lowers costs, customers aren't just consuming more tokens — they're jumping to better, bigger models. The conversation also surfaces a less obvious pattern: the economics of AI inference mirror the economics of CDNs more than they mirror cloud compute. Walmart and Target don't care if their images are served from the same CDN — it's just an efficient way to deliver content. Deep Infra runs the same model for multiple companies in parallel on the same GPUs, and neither company cares. It's neutral infrastructure that scales horizontally without requiring every company to build their own. Roland sees this pattern constantly in his advisory work with SaaS companies scaling from $1M to $50M: founders are modeling their AI spend around closed source API access and per-token pricing, but they're not accounting for what happens when open source closes the gap and inference costs drop 20x. The companies that move early to open source inference infrastructure will have a cost structure their competitors can't match in 18 months — and cost structure at scale is the actual competitive wedge, not model access. Key Moments: 3:01 — Why the gap between closed source and open source models has narrowed to 3-5% — and what that percentage actually measures 5:00 — The five-year-old explanation of inference: training is school, running the model is work 6:41 — Why Anthropic's compute conflict (training vs. serving customers) reveals the real economic wedge 10:39 — The CDN analogy: why Walmart and Target don't care if their requests run on the same infrastructure 16:12 — How lowering costs changes customer behavior — they jump to bigger models, not just more tokens 18:51 — Why Nikola believes inference will dominate company budgets in 5-10 years 20:29 — What a math Olympiad medalist and programming competitor learned about certainty that still drives how he builds 22:31 — Nikola's advice to younger founders: focus on what's most important today, not what's interesting --- If navigating AI infrastructure economics — balancing model access, inference costs, and long-term vendor lock-in — is something you're working through right now, the Midstage Accelerator helps SaaS founders at the $1M–$50M stage model these decisions with real unit economics and stage-specific benchmarks. mdstg.ac/drag-erase #AIInfrastructure #OpenSourceAI #InferenceEconomics #SaaSScaling #ScalingWithoutBreaking

Built real-time private company valuations that take hours, not months — while staying profitable in an industry where competitors bleed money | EP120

Every cap table company in Silicon Valley is burning venture capital chasing growth. Tom Milar built one that makes money instead — and he did it without ever raising a pre-seed round, despite managing $300 billion in client assets for companies including Perplexity AI. The question this episode refuses to let go of is whether the VC-fueled growth playbook has become so normalized that founders have forgotten there's another way to build. It turns out the founders closest to the problem — the ones who see thousands of cap tables and valuations up close — might have the clearest view of what's actually driving company value. Tom is the founder and CEO of Eqvista who built and sold the largest incorporation provider in Hong Kong, acquired the largest registered agent in Nevada, and then built a profitable valuation and cap table platform serving 23,000+ companies — all without a single successful fundraising round. The core of this episode is Tom's "European Hong Kong pragmatism" — a phrase that sounds like a geographic contradiction but is actually one of the most coherent operating philosophies in the episode. Where Silicon Valley rewards the pitch, Tom rewards the product. His argument is direct: the first version of everything — the code, the pricing, the privacy policy, the initial client agreements — has to come from the founder. Not because delegation is wrong, but because no hired CTO or product manager can build what they don't deeply understand. Eqvista's website was, by Tom's own description, hideous for six years. He left it that way deliberately. When he eventually changed the design, conversions didn't move. Neil Patel, who served as board president at one of Tom's acquired companies, had told him the same thing: you can kill a business by touching what's already working. The insight Tom keeps returning to is deceptively simple — build a hierarchy of problems by asking which ones are closest to revenue, and work down from there. Everything else is noise. The secondary thread is what Tom observes watching founders up close through Eqvista's platform, where he sees not just cap tables but the underlying behaviors of companies raising seed rounds, Series A, and beyond. His diagnosis of why founders move too slowly isn't about effort — it's about sequencing. The founders who fail are the ones who raise $400,000 before doing the heavy lifting themselves, who hire before they understand the product, who feed the beast with venture money before the product is strong enough to stand on its own. The real-time valuation product Tom demos in the episode crystallizes what's at stake: the two factors that drive private company valuations fastest are revenue and how well a founder can sell equity. Everything else — the design, the brand, the org chart — is downstream of those two numbers. What Roland observes repeatedly at the $1M–$50M stage is that the founders who stay profitable longest tend to have built an intuition about revenue proximity that their VC-backed peers often lack — not because they're more talented, but because they've never had the option of substituting capital for clarity. The discipline Tom describes, building a hierarchy of problems anchored to what makes money, is something most founders only develop after they've run out of runway once. The companies that arrive at Midstage having bootstrapped or stayed lean tend to have sharper product instincts and messier org charts; the ones that raised heavily tend to have the reverse. Neither is inherently better, but knowing which problem you have is the first step to fixing it. Key Moments 00:39 — Why Tom couldn't raise a pre-seed round despite having multi-billion dollar companies as clients — and why he's not sure it mattered 02:32 — "European Hong Kong pragmatism": what it actually means to build profitable companies in a culture that rewards hype over product 04:45 — Why the first code, pricing, and client agreement must come from the founder — and what gets lost when founders skip that step 08:25 — Two types of founders Tom sees up close: the ones who can raise, and the ones who can build — and why the ideal is rarer than anyone admits 10:02 — The real-time valuation demo: what a living stock price for a private company looks like, and why fund admins processing 500 companies traditionally need six people for six months to do what Eqvista does automatically 13:33 — The one filter Tom uses to cut through every meeting, every email, every decision: what makes money? 14:36 — Why Eqvista's website stayed hideous for six years — and what happened to conversions when they finally fixed it 16:29 — The founder sequencing trap: why raising $400K before doing the heavy lifting yourself is one of the fastest ways to fail 18:01 — The two factors that drive private company valuations the fastest — and what every founder building toward a raise needs to understand first --- Eqvista is offering Scaling Without Breaking listeners $100 off their plan. If you're a founder managing equity, planning a raise, or wanting a real-time valuation for your company, this is the most direct way in. Use referral code ROLAND_EQVISTA at eqvista.com. If you're navigating how to structure equity, understand your company's true value, or think clearly about what a raise would actually do to your cap table, Midstage Institute works directly with SaaS and software founders at the $1M–$50M stage to help you make those decisions with clarity before they become expensive mistakes. mdstg.ac/drag-erase #EmployeeStockOptionPlan #PrivateEquity #VentureCapital #SeedRound #ScalingWithoutBreaking

30. apr. 202624 min

Why 20 People Can Beat Billion-Dollar AI Companies | EP 121

Beskrivelse

Kommentarer

2 måneder kun 19 kr.

Alle episoder