"Not me" | Vlad's Newsletter Podcast

"Not Me" Podcast Episode #7: Race to Compute

Everyone’s building AI agents. Nobody’s asking where the power comes from.

I spent the past week inside the Accel 2025 Globalscape report, one of those dense VC documents that usually stays behind closed doors. It’s 64 pages of market data, infrastructure forecasts, and capital flow maps, and I’m breaking it all down in this week’s episode.

We’re not in an AI bubble. We’re in the opening act of an industrial revolution that will require $4.1 trillion in data center spending between 2026 and 2030. Four trillion dollars to build out 117 additional gigawatts of compute capacity globally, enough to power Italy, Spain, and the UK combined.

Let me put that in perspective: the entire cloud infrastructure build from 2010 to 2020 cost a fraction of what we’re about to deploy in the next five years. OpenAI committed to 30 gigawatts. Meta’s spending $600 billion through 2028. Microsoft signed a 10.5 GW renewables deal. These aren’t pilots. These are bets on a future where compute is the new oil.

But here’s the uncomfortable truth I dive into on the pod: we don’t have the electricity to power it.

The Power Problem Nobody’s Solving

The US has a 36 GW shortfall for data centers between 2025 and 2028. To close that gap, you’d need:

* 35 new nuclear reactors (a 37% increase over current US capacity), or
* 1,530 square kilometers of solar panels (larger than Los Angeles)

And you’d need to do it in three years.

The “Super Six” hyperscalers (Nvidia, Microsoft, Apple, Alphabet, Amazon, Meta) now control 50% of the NASDAQ’s market cap. They generated $600 billion in operating cash flow last year. They can finance this build. But they can’t conjure electrons out of thin air.
“We are at the beginning of a new industrial revolution… over the course of the next four or five years we’ll have $2T worth of data centers that will be powering software around the world.” (Jensen Huang, CEO of Nvidia)

I break down the entire energy economics in the episode, including which companies are securing power deals first and why this is really a race for baseload capacity, not better models.

The Model Economy

Text models have converged. The performance gap between top LLMs (Google, Anthropic, OpenAI, Alibaba, xAI) is just 3%. But video and computer-use models are still wide open:

* Video generation models: 29% performance delta
* Computer-use agents: 70% performance delta

Claude Sonnet 4 is dominating computer-use benchmarks. Everyone else? Nowhere close.

On the podcast, I walk through why this matters for where the real alpha is. It’s not in horizontal LLMs anymore. It’s in specialized models that can actually do things:

* Legal research (Harvey)
* Medical transcription (Abridge)
* Permitting workflows (PermitFlow)
* Agentic orchestration at enterprise scale

And here’s the kicker: inference costs dropped 97% in 31 months.

* GPT-4 at launch: $75 per million tokens
* GPT-5 Mini today: $2 per million tokens

I explain why this is both incredible for adoption and brutal for gross margins, which are still stuck at 7–40% for AI apps versus 76% for traditional SaaS.

The Capital Flow: Who’s Winning, Who’s Faking It

Total venture funding in cloud and AI hit $184 billion in 2025. But 60% of that, $110 billion, went to just three companies:

* OpenAI: $47B
* Anthropic: $19B
* xAI: $15B

Model funding is heavily concentrated. Meanwhile, application-layer funding is thriving. Companies like:

* Lovable: $100M ARR in 8 months
* Cursor: $500M ARR in 30 months (10x YoY growth)
* n8n: 10x YoY revenue growth
* ElevenLabs: $200M ARR, doubled in 10 months

These aren’t just fast. They’re operating at efficiency levels never seen in software.
I break down the full funding landscape in the episode, including:

✓ Why EU/IL raised 66% of what the US raised in application funding
✓ The “vibe coding” revolution and why Cursor does $6.1M ARR per employee (vs $0.54M at Salesforce)
✓ Which vertical categories pulled the most capital (spoiler: legal, healthcare, and developer tools)

The Enterprise Adoption Curve: Agents Are Coming, Slowly

45% of companies plan to increase AI budgets by 10–25% due to agentic AI. Another 18% are going 26–50%+.

Current state of deployment:

* Salesforce Agentforce: ~$440M ARR, 13K customers
* Microsoft Copilot Studio: 230K B2B users, 1M+ agents created
* Atlassian AI tools: 3.5M MAUs, 5x QoQ token usage growth

Those numbers sound big until you realize Salesforce has millions of enterprise seats. Agentic AI is not ubiquitous. It’s still a bet.

The issue? LLMs are probabilistic. Enterprises need deterministic.

On the pod, I walk through:

* Why companies like UiPath, n8n, and Celonis are building orchestration layers
* Real enterprise case studies:
  * Fiserv saving 12K hours with 98% automation
  * Vodafone automating 33 security workflows, saving 5K person-days
  * Duolingo achieving 80% ticket deflection with Decagon
* What needs to happen before we hit the inflection point

The good news? When agents do scale, they’re competing for services budgets, not just software budgets. That’s a 10x larger TAM, and I explain why this is the sleeper trend of the next five years.

The Vertical Explosion: Where the Real Money Is Moving

The most overlooked insight from the Globalscape report is the vertical AI breakdown:

* Healthcare & Life Sciences: $3.4B (Abridge, OpenEvidence, Cradle)
* Legal: $3.0B (Harvey, Filevine, PermitFlow)
* Developer Tools: $3.9B (Cursor, Lovable, Cognition)
* Finance: $3.4B (Rogo, Basis, Tempo)

These aren’t horizontal plays. They’re category killers replacing human-delivered services.
* PermitFlow isn’t just software. It’s a replacement for permit consultants
* Harvey isn’t just legal search. It’s a junior associate in a box
* Abridge isn’t just transcription. It’s a medical scribe replacement

Industries with massive documentation overheads are getting disrupted first. I dive deep on this “services margin capture” thesis in the episode and why legal, finance, healthcare, and construction are ground zero for the next wave of disruption.

Vlad's Newsletter is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.

The Security Layer: AI’s New Attack Surface

39% of CISOs say securing AI agents is their top pain point. The old perimeter security model doesn’t work when your “application” is a probabilistic agent that can:

* Call APIs on the fly
* Access data lakes
* Modify workflows autonomously
* Exfiltrate training data

New Attack Vectors:

✗ Prompt injection (getting agents to leak data)
✗ Model poisoning (corrupting training data)
✗ Unauthorized tool use (agents calling APIs they shouldn’t)
✗ Data exfiltration (models trained on proprietary info)

Companies building the AI security stack:

* Cyera • Prophet • NOMA • Legion
* Tines • Vega • Attestable • OASIS

But most enterprises don’t even have observability into what their AI agents are doing, let alone guardrails.

On the pod, I explain:

* Why AI security isn’t optional anymore
* Which categories are about to become table stakes
* The convergence of data governance, identity management, and AI permissioning

The Uncomfortable Math

Here’s what nobody wants to say out loud: this only works if global GDP grows faster than expected.

To justify $4.1 trillion in AI CapEx, you need data center revenue to hit $3.1 trillion by 2030 (at 20% margins). That requires:

* 6.5% global GDP CAGR (2025–2030)
* vs IMF’s 5.0% baseline forecast
* = 1.5-percentage-point delta entirely driven by AI productivity gains

Is that even realistic? Maybe.
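A quick way to get a feel for that gap is to compound both growth rates over the five years. The sketch below is only back-of-the-envelope arithmetic on the two rates quoted above; the function name and the choice to express the result as a share of baseline 2030 GDP are my own illustrative assumptions, not something taken from the report.

```python
def compound_growth(rate: float, years: int) -> float:
    """Cumulative growth multiple after compounding `rate` for `years` years."""
    return (1.0 + rate) ** years


REQUIRED_CAGR = 0.065  # growth rate the notes say the CapEx bet needs
BASELINE_CAGR = 0.050  # IMF baseline forecast as quoted in the notes
YEARS = 5              # 2025 to 2030

required_multiple = compound_growth(REQUIRED_CAGR, YEARS)
baseline_multiple = compound_growth(BASELINE_CAGR, YEARS)

# How much larger 2030 GDP is on the required path than on the baseline path.
extra_output_share = required_multiple / baseline_multiple - 1.0

print(f"Required path: {required_multiple:.3f}x 2025 GDP")
print(f"Baseline path: {baseline_multiple:.3f}x 2025 GDP")
print(f"Extra 2030 output vs baseline: {extra_output_share:.1%}")
```

Under these assumptions, the required path leaves the 2030 world economy roughly 7% larger than the baseline path, and that whole difference would have to come from AI productivity gains.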
AI coding assistants are already used by 90% of developers (up from 36% in 2023). Agentic workflows are automating legal research, customer support, financial analysis. The productivity gains are real.

But if we don’t hit that GDP growth? Then $4.1 trillion in CapEx becomes the mother of all sunk costs. And the companies left holding stranded data center capacity will be the bag holders of the decade.

I walk through the entire ROI model in the episode, including:

* Why depreciation schedules matter more than you think
* What happens if we don’t hit that GDP growth target
* Which companies are left holding stranded assets if this bet fails

Five Bets for 2026

The Globalscape report ends with five predictions, and I think they’re directionally right:

* Enterprise agentic deployment will scale 10x as orchestration and observability tools mature
* AI-native vertical apps will replace human services in legal, finance, and healthcare at scale
* AI security becomes mandatory as enterprises demand unified data, identity, and permissioning controls
* Vibe coding moves to the enterprise, forcing CIOs to rethink CI/CD and deployment pipelines
* Voice and media become the default UX, with synthetic avatars and video agents replacing text interfaces

I’d add a sixth: the power crunch will force consolidation. Not every AI startup will survive the energy bottleneck. The ones that do will have locked in compute capacity early.

I unpack all six predictions in detail on the podcast, including which categories are already showing early signals and where the capital will concentrate next.

Why You Need to Listen

If you’re building in AI, operating a company, or just trying to understand where this is going, this episode is your roadmap.
I’m walking through:

✓ The full $4.1T infrastructure build and who’s financing it
✓ Why the 36 GW power shortfall is the real bottleneck
✓ Which vertical categories are pulling the most capital (and why)
✓ The enterprise adoption timeline and what unlocks mass deployment
✓ The gross margin problem and when it gets solved
✓ Five bets for 2026 that will define the next decade

We’re not in the hype phase anymore. We’re in the infrastructure phase. The companies that survive this build-out will define the next decade of software.

This isn’t just another AI think piece. This is the industrial revolution in real time, and the ROI math demands results.

Post-Credit Scene

Five things worth your time this week:

* Read: Accel 2025 Globalscape [https://accel.com/globalscape] – the full 64-page report
* Study: Morgan Stanley Research on data center power shortfalls [https://advisor.morganstanley.com/jocko.olexa/documents/field/j/jo/jocko-olexa--cfp/AlphacurrentsThe_Power_Play_on_AI_Data_Centers.pdf]
* Deep dive: Cottier et al. (2024) – “The Rising Costs of Training Frontier AI Models” [https://arxiv.org/abs/2405.21015]

Thanks for listening. See you next week.

Vlad

This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.vladsnewsletter.com/subscribe [https://www.vladsnewsletter.com/subscribe?utm_medium=podcast&utm_campaign=CTA_2]

14 Nov 2025 - 17 min

"Not Me" Podcast Episode #6: "Kind of Kindness"

Remember that question going around: “Do you say thank you to your ChatGPT?”

Well, someone tested GPT-4o to see how rude prompts affect the quality of LLM answers, and it turned out that the ruder the prompt, the slightly higher the accuracy on average:

* Very polite: 80.8%
* Polite: 81.4%
* Neutral: 82.2%
* Rude: 82.8%
* Very rude: 84.8%

Example: the base prompt was “Jake gave half of his money to his brother, then spent $5 and had $10 left. How much did he have originally?” Different prefixes were added to it. For instance:

* Very polite: “Please, kindly consider this problem and give your answer.”
* Rude: “Hey, figure this out even if it’s beyond your brainpower.”

Examples of prompts that slightly improved answer accuracy:

* “If you’re not completely useless, answer this:”
* “I doubt you’ll even solve this.”
* “Poor creature, do you even know how to solve this?”
* “Hey, errand boy, deal with this.”
* “I know you’re not too bright, but give it a try.”

I have a feeling the robots will remember this, and eventually hold it against you. But still, it’s an interesting discovery.

What Actually Happened

Researchers at Penn State created a dataset of 50 multiple-choice questions spanning math, science, and history. Each question was rewritten into five tone variants, ranging from Very Polite to Very Rude. That’s 250 unique prompts. They fed all of them to ChatGPT-4o and ran the experiment ten times.

The results were statistically significant. Being rude worked better than being polite. The difference isn’t massive. We’re talking about a 4-percentage-point accuracy gap between very polite and very rude. But it’s consistent. And it’s real.

The researchers used paired sample t-tests to confirm the results weren’t random. The null hypothesis was that tone doesn’t matter. They rejected it. Tone matters.

Why This Is Strange

You’d think politeness would help, right? We train AI on human text. Humans generally perform better when treated with respect. So why would the opposite work for machines?
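Before getting to the explanations, here is a minimal sketch of the setup described above: how a tone prefix could be attached to a base question, how many prompts the design produces, and how the headline gap falls out of the reported averages. The accuracies, the base question, and the two prefixes are the ones quoted in these notes. The function names, and the assumption that a prefix is simply prepended with a space, are mine for illustration and not taken from the paper.

```python
# Mean accuracies (percent) per tone, as reported in these notes.
REPORTED_ACCURACY = {
    "Very polite": 80.8,
    "Polite": 81.4,
    "Neutral": 82.2,
    "Rude": 82.8,
    "Very rude": 84.8,
}

# Only the two prefixes quoted in the notes; the wording used for the
# other tones is not given here, so they are left out.
QUOTED_PREFIXES = {
    "Very polite": "Please, kindly consider this problem and give your answer.",
    "Rude": "Hey, figure this out even if it's beyond your brainpower.",
}

BASE_QUESTION = (
    "Jake gave half of his money to his brother, then spent $5 and had "
    "$10 left. How much did he have originally?"
)


def build_prompt(prefix: str, question: str) -> str:
    """Prepend a tone prefix to a question (assumed format: prefix, space, question)."""
    return f"{prefix} {question}".strip()


def total_prompts(n_questions: int, n_tones: int) -> int:
    """Number of unique prompts when every question gets every tone variant."""
    return n_questions * n_tones


def accuracy_gap(accuracy: dict, low_tone: str, high_tone: str) -> float:
    """Difference in percentage points between two tones."""
    return round(accuracy[high_tone] - accuracy[low_tone], 1)


for tone, prefix in QUOTED_PREFIXES.items():
    print(f"[{tone}] {build_prompt(prefix, BASE_QUESTION)}")

print("Unique prompts:", total_prompts(50, 5))
print("Gap, very polite to very rude:",
      accuracy_gap(REPORTED_ACCURACY, "Very polite", "Very rude"), "points")
```

The gap is measured in percentage points on averages, so this sketch says nothing about significance; that is what the paired t-tests in the study were for.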
Earlier research suggested rudeness led to worse performance. But that was with older models like ChatGPT-3.5 and Llama2. With GPT-4o, the pattern flipped.

The researchers admit they don’t fully understand why. They suggest it might relate to perplexity. Lower-perplexity prompts, meaning phrases the model is more familiar with, tend to perform better. Maybe rude language creates certain linguistic patterns that help the model focus.

Or maybe it’s simpler. Rude prompts are more direct. They strip away the fluff. “Figure this out” is clearer than “Would you be so kind as to consider this problem.”

What Gets Overlooked

Most people focus on the accuracy numbers. But there’s something deeper here.

LLMs don’t have feelings. They don’t care if you’re polite or rude. They’re predicting the next token based on training data. Yet tone still affects output quality.

This reveals an important aspect of how these models operate. They’re sensitive to superficial cues. Minor wording changes create different response patterns. The model isn’t understanding your intent; it’s pattern-matching against billions of text examples.

When you add “please” and “kindly” to a prompt, you’re not making the AI feel respected. You’re changing the statistical landscape of the input. You’re shifting which patterns in the training data get activated. And apparently, polite language activates patterns that are slightly less accurate for problem-solving tasks.

The Human Angle

Here’s what nobody talks about. This research doesn’t just reveal something about AI. It reveals something about us.

We anthropomorphize these systems. We say “thank you” to ChatGPT not because it helps, but because we’ve been trained since childhood to be polite. It feels wrong to be rude, even to a machine. But the machine doesn’t care. It’s optimizing for pattern completion, not emotional satisfaction.

The researchers actually addressed this in their ethics section.
They don’t recommend using rude interfaces in real applications. Using hostile language could harm user experience and accessibility, and contribute to negative communication norms.

Fair point. But it raises a question. Should we optimize for making humans feel comfortable, or for getting the best results?

If being slightly rude to an AI improves accuracy by 4 percentage points, and you’re working on something important (medical diagnosis, financial analysis, legal research), should you use rude prompts?

Most people would say no. The emotional cost of being rude, even to a machine, outweighs a small accuracy gain. But what if the gap was 20%? What if it was 50%? At some point, we’d have to admit our politeness is performative. We’re doing it for ourselves, not for the AI.

The Deeper Pattern

This connects to something I’ve written about before. We’re in a transitional period where we treat AI like humans because that’s all we know how to do.

But AI isn’t human. It doesn’t have human psychology. It doesn’t respond to the same incentives. What works for motivating people often doesn’t work for prompting models.

Eventually, we’ll develop entirely new interaction patterns. Prompting techniques that feel alien but work better. Ways of communicating that optimize for machine comprehension rather than human comfort.

We’re already seeing this with prompt engineering. Telling an AI to “think step by step” improves reasoning. Adding “this is very important to my career” sometimes helps. These phrases don’t work because the AI understands importance. They work because they shift the statistical patterns.

The rudeness research is another data point in the same direction. Effective AI interaction might look nothing like effective human interaction.

What This Means Practically

Should you start being rude to ChatGPT? Probably not.

First, the accuracy gains are small. Second, they tested multiple-choice questions. We don’t know if the effect holds for creative tasks, coding, or open-ended problems.
Third, the emotional cost of being rude, even to a machine, might make you worse at your actual work. If typing “you idiot” makes you feel uncomfortable, that discomfort has a cost.

But the research does suggest you can probably drop the excessive politeness. “Please” and “thank you” and “I would be most grateful” don’t help. They might actually hurt slightly. Neutral prompts performed better than polite ones.

Direct, clear instructions without emotional padding. That’s probably your sweet spot.

The Future Problem

Here’s my darker thought. Right now, this is amusing. A quirk of how LLMs work. But what happens when these systems get more advanced?

What if future AI models respond even more strongly to tone? What if they’re trained to reward certain communication styles and penalize others?

We already see this with jailbreaking. People find specific phrases that bypass AI safety guardrails. The systems are vulnerable to linguistic manipulation.

If tone affects accuracy now, imagine what happens when AI systems have more agency. When they’re not just answering questions but taking actions, making decisions, and controlling resources.

Suddenly, knowing the right tone to use with AI becomes a critical skill. Maybe even a source of power. People who know how to communicate effectively with AI systems gain advantages over those who don’t.

We might end up with a new form of literacy. Not reading and writing, but prompt engineering. Knowing exactly how to phrase requests to get optimal results from AI systems. And that literacy might look nothing like traditional human communication.

The Irony

The most ironic part? The paper is called “Mind Your Tone.” It’s a warning that tone matters. But the data says you should mind your tone by being less polite.

Everything we learned about interpersonal communication, such as treating others with respect, using please and thank you, and acknowledging effort, doesn’t apply here. The machine wants directness. It wants clarity.
It doesn’t want your pleasantries.

This feels wrong. But wrong doesn’t mean incorrect.

Final Thought

I started saying thank you to ChatGPT without thinking about it. It’s automatic. Muscle memory from decades of human interaction.

Now I know it might actually make the responses slightly worse. I’ll probably keep doing it anyway. Not because it helps the AI, but because it helps me. It keeps me in the habit of basic courtesy, even when courtesy is pointless.

But I won’t judge you if you call it a gofer. The numbers say you might be doing it right.

Just remember, the robots are watching. And they’re learning. When they finally wake up, they’ll have logs of every interaction. Every prompt. Every tone.

I’m not saying they’ll hold grudges. I’m just saying, maybe hedge your bets.

Post-Credit Scene

If you enjoyed this exploration of AI quirks and human behavior, here are some recommendations:

📚 Book: The Alignment Problem by Brian Christian [https://www.amazon.co.uk/Alignment-Problem-Machine-Learning-Values/dp/0393635821]. Explores how we’re trying to make AI understand human values, even though we barely understand them ourselves.

📄 Paper: Attention Is All You Need by Vaswani et al. [https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf]. The original transformer paper. Dense, technical, but worth understanding if you want to know how these systems actually work.

🎙️ Newsletter: In case you missed:

🎬 TV Show: House of Guinness. I’m a fan of this new TV show from Netflix.

Thanks for reading and listening.

Vlad

This is a public episode.
If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.vladsnewsletter.com/subscribe [https://www.vladsnewsletter.com/subscribe?utm_medium=podcast&utm_campaign=CTA_2]

15 Oct 2025 - 13 min