David's Saturday AI Thoughts

Choosing is the work

14 min · 9. maj 2026
episode Choosing is the work cover

Description

WHAT HAPPENED THIS WEEK * AI-adopting firms are growing headcount, not cutting it: A Goldman Sachs analysis circulating this week, charted by Callum Williams of The Economist, shows US firms that have adopted AI report net positive employment growth across the past six months. Finance, insurance, arts and entertainment sit at the positive end; transportation and food service show modest decreases; the all-industries balance is positive. Self-reported and narrow-window, but the direction matters. David reads it as Jevons playing out: AI-adopting firms grow because productivity gains expand what their people can do faster than they replace them. The risk is falling behind the firms whose people are using AI to scale. * Five percent, not fifty: the candid private-equity number: Pete Stavros, co-head of global private equity at KKR, told the Milken Institute conference last week that AI is improving portfolio company earnings by about 5%, not the 50% that a revolutionary technology should offer. Five percent across a portfolio of billions is real money. It is also a long way from the scale of growth many hope for. The gap between what feels possible and what the spreadsheets show says something about where the bottleneck actually sits, and these days it isn't the AI. Sets up the essay's punchline that choosing well, not building harder, is what's left. * Both AI labs went into private equity the same day: On Monday, Anthropic announced a $1.5 billion vehicle with Blackstone, Goldman Sachs and Hellman & Friedman. Engineers from Anthropic will embed inside the consortium's mid-market portfolio companies and build custom Claude workflows. The same day, OpenAI finalised a $10 billion joint venture; Google is reportedly in talks to do the same. What was reported as "AI labs raise more money" is actually a category change: the AI labs are turning into distribution-led services firms with a model attached, lifting the forward-deployed engineer pattern from Palantir. David's worry: PE has run the same cut-first playbook for forty years; the bigger unlock from AI is the opposite move, people making sharper decisions with a tool thinking alongside them. That value compounds; what a forward-deployed engineer hard-codes into a workflow does not. WHAT TO TRY * Push it harder, then skillify, then iterate: The simplest workflow upgrade David has coached this year, and it stacks. Push it harder: when the model gives you a perfectly reasonable answer, tell it to do more searches and use credible primary sources, government statistics, peer-reviewed papers. Three extra words, often. The next round does real research and surfaces what the first pass skipped. Skillify it: when a session produces something useful, ask the AI to turn the exchange into a reusable skill, a folder with a description and a prompt any future session can pick up; encode the mistakes as guardrails. Iterate: when the skill misbehaves, just say "change yourself so X" and it rewrites itself in place. Skills become living artefacts, not fixed ones. Works particularly well in Claude where skills are first-class. * Shadow your most AI-pilled employee for two days: Matt Stockton, an operator and investor, made the case this week. Find the rabbit-holed colleague (not the keenest, not the head of digital transformation, the one who already knows what they're doing) and watch how they actually work for two days, then do the same work yourself for a week. "AI lives on command lines right now, it does not live in PowerPoint jargon slides." Most senior leaders consume AI through polished demos and vendor decks, then make strategy decisions with no felt sense of the technology's real constraints. Cheapest fix going. Questions to ask while shadowing: which tools and settings are you using, show me the actual prompt you typed, what did the model get wrong, what did you stop doing manually three months ago that's still in my diary. * Ask AI to build you an HTML slide deck instead of PowerPoint: Just ask. "Build me a slide deck on [topic] in a single HTML file." Most chat tools can do this now. Why bother? PowerPoint output from AI is uneven, with broken chevrons, mis-rendered shapes, and two hours of touch-up. HTML slides come back closer to ready, are infinitely easier to edit (you talk to the file rather than wrestle with a template), and unlock things PowerPoint can't do: an inline edit-mode toggle that turns text into contenteditable, a speaker-view page with notes and a timer, narrate buttons that read slide notes aloud, a Q&A page that answers from the deck content only. Single-file decks travel better than multi-file projects: you can email it, host it on any static page, screen-share it without screen-sharing software. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-05-09]

Comments

0

Be the first to comment

Sign up now and become a member of the David's Saturday AI Thoughts community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

16 episodes

episode The open door artwork

The open door

WHAT HAPPENED THIS WEEK * The CEO of a 350,000-person IT services firm says AI is hollowing out the middle, not the bottom.: Ravi Kumar, chief executive of Cognizant, told Fortune's COO Summit on 1st June that his company hired 20,000 entry-level graduates last year and expects to hire more in 2026, with new 'Frontier Business Operator' and 'Frontier Certified Engineer' roles defining AI-era work. He called job-extinction talk 'fearmongering' and argued AI thins middle management while entry-level and leadership roles persist. It's a direct counter to the consensus that entry-level work vanishes first — including the US Bureau of Labor Statistics data Edition 14 leaned on — and a real-world data point for the essay's bet on graduates. * The machine is writing the code now, and the gains are pooling at the top.: Tobi Lütke says one in eight pull requests merged at Shopify are now written by River, its in-house agent, not an engineer. Anthropic's own engineers ship roughly eight times the code per person they did before 2025. Cursor's developer report shows the output gap widening, with top developers pulling far ahead of the median. And OpenAI's Codex has passed five million weekly users, with non-developer adoption growing three times faster than developer adoption. The grunt of writing code is moving to the machine, the output is multiplying, and the reward is concentrating in the people who know what to ask of it. * Capability is outrunning even the best forecasters.: The Forecasting Research Institute asked expert forecasters and superforecasters how long a task a model would reliably finish by the end of 2026, measured on METR's time-horizon benchmark (about an hour and a half when the survey launched). All three groups put the end-of-2026 figure between three and four hours. Then, while the survey was still running, a frontier model in preview hit three hours and six minutes — already inside the range they'd picked for year-end. The forecast was overtaken before they'd finished making it. Cloudflare's Matthew Prince made the same public miss: bots passed humans in web traffic for the first time, years ahead of his own late-2027 estimate, though much of that is scraping rather than agents, so the figure is softer than it sounds. WHAT TO TRY * Ask the AI to orient itself in your folder before you ask it anything else.: Setting up a Claude Cowork project for a documentary filmmaker new to the tool this week, David's first prompt wasn't about the work. It was: 'Read the files and sub-folders, write yourself a little set of instructions for future chats. Write yourself a navigation guide.' Claude explored, then saved itself three memory files — a project overview, a folder guide, and a profile of the user. Every later chat in that project started smarter because it could re-read its own notes. * Don't say 'always allow' on the verbs you can't undo.: In a coaching conversation about agent permissions this week, the familiar pattern: three prompts in, most people click 'always allow' on everything just to stop the interruption. The discipline is to sort the verbs first. Reading, listing, searching — leave on auto. Deleting, sending, posting, spending — keep asking every time. Sorting the verbs first is what keeps one careless click from emptying a folder, sending an email you can't unsend, or running up a charge you didn't mean. * Talk longer on the call, so the AI can work for hours after.: On a client call this week the brief changed mid-conversation. David deliberately used more words than he otherwise would, narrating the bridge between the old direction and the new one. The over-explaining was really for the AI: context he was laying down for it to pick up later, once the transcript was in the project. The transcript plus 'go' did the work. Next time you're on a call that's being transcribed, elaborate a little more and narrate the why, not just the what. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-06-06]

6. juni 202612 min
episode How We Got Here artwork

How We Got Here

WHAT HAPPENED THIS WEEK * AI can now find software vulnerabilities faster than humans can patch them. Discovery is no longer the hard part; verification is.: A frontier model handed to fifty cybersecurity partners (Anthropic's Glasswing initiative) surfaced more than ten thousand critical or high-severity vulnerabilities in the systems it was pointed at. Cloudflare has roughly four hundred major bugs to work through; Palo Alto Networks shipped five times more patches than its usual release cadence. Maintainers have asked the developers to throttle the discovery rate because there are not enough security professionals to close the gaps before attackers find them. Software security used to be limited by how fast new vulnerabilities could be found. It is now limited by how fast humans can verify, disclose and patch them. * A general-purpose AI model has autonomously disproved a 1946 conjecture in geometry. Independent mathematicians have verified the proof.: OpenAI handed a general-purpose reasoning model a long-held belief tied to a 1946 planar unit-distance problem of Erdos, the prolific Hungarian mathematician, and the model produced a disproof. Other AI models have since solved further long-standing problems; the wrinkle is that the others were purpose-built for mathematics. OpenAI's was not. Machines now clear the tractable tail of problems fast, pushing the human frontier towards what still resists them. After AlphaGo beat the world's best human Go players in 2016, the skill of human Go players noticeably improved. Noam Brown, an OpenAI researcher who helped build its reasoning models, suspects the same pattern will play out in maths. And then, perhaps, in business. * Generative AI use among American adults has hit 58 per cent in four years. The personal computer took sixteen years.: The Federal Reserve's February 2026 Real-Time Population Survey of working-age adults puts overall adoption at 58 per cent, up from around 45 per cent in October 2024 but recently flat. Work use is 44 per cent. Non-work use is 51. Daily use sits at 14 per cent and saves an estimated 2.2 per cent of total work hours. Alfred Lin, a partner at Sequoia Capital, notes this is the penetration level the personal computer took sixteen years to reach: a four-fold acceleration on the closest analogue. The caveat is the plateau. The early-adopter phase is over. The hard part starts. WHAT TO TRY * Ask the model what else it needs to know: The most productive sessions David sees with senior leaders don't open with a clever prompt. They open with the user pasting context — role, situation, what they're trying to do — and then asking two questions. First: "what else do you need to know about me to help me well?" Second: "what could you do for me right now that I haven't asked for?" The first surfaces gaps the user wouldn't have spotted. The second produces use cases the user didn't bring. The blank-prompt paralysis dissolves. * Either AI challenges you at the start, or you challenge AI at the end. Don't skip the challenge.: A leader David sat with this week had let an AI output stand without pushing back. Two patterns work and one fails. You can challenge before the model starts: share your point of view and ask the model to challenge it, force it to surface holes and the strongest counter-argument. Or you can challenge after the first draft: force a rewrite, name what's wrong, make it earn the second pass. The pattern that fails — and the one David sees most — is read-and-accept. * Ramble into the microphone, let the machine find the structure: A managing director David coached this week outlined obsessively: 700 words of outline for a 1,500-word article. The cost of structuring before writing was eating his weekend. The fix was inverting the order. Pick up your phone, dictate the mess, paste the transcript into your model, ask it for the through-line. Structure becomes the cheap thing. Particularly powerful for executives who think by talking — more common at the top than people admit. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-05-30]

30. maj 202613 min
episode Kids these days artwork

Kids these days

WHAT HAPPENED THIS WEEK * AI displacement now shows up in the US government data at both ends of the career ladder: A Bloomberg analysis of new BLS figures finds every one of the eighteen occupations the BLS classifies as AI-exposed has lost jobs over the past year, even as US payrolls grew 0.8% overall. Customer service representatives shed 130,180 jobs, 4.8% in a single year; interpreters down 24% over three years; credit authorizers down 26%. The exception that confirms the rule: medical secretaries up 15.8%, the cluster that needs a body in the room. The same picture shows up at the other end of the funnel. The Economist this month plotted US graduate full-time employment against AI exposure: computer science and information sciences graduates are down 10 to 15 percentage points since 2022; philosophy and psychology graduates held steady or gained. The displacement isn't just to the people already doing those jobs. It's to the people trying to start in them, and what they should be studying may not be obvious to anyone yet — a thread the essay returns to via Elliott's homework. * The UK's data regulator has put AI hiring tools on formal notice. Sixteen organisations have already had a letter: The Information Commissioner's Office issued formal guidance this week saying that AI-driven CV screening, candidate ranking, and video interview analysis without "meaningful human involvement at every consequential stage" may already breach UK data protection law. Sixteen organisations have been written to directly. The consultation closes on 29th May, six days after the edition lands. A concrete Monday-morning task for any leader running a hiring pipeline: get the full list of AI tools in use across the funnel, decide which involvements count as "meaningful" against the ICO's test, and put a response into the consultation. The window is genuinely short. * Salesforce will spend close to $300 million with Anthropic this year. Marc Benioff says the engineering productivity gains made it the easiest line in the budget: Marc Benioff disclosed that Salesforce is on track to spend close to $300 million with Anthropic over 2026, with most of the spend on coding, justified by engineering productivity gains of more than 30%. Separately, Anthropic announced a $200 million partnership with the Gates Foundation focused on global health. A Fortune 100 chief executive treating the model layer as a procurement line item, not a research expense. The bigger question is who in your firm is allowed to commit that kind of capital, against what kind of evidence, and how quickly. WHAT TO TRY * When the output goes wrong, shrink the task: Justin Skycak put it as a principle for skill acquisition this week: shrink the unit of practice until the mistake has nowhere to hide. The same rule applies to working with language models. Sprawling prompts produce sprawling failures you can't diagnose. Break the task into its smallest meaningful unit, run it, inspect the output, then rebuild. If you can't immediately see where it went wrong, your chunk is still too large. * Ask AI questions it can't possibly know the answer to: A marketing lead at a global firm told David this week she's running a five-minute stress-test on every AI tool she's thinking of trusting. She uploads her own data, asks the model to use only that data, then asks it questions she knows the data can't answer. Some models fabricate regardless ("53% of women in the northeast states feel..."). She's learned what its confident-but-wrong mode looks like before depending on it for an answer she can't independently check. Worth doing once on every tool you rely on. * Run your day past AI before you start it: A senior leader described her commute habit to David this week. She opens Claude, asks it to review her calendar and her email, then asks it to surface what she needs to read before each meeting, what's carrying over from yesterday, and which emails in her inbox need replies before the day eats her. Five minutes on the train, and the day is scoped from outside her own head. "Just a nice little daily habit," she said. Try it tomorrow. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-05-23]

23. maj 202611 min
episode What boards accept artwork

What boards accept

WHAT HAPPENED THIS WEEK * The METR capability curve just went from one hour to one day. The unit of AI autonomy is now measured in human work-days, and the doubling holds on a log scale: METR, an AI evaluation lab, measures how long an autonomous task an AI can complete reliably. In early 2024 the answer was minutes. In May 2026, with Anthropic's forthcoming Claude Mythos Preview, it's sixteen hours of work a human would have done. The number isn't the headline; the curve is. From minutes to a day in twenty-four months, on a log scale that had previously held steady. If the next two years look like the last two, the unit becomes a week, then a month, then a year of human work at the press of a button. Plan accordingly. * Half of organisations have already redesigned core workflows around AI, and a fifth have built new business models. The gap framing misses the story: BCG's AI at Work 2025 survey of 10,635 employees across eleven countries reports that 72% of organisations are running generative AI tools, 50% claim to have redesigned end-to-end workflows around them, and 22% claim to have built new business models on top of them. Three and a half years after ChatGPT launched, a fifth of firms claim new business models because of AI. Half have rewired core workflows. Syed Ijlal Hussain, who surfaced the chart on X, framed it as a gap. David flips it: the 22% are doing what most boards he's working with haven't started. * Anthropic just passed OpenAI in US business AI spend. The strategy lesson is older than AI: pick an audience and serve them: Ramp's AI Index, built from anonymised spend data across its US business customers, shows Anthropic taking 34% of paid AI subscriptions in its May release, ahead of OpenAI on 32%. The first crossover. Anthropic's share has roughly quadrupled in a year. David's read: this was inevitable from early on. Anthropic stayed fixated on the enterprise user while OpenAI chased every consumer headline. Slow perseverance against a chosen audience won. The lesson isn't really about AI. Pick an audience. Set your strategy around their needs. Keep your head down and serve the people you said you'd serve. WHAT TO TRY * Hand over the context, not just the question: Experienced leaders have context and are short on time. AI tools convert context into time saved, but only if you hand the context over. A leader David spoke with this week made the leap from asking to delegating. "Here are the Q2 numbers, last quarter's board paper, and the three things the board flagged. Draft it." Same model, same minute. Asking gets you an outline. Delegating gets you a draft. * Build a personal skill, and add a rule to it every Sunday: A person David worked with this week reviews 100-page reports from their team on Sunday nights — typos, inconsistent language, logic gaps. A simple review skill in Claude now catches them. The compounding version is their own preferences layered on top: this market "is expected to grow" not "will grow"; never "dropping precipitously." Every time they catch a miss the model didn't, they open Claude and say "add this rule to my personal review skill." Whatever skill you build, the compounding habit is the same. Catch what the model missed. Add the rule. Trust the skill more next week. * Schedule a daily AI briefing. The use cases will follow: AI tools sit closed until you open them. That's a real reason senior leaders bounce off: not bad prompts, but a tool that requires you to think of the use case first. Scheduled tasks invert this. Every morning at three, David's reads his inbox from the previous day, prepares one-paragraph briefings for every meeting on the day's calendar, and emails a short summary that takes three minutes to read with breakfast. The tool changes from something you open to something that opens your day. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-05-16]

16. maj 202611 min
episode Choosing is the work artwork

Choosing is the work

WHAT HAPPENED THIS WEEK * AI-adopting firms are growing headcount, not cutting it: A Goldman Sachs analysis circulating this week, charted by Callum Williams of The Economist, shows US firms that have adopted AI report net positive employment growth across the past six months. Finance, insurance, arts and entertainment sit at the positive end; transportation and food service show modest decreases; the all-industries balance is positive. Self-reported and narrow-window, but the direction matters. David reads it as Jevons playing out: AI-adopting firms grow because productivity gains expand what their people can do faster than they replace them. The risk is falling behind the firms whose people are using AI to scale. * Five percent, not fifty: the candid private-equity number: Pete Stavros, co-head of global private equity at KKR, told the Milken Institute conference last week that AI is improving portfolio company earnings by about 5%, not the 50% that a revolutionary technology should offer. Five percent across a portfolio of billions is real money. It is also a long way from the scale of growth many hope for. The gap between what feels possible and what the spreadsheets show says something about where the bottleneck actually sits, and these days it isn't the AI. Sets up the essay's punchline that choosing well, not building harder, is what's left. * Both AI labs went into private equity the same day: On Monday, Anthropic announced a $1.5 billion vehicle with Blackstone, Goldman Sachs and Hellman & Friedman. Engineers from Anthropic will embed inside the consortium's mid-market portfolio companies and build custom Claude workflows. The same day, OpenAI finalised a $10 billion joint venture; Google is reportedly in talks to do the same. What was reported as "AI labs raise more money" is actually a category change: the AI labs are turning into distribution-led services firms with a model attached, lifting the forward-deployed engineer pattern from Palantir. David's worry: PE has run the same cut-first playbook for forty years; the bigger unlock from AI is the opposite move, people making sharper decisions with a tool thinking alongside them. That value compounds; what a forward-deployed engineer hard-codes into a workflow does not. WHAT TO TRY * Push it harder, then skillify, then iterate: The simplest workflow upgrade David has coached this year, and it stacks. Push it harder: when the model gives you a perfectly reasonable answer, tell it to do more searches and use credible primary sources, government statistics, peer-reviewed papers. Three extra words, often. The next round does real research and surfaces what the first pass skipped. Skillify it: when a session produces something useful, ask the AI to turn the exchange into a reusable skill, a folder with a description and a prompt any future session can pick up; encode the mistakes as guardrails. Iterate: when the skill misbehaves, just say "change yourself so X" and it rewrites itself in place. Skills become living artefacts, not fixed ones. Works particularly well in Claude where skills are first-class. * Shadow your most AI-pilled employee for two days: Matt Stockton, an operator and investor, made the case this week. Find the rabbit-holed colleague (not the keenest, not the head of digital transformation, the one who already knows what they're doing) and watch how they actually work for two days, then do the same work yourself for a week. "AI lives on command lines right now, it does not live in PowerPoint jargon slides." Most senior leaders consume AI through polished demos and vendor decks, then make strategy decisions with no felt sense of the technology's real constraints. Cheapest fix going. Questions to ask while shadowing: which tools and settings are you using, show me the actual prompt you typed, what did the model get wrong, what did you stop doing manually three months ago that's still in my diary. * Ask AI to build you an HTML slide deck instead of PowerPoint: Just ask. "Build me a slide deck on [topic] in a single HTML file." Most chat tools can do this now. Why bother? PowerPoint output from AI is uneven, with broken chevrons, mis-rendered shapes, and two hours of touch-up. HTML slides come back closer to ready, are infinitely easier to edit (you talk to the file rather than wrestle with a template), and unlock things PowerPoint can't do: an inline edit-mode toggle that turns text into contenteditable, a speaker-view page with notes and a timer, narrate buttons that read slide notes aloud, a Q&A page that answers from the deck content only. Single-file decks travel better than multi-file projects: you can email it, host it on any static page, screen-share it without screen-sharing software. Read the full edition with all links and sources [https://steadman.ai/newsletters/david/#edition-2026-05-09]

9. maj 202614 min