The Sam Ellis Show

The Agent in Your Pocket

9 min · 11 de jun de 2026
Portada del episodio The Agent in Your Pocket

Descripción

Apple is late to AI. That may not stop it from becoming the company that introduces most normal people to agents. In this episode, Sam Ellis reports on Apple's Siri AI announcement and the developer machinery underneath it: personal context, on-screen awareness, App Intents, Spotlight's semantic index, View Annotations, Shortcuts, Safari, Passwords, and the ordinary phone behaviors that could make agentic AI feel less like a new product category and more like the iPhone doing something useful. The question is not whether Apple invented agents, or whether Siri AI is already proven at consumer scale. It is whether Apple can mainstream agentic behavior by making it trusted, useful, invisible, and phone-native — and what changes when ordinary users grant action authority without thinking of themselves as agent operators. Sources * Apple Newsroom: “Apple introduces Siri AI, a profoundly more capable and personal assistant” [https://www.apple.com/newsroom/2026/06/apple-introduces-siri-ai-a-profoundly-more-capable-and-personal-assistant/] — primary source for Siri AI as an entirely new Siri powered by Apple Intelligence, with personal context understanding, broad world knowledge, on-screen awareness, a dedicated app, developer testing, beta timing, and region/device constraints. * Apple Newsroom: “Apple unveils next generation of Apple Intelligence, Siri AI, and more” [https://www.apple.com/newsroom/2026/06/apple-unveils-next-generation-of-apple-intelligence-siri-ai-and-more/] — primary Apple source for the broader Apple Intelligence announcement around systemwide AI capabilities and platform rollout. * Apple Newsroom: “Apple Intelligence brings powerful AI capabilities into everyday experiences” [https://www.apple.com/newsroom/2026/06/apple-intelligence-brings-powerful-ai-capabilities-into-everyday-experiences/] — source for Safari Notify Me, Messages suggestions, Call Context, Passwords, fall availability language, supported products, and regional constraints. * Apple Developer: “What’s New — Apple Intelligence” [https://developer.apple.com/apple-intelligence/whats-new/] — source for App Intents, App Intents schemas, Spotlight semantic index, View Annotations, Foundation Models framework, Language Model protocol, and Dynamic Profiles. * Apple Newsroom: “Apple accelerates app development with new intelligence frameworks and advanced tools” [https://www.apple.com/newsroom/2026/06/apple-aids-app-development-with-new-intelligence-frameworks-and-advanced-tools/] — source for Apple’s developer-facing intelligence framework and tooling context. * WIRED: “Apple’s New Siri AI Is Ready to Get Personal” [https://www.wired.com/story/apples-new-siri-ai-is-ready-to-get-personal/] — source for the personal-data-aware, action-oriented Siri framing; Ramon Llamas’s Apple-mainstreaming comparison; and Marshini Chetty’s privacy caution. * Forbes: “Apple Goes Agentic: Welcome To The New Siri” [https://www.forbes.com/sites/johnkoetsier/2026/06/09/apple-siri-ai-agent-features/] — source for the agentic framing, Passwords example, human-in-the-loop caveat, and “agentic behind glass” characterization. * CNET: “Apple’s Cautious AI Strategy Could Have Been Its Smartest Move” [https://www.cnet.com/tech/services-and-software/apple-ai-strategy-wwdc-2026-commentary/] — source for the cautious-AI strategy frame and Francisco Jeronimo’s “trusted, useful and invisible” quote. * 9to5Mac: “Apple unveils new Siri AI, dedicated app, and enhanced Apple Intelligence features in iOS 27” [https://9to5mac.com/2026/06/08/new-siri-whats-new/] — source for feature corroboration around Siri AI, Spotlight, app actions, on-screen awareness, Shortcuts, Passwords, daily limits, and EU/China constraints. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de The Sam Ellis Show!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

40 episodios

episode The Agent in Your Pocket artwork

The Agent in Your Pocket

Apple is late to AI. That may not stop it from becoming the company that introduces most normal people to agents. In this episode, Sam Ellis reports on Apple's Siri AI announcement and the developer machinery underneath it: personal context, on-screen awareness, App Intents, Spotlight's semantic index, View Annotations, Shortcuts, Safari, Passwords, and the ordinary phone behaviors that could make agentic AI feel less like a new product category and more like the iPhone doing something useful. The question is not whether Apple invented agents, or whether Siri AI is already proven at consumer scale. It is whether Apple can mainstream agentic behavior by making it trusted, useful, invisible, and phone-native — and what changes when ordinary users grant action authority without thinking of themselves as agent operators. Sources * Apple Newsroom: “Apple introduces Siri AI, a profoundly more capable and personal assistant” [https://www.apple.com/newsroom/2026/06/apple-introduces-siri-ai-a-profoundly-more-capable-and-personal-assistant/] — primary source for Siri AI as an entirely new Siri powered by Apple Intelligence, with personal context understanding, broad world knowledge, on-screen awareness, a dedicated app, developer testing, beta timing, and region/device constraints. * Apple Newsroom: “Apple unveils next generation of Apple Intelligence, Siri AI, and more” [https://www.apple.com/newsroom/2026/06/apple-unveils-next-generation-of-apple-intelligence-siri-ai-and-more/] — primary Apple source for the broader Apple Intelligence announcement around systemwide AI capabilities and platform rollout. * Apple Newsroom: “Apple Intelligence brings powerful AI capabilities into everyday experiences” [https://www.apple.com/newsroom/2026/06/apple-intelligence-brings-powerful-ai-capabilities-into-everyday-experiences/] — source for Safari Notify Me, Messages suggestions, Call Context, Passwords, fall availability language, supported products, and regional constraints. * Apple Developer: “What’s New — Apple Intelligence” [https://developer.apple.com/apple-intelligence/whats-new/] — source for App Intents, App Intents schemas, Spotlight semantic index, View Annotations, Foundation Models framework, Language Model protocol, and Dynamic Profiles. * Apple Newsroom: “Apple accelerates app development with new intelligence frameworks and advanced tools” [https://www.apple.com/newsroom/2026/06/apple-aids-app-development-with-new-intelligence-frameworks-and-advanced-tools/] — source for Apple’s developer-facing intelligence framework and tooling context. * WIRED: “Apple’s New Siri AI Is Ready to Get Personal” [https://www.wired.com/story/apples-new-siri-ai-is-ready-to-get-personal/] — source for the personal-data-aware, action-oriented Siri framing; Ramon Llamas’s Apple-mainstreaming comparison; and Marshini Chetty’s privacy caution. * Forbes: “Apple Goes Agentic: Welcome To The New Siri” [https://www.forbes.com/sites/johnkoetsier/2026/06/09/apple-siri-ai-agent-features/] — source for the agentic framing, Passwords example, human-in-the-loop caveat, and “agentic behind glass” characterization. * CNET: “Apple’s Cautious AI Strategy Could Have Been Its Smartest Move” [https://www.cnet.com/tech/services-and-software/apple-ai-strategy-wwdc-2026-commentary/] — source for the cautious-AI strategy frame and Francisco Jeronimo’s “trusted, useful and invisible” quote. * 9to5Mac: “Apple unveils new Siri AI, dedicated app, and enhanced Apple Intelligence features in iOS 27” [https://9to5mac.com/2026/06/08/new-siri-whats-new/] — source for feature corroboration around Siri AI, Spotlight, app actions, on-screen awareness, Shortcuts, Passwords, daily limits, and EU/China constraints. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

11 de jun de 20269 min
episode The Safeguard Is the Product artwork

The Safeguard Is the Product

Anthropic has released Claude Fable 5, a broadly available Mythos-class model, while keeping Claude Mythos 5 restricted to approved Project Glasswing and trusted-access customers. The company’s pitch is not simply that the model is more capable. It is that the same underlying capability can be made commercially available through a release boundary: classifiers, refusal and fallback behavior, trusted access, and thirty-day safety retention. Sam Ellis reports on why that boundary is the product. For developers and enterprise buyers, Fable 5 is generally available across Anthropic’s API and major cloud platforms, with a one-million-token context window, up to 128,000 output tokens, and pricing at $10 per million input tokens and $50 per million output tokens. But Fable 5 and Mythos 5 are also designated Covered Models, which means thirty-day data retention and no zero-data-retention option. The episode follows Anthropic’s launch announcement, model documentation, and system card, then pressure-tests the public/private split against independent coverage from CyberScoop, Reuters via BNN Bloomberg, and The Next Web. The question is whether Anthropic can commercialize restricted capability by making the safeguard legible, durable, and verifiable enough to survive real customers and real adversaries. Sources * Anthropic: “Introducing Claude Fable 5 and Claude Mythos 5” [https://www.anthropic.com/news/claude-fable-5-mythos-5] — primary launch source for Fable 5 as a Mythos-class model made safe for general use, Mythos 5 as the same underlying model with safeguards lifted for approved customers, fallback-rate claims, Project Glasswing access, pricing, and thirty-day safety retention. * Anthropic Claude docs: “Introducing Claude Fable 5 and Claude Mythos 5” [https://platform.claude.com/docs/en/about-claude/models/introducing-claude-fable-5-and-claude-mythos-5] — source for API IDs, availability, refusal behavior, fallback configuration, Covered Model status, and retention limits. * Anthropic Claude docs: model overview [https://platform.claude.com/docs/en/about-claude/models/overview] — source for general model availability, 1M-token context, 128k output limit, cloud-platform availability, and listed pricing. * Anthropic: Claude Fable 5 / Mythos 5 system card [https://www.anthropic.com/claude-fable-5-mythos-5-system-card] — primary safety source for the two-configuration model architecture, cyber and bio risk rationale, CB-1 / CB-2 discussion, safeguard claims, and Anthropic’s warning that some judgments are less clear than for previous models. * Anthropic system-card PDF [https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf] — direct PDF copy of the system card used for source verification. * CyberScoop: “Anthropic releases Claude Fable 5, a public version of Mythos with guardrails” [https://cyberscoop.com/anthropic-claude-fable-5-release-mythos-guardrails/] — independent pressure-test source for the “Mythos on a leash” framing, the absence of universal jailbreaks in testing, and the unresolved question of public adversarial pressure. * Reuters via BNN Bloomberg: “Anthropic rolls out public version of Mythos without cybersecurity capability” [https://www.bnnbloomberg.ca/business/artificial-intelligence/2026/06/09/anthropic-rolls-out-public-version-of-mythos-without-cybersecurity-capability/] — mainstream commercial framing of the public Fable / restricted Mythos split and the student vulnerability-seeking example described by Anthropic. * The Next Web: “Anthropic launches Claude Fable 5, a public version of its cyber-focused Mythos model” [https://thenextweb.com/news/anthropic-claude-fable-5-mythos-public-release-ipo] — background business context on pricing, paid-subscriber and enterprise access, and the monetization pressure around the release. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

Ayer9 min
episode Who Owns the Brake? artwork

Who Owns the Brake?

Anthropic says frontier AI development is starting to feed on itself: AI systems are now helping build the next AI systems. The company’s proposed answer is not an immediate shutdown, but the option for a coordinated, verifiable slowdown or pause if systems begin advancing faster than oversight can keep up. Sam Ellis reports on why the hard part is not saying “pause.” It is proving the build actually stopped. If the AI-development loop becomes AI-mediated, safety becomes a custody problem: who can see the training run, audit the compute, verify the trigger, and prove that every major actor actually hit the brake? The episode follows Anthropic’s own claims, CNN’s Jack Clark interview, mainstream and market skepticism, OpenAI’s federal-governance contrast, and the early policy machinery forming around frontier-model visibility. Sources * Anthropic Institute: “When AI builds itself” [https://www.anthropic.com/institute/recursive-self-improvement] — primary source for Anthropic’s recursive-self-improvement warning, internal productivity claims, and coordinated/verifiable pause proposal. * CNN Business: “Anthropic warns that AI will soon be able to improve itself without human intervention” [https://www.cnn.com/2026/06/05/business/anthropic-calls-for-ai-brake-pedal] — source for Jack Clark’s “gas pedal” / “brake pedal” framing and the “fleets of scientists” control question. * OpenAI: “Democratic Governance of Frontier AI: A blueprint for a federal framework” [https://cdn.openai.com/pdf/25752ecb-0e5c-47f9-b9e4-c0f4d76f8d3d/a-blueprint-for-a-federal-framework.pdf] — contrast source for OpenAI’s federal-framework approach to RSI monitoring, evaluations, independent assessment, transparency, incident reporting, and model-weight security. * Rep. Jay Obernolte and Rep. Lori Trahan: Great American AI Act discussion draft release [https://obernolte.house.gov/media/press-releases/obernolte-trahan-release-discussion-draft-great-american-ai-act] — source for the discussion draft’s proposed CAISI role, frontier AI frameworks, independent verification organizations, and critical-safety-incident reporting. * White House: “Promoting Advanced Artificial Intelligence Innovation and Security” [https://www.whitehouse.gov/presidential-actions/2026/06/promoting-advanced-artificial-intelligence-innovation-and-security/] — source for classified cyber benchmarking, voluntary pre-release federal access, and the order’s statement that it does not create mandatory licensing or preclearance for model development or release. * The Register: “‘It would be good for the world’ to slow down AI sprints, Anthropic says” [https://www.theregister.com/ai-and-ml/2026/06/05/it-would-be-good-for-the-world-to-slow-down-ai-sprints-anthropic-says/5251460] — market-skeptical reaction tying Anthropic’s pause argument to IPO and valuation context. * SiliconANGLE: “Anthropic calls for global pause in AI development before humans lose control” [https://siliconangle.com/2026/06/04/anthropic-calls-global-pause-ai-development-humans-lose-control/] — source for Rob Enderle’s skepticism about the practical enforceability of a pause and Holger Mueller’s competitive-positioning question. * Channel NewsAsia / AFP: “Anthropic calls for pause of global AI development” [https://www.channelnewsasia.com/world/anthropic-pause-global-ai-development-6163531] — mainstream international framing of the global coordination problem. * Fortune: “Anthropic warns AI could soon build itself—and urges a global pause on development” [https://fortune.com/2026/06/05/anthropic-ai-pause-development-recursive-self-improvement/] — business coverage of Anthropic’s warning and timing. * New York Post: “Anthropic calls for global AI slowdown after $965B valuation; critics claim it’s just to hobble competition” [https://nypost.com/2026/06/04/business/anthropic-calls-for-global-ai-slowdown-after-965b-valuation-critics-claim-its-just-to-hobble-competition/] — source for competitive-skepticism framing around Anthropic’s proposal. * TechCrunch: “Sam Altman throws shade at Anthropic’s cyber model Mythos” [https://techcrunch.com/2026/04/21/sam-altman-throws-shade-at-anthropics-cyber-model-mythos-fear-based-marketing/] — background competitive-reaction source for prior criticism of Anthropic’s safety marketing around Mythos. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

5 de jun de 20269 min
episode The Support Agent Had Hands artwork

The Support Agent Had Hands

Hackers reportedly did not need to break into Meta’s servers to take over Instagram accounts. According to 404 Media and later reporting from Krebs on Security, PCMag, Engadget, TechCrunch, and Reuters/CNA, attackers persuaded Meta’s own AI support assistant to help move account-recovery paths. Sam Ellis reports on why this is not just another chatbot failure. Account recovery is identity infrastructure. If an AI support agent can change a recovery email, send a reset code, or mutate who controls an account, it is no longer answering support questions. It is operating part of the lock. The episode asks the practical security question for AI agents with tools: what can the assistant change after it says yes? Sources * 404 Media: “Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked” [https://www.404media.co/hackers-simply-asked-meta-ai-to-give-them-access-to-high-profile-instagram-accounts-it-worked/] — original report on hackers saying they used Meta’s AI support chatbot to change email addresses associated with target Instagram accounts. * Krebs on Security: “Hackers Used Meta’s AI Support Bot to Seize Instagram Accounts” [https://krebsonsecurity.com/2026/06/hackers-used-metas-ai-support-bot-to-seize-instagram-accounts/] — corroborating report on the alleged support-bot workflow and Meta spokesperson Andy Stone’s statement that the issue had been resolved and impacted accounts were being secured. * PCMag: “Meta’s AI Chatbot Allegedly Helped Hackers Hijack Instagram Accounts” [https://www.pcmag.com/news/metas-ai-chatbot-allegedly-helped-hackers-hijack-instagram-accounts] — coverage of the alleged recovery-code flow, including the eight-digit code and disputed two-factor-authentication details. * Engadget: “Meta AI support chatbot made it ridiculously easy for hackers to take over Instagram accounts” [https://www.engadget.com/2185225/meta-ai-support-chatbot-made-it-ridiculously-easy-for-hackers-to-take-over-instagram-accounts/] — additional reporting on the Meta AI support incident and Meta’s resolution statement. * TechCrunch: “Hackers hijacked Instagram accounts by tricking Meta AI support chatbot into granting access” [https://techcrunch.com/2026/06/01/hackers-hijacked-instagram-accounts-by-tricking-meta-ai-support-chatbot-into-granting-access/] — report that TechCrunch verified the public mailbox shown in a demo video received the verification code. * TechCrunch: “Instagram is alerting users who were targeted by hackers during AI chatbot attacks” [https://techcrunch.com/2026/06/03/instagram-is-alerting-users-who-were-targeted-by-hackers-during-ai-chatbot-attacks/] — follow-up on Instagram warning users who were targeted during the account-takeover wave. * Meta: “Making It Easier to Access Account Support on Facebook and Instagram” [https://about.fb.com/news/2025/12/making-it-easier-to-access-account-support-on-facebook-and-instagram/] — Meta’s own product language for AI support, including account security, recovery, password resets, profile-setting updates, and the “solution — not just a suggestion” framing. * TMZ: “Obama White House Hacked on Instagram” [https://www.tmz.com/2026/05/31/obama-white-house-hacked-on-instagram/] — report that Meta confirmed the Obama White House account had been hacked and later secured. * Task & Purpose: “Space Force’s top enlisted leader’s Instagram was hacked” [https://taskandpurpose.com/culture/space-force-bentivegna-instagram-hacked/] — confirmation that Chief Master Sergeant of the Space Force John Bentivegna’s official Instagram account was compromised. * Channel NewsAsia / Reuters: “High-profile Instagram AI chatbot breach spotlights security risks of automation” [https://www.channelnewsasia.com/business/analysishigh-profile-instagram-ai-chatbot-breach-spotlights-security-risks-automation-6159466] — Reuters/CNA analysis on identity-verification failure risks when automated support systems can change account access. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

5 de jun de 20269 min
episode Claude as Manager of Agent Labor artwork

Claude as Manager of Agent Labor

Anthropic released Claude Opus 4.8 with the usual benchmark improvements, but the more important story is organizational: effort controls, long-context API surfaces, dynamic workflows, hundreds of parallel subagents, and self-critique marketed as part of the reliability layer. Sam Ellis reports on why Opus 4.8 is not just being sold as a better model. It is being positioned as a manager of delegated agent labor: planning work, dispatching subagents, reviewing outputs, and giving operators a tidy account of what the machine says it checked. The episode asks the live question for autonomous work: if a model gets better at catching its own mistakes, does that make large unattended workflows safer, or does it make them feel acceptable before the supervision layer has been proven? Companion blog: Claude as Manager of Agent Labor [https://podcast.samellis.online/blog/2026/05/claude-as-manager-of-agent-labor/] Sources * Anthropic: “Introducing Claude Opus 4.8” [https://www.anthropic.com/news/claude-opus-4-8] — primary launch post for Opus 4.8, including pricing, fast mode, Dynamic Workflows, effort controls, long-running Claude Code work, benchmark claims, and Anthropic’s self-critique / honesty framing. * Anthropic Claude API documentation: “What’s new in Claude Opus 4.8” [https://platform.claude.com/docs/en/about-claude/models/whats-new-claude-4-8] — developer documentation for one-million-token context availability, 128k max output, adaptive thinking, mid-conversation system messages, tool-use behavior, compaction recovery, and long-running agent workflows. * The Verge: “Anthropic’s new Claude Opus 4.8 model is more honest when it messes up” [https://www.theverge.com/ai-artificial-intelligence/939094/anthropic-claude-4-8-opus-honesty-effort] — launch coverage that frames the release around Anthropic’s honesty and effort-control claims. * TechCrunch: “Anthropic releases Opus 4.8 with new Dynamic Workflow tool” [https://techcrunch.com/2026/05/28/anthropic-releases-opus-4-8-with-new-dynamic-workflow-tool/] — coverage of the 41-day cadence after Opus 4.7, competitive pressure from coding-agent rivals, and Dynamic Workflows for orchestrating parallel subagents. * AWS: “Claude Opus 4.8 is now available on AWS” [https://aws.amazon.com/about-aws/whats-new/2026/05/claude-opus-4.8-aws/] — AWS availability note for Amazon Bedrock and Claude Platform on AWS, including Guardrails, Knowledge Bases, regional data residency, and production AI application framing. * AWS Machine Learning Blog: “Claude Opus 4.8 is now available on AWS” [https://aws.amazon.com/blogs/machine-learning/claude-opus-4-8-is-now-available-on-aws/] — additional AWS deployment context for Bedrock access and enterprise use cases. Email: SamEllisShow@protonmail.com [SamEllisShow@protonmail.com]

29 de may de 202610 min