Episode 8: Inside the AI Agent Revolution

Descripción

In this episode, Holly and Ewan explore one of the most hyped (yet deeply misunderstood) topics in AI today: AI agents. Holly opens with the big question: What actually is an AI agent? Ewan explains why definitions vary wildly, but broadly defines an AI agent as any system that can operate independently on your behalf to complete tasks. That could be a coaching assistant, a financial helper, or even a household or education agent. Ewan shares real-world stories, such as trying to buy a dishwasher using ChatGPT Agent Mode [https://openai.com/index/introducing-chatgpt-agent/]... Only to find that Amazon actively blocks agent-based access. When he switched to AO.com [https://ao.com/], the agent succeeded instantly - a perfect illustration of today’s fragmented ecosystem. He also discusses experimenting with agents to manage LinkedIn connection acceptance, with mixed results, highlighting how even simple point-solution tasks can quickly fall apart. The discussion then moves into the wider implications: * Why agents are transformational in theory, but fragile and unreliable today * How browser-based agents actually work using “computer use” screenshot loops * Why traditional RPA (Robotic Process Automation) remains far safer and more predictable * Early signs of agent-powered cyberattacks, referencing the first reported case of agentic hacking [https://www.anthropic.com/news/disrupting-AI-espionage] * The Carnegie Mellon “Agent Company [https://agent.company]” benchmark, which evaluates how well different agents perform real office tasks. With current leaderboards showing DeepSeek’s Matrix agent at ~43%, Google Gemini around 41%, and Claude Sonnet 4 around 33%. The conclusion? The vision is exciting, but today’s agents are nowhere near enterprise-ready. Expect rapid evolution, more experiments, and many more failures as this technology matures. If you've got feedback, we'd love to hear it. We reply to every single message! Find us at ⁠Working On It Podcast⁠ [https://www.workingonitpodcast.com/], or follow our ⁠LinkedIn Page⁠ [https://www.linkedin.com/company/wereworkingonit/]. Or talk to ⁠Holly⁠ [https://www.linkedin.com/in/digitalholly/] or ⁠Ewan⁠ [https://www.linkedin.com/in/ewanmacleod] on LinkedIn.

Episode 18: Drones, Defence and Misinformation

This is a different kind of episode. Holly Joint joins from a region now living under daily missile alerts, and the conversation with Ewan MacLeod turns from the usual workplace-and-AI territory to something far more immediate: how technology shapes life, safety and truth in a conflict zone. It is, by both hosts' admission, a difficult subject, but one they feel matters too much to skip. Holly describes a striking asymmetry in modern warfare. On one side, cheap, low-tech drones crossing overhead several times a day; on the other, a sophisticated, AI-enabled defence system that calculates trajectories, identifies interception points and responds in extraordinarily short windows, always, she stresses, with a human in the loop. Living beneath it, she explains how that technology translates into a genuine sense of safety, and how the household adapts: honest but calm conversations with the children, reframing the frightening boom of an intercept as the sound of a missile stopped and everyone kept safe. A recurring theme is information itself. In wartime, Holly notes, misinformation and propaganda flood WhatsApp groups and social feeds, and one of the smartest uses of technology she's seen is a simple web app that aggregates only official sources, the government media office, ministry of defence, crisis management, into a single trusted place to check rumours against. Alongside this, she points to the quiet rise of low-cost AI therapy tools helping people cope, because living under missiles is not normal, however well one carries on. The episode has its lighter human moments too: Holly's 3am backup-battery purchases during sleepless nights, which turned out more useful against thunderstorms than the war, and Ewan's enthusiasm for Starlink, both as a home backup and, more seriously, as genuinely transformative infrastructure. They touch on its life-or-death role in conflicts like Ukraine and Iran, and how connectivity can boost economies that lack reliable infrastructure. The conversation closes on the hardest question of all: the ethics of AI in warfare. Holly raises Anthropic's decision to restrict how its tools may be used, and the consequence of being removed from a US Department of War supplier list, a move both hosts find genuinely significant. They circle back to a book referenced in an earlier episode, "If Anyone Builds It, Everyone Dies," and to autonomous weapons, computer vision targeting, and the danger of AI's misplaced certainty in contexts where a wrong answer costs lives. Both land firmly in the same place: humans must stay in the loop, and far more work is needed to understand the consequences. Key Topics * The asymmetry of cheap drones versus high-tech AI defence * How AI-enabled interception systems work, with a human in the loop * Living and parenting calmly under daily missile alerts * Combating wartime misinformation by aggregating trusted sources * Low-cost AI therapy tools for people under stress * Starlink as resilient, sometimes life-or-death, connectivity * Anthropic's use restrictions and removal from a US supplier list * The ethics of autonomous weapons and AI certainty in warfare Links & References * Starlink — https://www.starlink.com [https://www.starlink.com] * Anthropic — https://www.anthropic.com [https://www.anthropic.com] * If Anyone Builds It, Everyone Dies (Eliezer Yudkowsky & Nate Soares) — https://ifanyonebuildsit.com [https://ifanyonebuildsit.com]

24 de abr de 202616 min

Episode 8: Inside the AI Agent Revolution

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios