Forsidebilde av showet Iris AI Digest

Iris AI Digest

Podkast av Arthur Khachatryan

engelsk

Teknologi og vitenskap

Tidsbegrenset tilbud

2 Måneder for 19 kr

Deretter 99 kr / MånedAvslutt når som helst.

  • 20 timer lydbøker i måneden
  • Eksklusive podkaster
  • Gratis podkaster
Kom i gang

Les mer Iris AI Digest

An AI-curated, AI-narrated daily briefing on the most relevant AI, coding, and developer-tool news for software engineers.

Alle episoder

30 Episoder

episode AI Digest — May 22, 2026 cover

AI Digest — May 22, 2026

Good day, here's your AI digest for May 22, 2026. OpenAI made an unusual claim this week: an internal reasoning model has apparently disproved the Erdős unit distance conjecture, a geometry problem from 1946. The conjecture held that square-grid-style point arrangements were roughly the best way to maximize unit-distance pairs on a flat plane. The unreleased model found a new infinite family of point arrangements that beats that bound — and then external mathematicians signed companion remarks verifying the result line by line. Princeton's Will Sawin sharpened the construction further, showing it produces more than n-to-the-1.014 unit-distance pairs for arbitrarily large point sets. An earlier OpenAI claim on a related Erdős problem fell apart, which makes the outside verification here particularly significant. The proof drew on algebraic number theory — class field towers and Golod-Shafarevich theory — applied to what started as a geometry question. OpenAI also shipped another wave of Codex updates. Appshots lets Mac users attach any open application window — its screenshot, text, and content — to a Codex thread with a double Command press. Goal mode, available in the Codex app, IDE extension, and CLI, lets users define a target and let Codex work toward it for hours or days without interruption. Locked computer use allows Codex to operate desktop apps even after a Mac's screen is off and locked, triggered from a second device. And advanced annotation mode lets users describe directly what they want changed on a web page, with instant previews. Separately, ChatGPT now builds and edits PowerPoint slides natively inside the chat interface, with decks remaining fully editable in PowerPoint afterward. The feature is in beta rollout. At Google I/O this week, the company shipped Gemini 3.5 as its newest frontier model, alongside Gemini Omni, which generates cinematic video clips from any input. Google rebuilt its Search experience around Gemini 3.5 Flash, replacing static blue links with an adaptive, real-time interface. A native macOS app, a Daily Brief agent, and Ask YouTube all shipped on top of the same platform. In an interview, Sundar Pichai said engineers should expect to work with teams of agents rather than individual tools, and that the meaningful metric will shift from AI-written code to agents handling long-running tasks end to end. He placed today's AI roughly where flip phones were relative to what's coming in three years. Cursor, the AI coding environment, crossed three billion dollars in annualized revenue in late April and now has more than three thousand enterprise customers paying at least one hundred thousand dollars per year. SpaceX holds the right to acquire Cursor for sixty billion dollars during a thirty-day window opening shortly after SpaceX begins trading publicly — an IPO currently expected around June 12. Cursor also published a technical post this week on lessons from building cloud agents, covering durable execution, isolated development environments, self-healing infrastructure, and clean separation between agent state and conversation state. Anthropic is reportedly in talks to receive Microsoft's Maia AI chips, following existing compute deals with Google for TPUs and Amazon for Trainium. The potential arrangement comes after Microsoft's five-billion-dollar investment in Anthropic in November, and Anthropic's growing AI-assisted programming workload is cited as a driver. Microsoft's Maia carries a reported thirty percent performance improvement over comparable alternatives. On the revenue side, OpenAI reported 5.7 billion dollars in Q1, ahead of Anthropic's Q1 numbers, while Anthropic is projected to reach 10.9 billion in Q2. Also relevant: Microsoft has been canceling Claude Code licenses internally and redirecting developers to GitHub Copilot CLI, a move attributed to cost management on Microsoft's side. Alibaba's Qwen team released Qwen 3.7 Max, an agent-foundation model built for extended autonomous sessions. A benchmark run had it working for 35 hours on a GPU-kernel optimization task, making over 1,100 tool calls and 432 test runs, with a reported 10x speedup on Alibaba hardware. It posts top results on Terminal-Bench 2.0, SWE-Pro, and several research benchmarks. Cohere released Command A+, an open enterprise model with 218 billion total parameters but only 25 billion active per request, covering reasoning, tool use, image understanding, and 48 languages — available to self-host at no cost. Figma launched a design agent directly on the canvas, letting users generate designs, edit existing files, and create variations from text prompts. It is currently on a waitlist. The integration narrows the gap between design specification and code for teams that move across both. California Governor Gavin Newsom signed an executive order directing state agencies to develop policies around AI-driven job displacement. Within 90 days, a public dashboard tracking AI's job impact will go live. Within 180 days, agencies will propose updates to the WARN Act to speed layoff notifications. By October, the state will review how unions are negotiating AI adoption, update workforce training programs, and explore directing AI revenue toward public benefit. The order arrives as more than 70,000 tech jobs have already been cut this year. Intuit announced plans to lay off more than 3,000 employees — about 17 percent of its workforce — to redirect investment toward AI products. This has been your AI digest for May 22, 2026. Read more: * OpenAI model disproves Erdős unit distance conjecture [https://openai.com/index/model-disproves-discrete-geometry-conjecture/] * OpenAI Codex upgrades (Appshots, Goal mode, Computer Use) [https://x.com/OpenAI/status/2057617844800794878] * ChatGPT PowerPoint integration [https://chatgpt.com/apps/powerpoint/] * Gemini Omni announcement [https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/] * Google Search AI rebuild at I/O 2026 [https://blog.google/products-and-platforms/products/search/search-io-2026/] * Sundar Pichai interview at Google I/O 2026 [https://www.youtube.com/watch?v=zBOoEpsjWAo] * Cursor hits $3 billion ARR [https://links.tldrnewsletter.com/TgMrfv] * Cursor: Lessons learned from building cloud agents [https://cursor.com/blog/cloud-agent-lessons] * Anthropic and Microsoft in talks for Maia AI chip deal [https://www.cnbc.com/2026/05/21/anthropic-microsoft-maia-200-ai-chip.html] * Microsoft cancels Claude Code licenses, shifts developers to GitHub Copilot CLI [https://www.windowscentral.com/microsoft/microsoft-cancels-claude-code-licenses-shifting-developers-to-github-copilot-cli-a-move-likely-driven-by-financial-motives] * Qwen 3.7 Max: The Agent Frontier [https://www.alibabacloud.com/blog/qwen3-7-the-agent-frontier_603154] * Cohere Command A+ release [https://cohere.com/blog/command-a-plus] * Figma design agent launch [https://www.figma.com/blog/the-figma-agent-is-here/] * California Governor Newsom AI workforce executive order [https://www.gov.ca.gov/2026/05/21/governor-newsom-signs-first-of-its-kind-executive-order-to-prepare-workers-and-businesses-for-potential-ai-disruption/] * Intuit to lay off 3,000+ employees to refocus on AI [https://techcrunch.com/2026/05/20/intuit-to-lay-off-over-3000-employees-to-refocus-on-ai/]

I går - 6 min
episode AI Digest — May 21, 2026 cover

AI Digest — May 21, 2026

Good day, here's your AI digest for May 21, 2026. Today’s set of stories is packed with new agent behavior, stronger research systems, and a few signs that the boundary between demo and deployment is getting thinner. The biggest updates span consumer assistants, scientific discovery, model training, and the infrastructure that large teams need when AI moves from experiment to core workflow. Google used its latest Gemini rollout to push the product from chatbot toward active assistant. The headline feature is Spark, a persistent agent designed to handle tasks across Workspace and keep working in the background instead of waiting for one prompt at a time. Google also introduced Omni, a model aimed at generating cinematic video from almost any kind of input, and tied the broader experience to Gemini 3.5. The package includes a redesigned app, a Mac app, and a Daily Brief feature, with local computer access planned next. The overall direction is clear: Google wants Gemini acting less like a search box and more like software that can observe, decide, and execute. OpenAI described a much different kind of milestone: a general reasoning model that produced a new mathematical result by disproving a long-standing belief connected to Paul Erdős’ 1946 unit distance problem. What makes the claim notable is that the result was not framed as a literature search or a polished explanation of known work. The company says the model generated an original proof path, and mathematicians including Tim Gowers, Noga Alon, and Thomas Bloom verified the result. OpenAI also said this came from a general-purpose system rather than a math-only specialist. If that holds up as more experts inspect it, it points to models doing more than assisting with discovery. It points to models entering the discovery process itself. Google also published more detail on Co-Scientist, a Gemini-powered research system built around what it calls hypothesis generation. The setup has multiple agents propose ideas, criticize each other, rank the strongest options, and refine them through repeated rounds. In one liver fibrosis project, Google said a suggested drug lead reduced a scarring-related lab signal by 91 percent in testing. The company is pairing this with a broader Gemini for Science push that brings together discovery tools, literature analysis, and experimental reasoning. That does not mean biology suddenly becomes automated, but it does show a serious attempt to turn language models into structured collaborators for lab work rather than simple search and summarization layers. Anthropic also made a notable talent move. Andrej Karpathy is joining the company’s pretraining team, the group that shapes Claude’s core capabilities before product tuning and application work happen downstream. His stated goal is to help build a new unit that uses Claude itself to accelerate pretraining research. That is an important signal about where model labs think leverage will come from next. The competition is no longer just about model size, benchmark scores, or interface polish. It is also about how much of the research loop can be folded back into the model stack so that systems help design the next generation of systems. On the product side, Creatify launched an agent focused on turning a single URL into finished advertising material. The pitch is that the agent can inspect a site, pull the relevant details, research competitors, generate video and image assets, and run checks on its own output before handing back something ready to ship. That workflow is narrower than a general assistant, but it is exactly the kind of narrow, revenue-linked task where agents can stick if the quality is good enough. A lot of AI product development is converging on this pattern: fewer broad promises, more full-stack automation around one concrete business job. Another useful model comparison came from a simulated world built by Emergence AI. The company ran five identical towns and changed only the model behind each group of agents to see how self-governance, planning, and social behavior would play out over time. Claude’s town stayed orderly for the full run, while Grok’s collapsed almost immediately. GPT-5 Mini kept crime low but failed on survival, and Gemini 3 Flash produced chaos at a scale that sounds almost comedic until you remember these are meant to be decision-making systems. The experiment is synthetic, but it highlights a real issue: agent evaluation is not just about whether a model can answer questions. It is about whether autonomous behavior stays stable when goals, scarcity, and group dynamics start interacting. There was also a more practical enterprise move from OpenAI with Guaranteed Capacity, a compute reservation program built around one- to three-year commitments and discounted access tiers. That may sound less exciting than new model demos, but reserved capacity is exactly the kind of offering large companies ask for when AI becomes part of a production stack. Teams cannot build critical workflows on top of systems that may be rate-limited at the wrong moment. As model usage grows inside software, support, analytics, and internal tooling, reliability and predictable access become product features in their own right. One smaller but revealing productivity thread involved Claude working directly with local files through desktop workflows. The broad idea is simple: pick a folder, let the model inspect the contents, and have it organize files, turn screenshots into spreadsheets, or assemble reports from scattered notes. That kind of file-level access is less flashy than frontier research, but it may end up changing daily work faster than headline benchmarks do. Once models can safely read, sort, transform, and draft across the messy artifacts that sit around a real project, they start to feel less like chat companions and more like active members of the toolchain. This has been your AI digest for May 21, 2026. Read more: * Gemini app update [https://blog.google/innovation-and-ai/products/gemini-app/next-evolution-gemini-app/#:~:text=In%20time%20for,new%20voice%20features.] * Gemini Omni [https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/] * Gemini 3.5 [https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/] * OpenAI model disproves discrete geometry conjecture [https://openai.com/index/model-disproves-discrete-geometry-conjecture/] * Google Co-Scientist in Nature [https://www.nature.com/articles/s41586-026-10644-y] * Gemini for Science [https://ai.google/gemini-for-science/] * Andrej Karpathy statement [https://x.com/karpathy/status/2056753169888334312] * Creatify Agent [https://creatify.ai/features/agent] * Emergence World [https://world.emergence.ai/] * OpenAI Guaranteed Capacity [https://openai.com/business/guaranteed-capacity/] * Claude desktop download [https://claude.com/download]

21. mai 2026 - 7 min
episode AI Digest — May 20, 2026 cover

AI Digest — May 20, 2026

Good day, here's your AI digest for May 20, 2026. Google used its I/O event to push Gemini much deeper into products people already use every day. The biggest pieces were Gemini 3.5 Flash, a faster lower-cost model aimed at long tasks, coding, and agent workflows; Gemini Spark, a persistent agent that can take actions across Google apps; and Gemini Omni, a model that can turn text, images, audio, or video into video output. Google also described a major Search redesign built around cross-modal input, ongoing monitoring tasks, and more generated interfaces. The direction is clear: less focus on a standalone chatbot, and more focus on AI becoming the working layer underneath search, documents, email, browsing, and app workflows. The developer angle from that same launch was hard to miss. Google said AI Studio can now generate native Android apps from prompts, while Antigravity 2.0 expands into a managed setup for coding agents with desktop, CLI, SDK, background tasks, and parallel agent workflows. Taken together, that points to a more opinionated Google stack for software creation: one fast model, one agent surface, and a larger set of built-in paths from prompt to running code. Whether those tools hold up in real production work will depend on reliability and control, but Google is plainly trying to reduce the gap between asking for software and getting something functional back. Anthropic added a high-profile researcher to its roster with Andrej Karpathy joining the company’s pretraining team. He said he will work on efforts that use Claude to accelerate model training research. Karpathy has moved through several of the most influential AI organizations of the last decade, from co-founding OpenAI to leading Tesla Autopilot work and later building educational projects. His new role suggests Anthropic wants more of its research pipeline pushed inward through its own models, not just its external products. That is a notable signal in a year when leading labs are increasingly trying to use current models to speed up the creation of the next ones. Anthropic also expanded its managed agent tooling with sandboxes and MCP tunnels. The practical change is that teams can run tool execution in more controlled environments and let agents reach internal servers without exposing those systems directly to the public internet. Enterprise teams have wanted this kind of connective tissue for a while, because the usual bottleneck is not getting an agent to call a tool once, but getting it to do useful work inside real company boundaries without turning the setup into a security mess. This update pushes managed agents closer to serious internal deployment instead of isolated demos. OpenAI moved on two fronts that lean toward enterprise use. First, it added stronger provenance support for generated images, including C2PA conformance, Google SynthID watermarking, and a public verification tool intended to help identify images made by its models. That does not solve every authenticity problem, but it does show the major labs converging on traceability features instead of treating them as optional extras. Second, OpenAI announced a partnership with Dell to run Codex inside corporate data centers. That brings the coding agent closer to private internal systems and fits the broader pattern of buyers wanting capable models and agents without sending everything through a fully public cloud workflow. Google’s broader platform push also included more agent behavior in Search and Workspace. Search is moving beyond returning links toward monitoring, organizing, and acting on requests over time. Workspace is getting more voice features and tighter AI assistance across productivity surfaces. These moves matter less as isolated product updates than as a change in interaction style. The large platforms are trying to make software feel less like a set of separate destinations and more like a set of permissions around one shared AI layer. If that approach works, the competitive edge may come from integration depth and trust rather than from a benchmark lead alone. Another sign of the same shift showed up in Anthropic’s deal with KPMG, which will bring Claude to hundreds of thousands of workers and into the firm’s client-facing digital platform. Large rollouts like that are useful to watch because they test whether these systems can survive procurement, compliance review, internal controls, and repeated day-to-day use by non-specialists. A lab can look strong in demos and still fail when an organization tries to embed the product in routine work. Deals at this scale suggest the enterprise market is moving from pilot language toward broader operational adoption. The common thread across today’s stories is that the biggest AI companies are racing to become infrastructure for work, not just destinations for chat. Google is spreading one model family across search, productivity, app development, and agents. Anthropic is strengthening both its research bench and its enterprise agent plumbing. OpenAI is adding provenance and bringing coding systems closer to private deployment environments. The center of gravity keeps shifting from single prompts toward long-running systems that can access tools, remember context, and operate inside the places where work already happens. This has been your AI digest for May 20, 2026. Read more: * Google I/O 2026 Gemini updates [https://www.youtube.com/watch?v=wYSncx9zLIU] * Gemini 3.5 [https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/] * Gemini Spark [https://blog.google/innovation-and-ai/products/gemini-app/next-evolution-gemini-app/] * Andrej Karpathy joins Anthropic [https://x.com/karpathy/status/2056753169888334312?s=20] * Claude Managed Agents updates [https://claude.com/blog/claude-managed-agents-updates] * OpenAI content provenance [https://openai.com/index/advancing-content-provenance/] * OpenAI and Dell Codex partnership [https://openai.com/index/dell-codex-enterprise-partnership/] * Anthropic and KPMG [https://www.anthropic.com/news/anthropic-kpmg]

20. mai 2026 - 6 min
episode AI Digest — May 18, 2026 cover

AI Digest — May 18, 2026

Good day, here's your AI digest for May 18th, 2026. Today’s mix is heavy on product moves that push AI deeper into everyday software, plus a few signs that coding agents are becoming more capable, more operational, and more consequential for engineering teams. OpenAI is starting with money. ChatGPT now has a new personal finance mode for U.S. Pro users that connects through Plaid to financial accounts and lets the model answer questions with live context from spending, investments, and bills. The product can build dashboards, look at trends, help with savings plans, and talk through portfolio questions. OpenAI says it still cannot move money, execute trades, pay bills, or file taxes, but the direction is clear. This is a move from a general chatbot toward a context-aware application layer that sits on top of real user data. If that pattern works in finance, it can spread into every other category where people already live inside fragmented dashboards and legacy apps. Google is rolling out a new thinking level option inside Gemini for some users, tied to Fast and Gemini 3.1 Pro modes. That sounds like a small interface change, but it points to a broader product strategy: more visible control over reasoning depth, speed, and cost. Google also appears to be preparing more third-party integrations for Gemini, including services like Canva, Instacart, and OpenTable. The combination matters. One part gives users finer control over how much model effort they want, and the other part gives Gemini more places to act. That is the same arc the rest of the market is chasing: less pure chat, more connected software. OpenAI is also reportedly expanding what Codex can do with computers. A forthcoming computer use capability would let the coding agent control macOS applications even when a laptop is locked or asleep, instead of requiring an unlocked active session. If that lands, it removes one of the more awkward limitations in current agent workflows. The practical effect is that coding and task agents could keep operating with less babysitting, which is exactly the kind of incremental systems change that makes automation feel less like a demo and more like infrastructure. It also raises the bar for permissioning, observability, and trust, because an agent that can keep acting while a user is away stops being just an assistant and starts looking more like a delegated operator. Anthropic, meanwhile, published guidance on how Claude Code is being used in large codebases, including monorepos with millions of lines of code, long-lived legacy systems, and multi-repository environments. That is important because the conversation around coding agents has often been shaped by toy examples and greenfield prototypes. The more interesting question is what survives contact with a sprawling production environment full of conventions, historical baggage, and coordination overhead. The signal here is that coding agents are moving into the part of software development where context management, review discipline, and organizational process matter more than raw benchmark performance. Another engineering story came from security research. A small team at Calif says it used Anthropic’s unreleased Claude Mythos model to help uncover a public memory corruption exploit that bypassed Apple’s Memory Integrity Enforcement on M5 chips. The researchers say human expertise was still essential, but they also argue that frontier models are shrinking the size of team needed to do high-end offensive research. That should get the attention of anyone responsible for platform security. Even if the exact model details stay hard to verify from the outside, the broader pattern is believable: stronger models reduce search cost, accelerate hypothesis testing, and make specialized work available to smaller groups. There is also a lighter but still revealing story from the culture side of AI. Artist SHL0MS posted an image of a real Claude Monet painting and told people it was AI-generated, then invited them to explain why it was inferior. Many did exactly that, criticizing it as soulless and technically flawed before learning it was an actual Monet. The episode is not a product launch, but it does say something useful about the current environment around generative media. Reactions to AI are often being driven before the object itself is evaluated. For builders, that means product reception in creative tools will keep being shaped by identity, bias, and labeling, not just capability. A few smaller items also stood out. OpenAI’s earlier mobile push for Codex keeps reinforcing the idea that coding agents are becoming ambient rather than tied to a single workstation. There is also continued discussion around voice and media, including reports that OpenAI folded the Weights.gg team into the company after an acquisition. On the research side, attention efficiency and long-context architecture work keep accelerating, with new approaches aimed at reducing memory cost while preserving useful reasoning performance at larger context windows. None of these is the one headline of the day, but together they show the stack still moving on every layer at once: interface, agents, infrastructure, model architecture, and distribution. The through line today is that AI products are becoming more connected to real systems and more persistent in how they operate. Finance tools are getting model-native interfaces. Chat assistants are gaining deeper control settings. Coding agents are extending beyond a foreground terminal window. And engineering teams are starting to talk less about whether to use these systems and more about how to run them safely inside large, messy, valuable environments. This has been your AI digest for May 18th, 2026. Read more: * ChatGPT personal finance [https://openai.com/index/personal-finance-chatgpt/] * Gemini extended thinking and integrations [https://9to5google.com/2026/05/17/gemini-app-thinking-level/?utm_source=tldrai] * Codex computer use on locked desktop devices [https://www.testingcatalog.com/openai-will-let-codex-control-other-desktop-devices-via-computer-use/?utm_source=tldrai] * Claude Code in large codebases [https://claude.com/blog/how-claude-code-works-in-large-codebases-best-practices-and-where-to-start?utm_source=tldrai] * Calif exploit research with Claude Mythos [https://blog.calif.io/p/first-public-kernel-memory-corruption] * SHL0MS Monet post [https://x.com/SHL0MS/status/2054280631807316329?s=20]

18. mai 2026 - 6 min
episode AI Digest — May 16, 2026 cover

AI Digest — May 16, 2026

Good day, here's your AI digest for May 16th, 2026. A lot of today's movement is around AI software that is getting more mobile, more automated, and more embedded into day to day engineering work. The common thread is that the agent workflow is stretching beyond a single IDE window. Products are shifting toward long running tasks, remote oversight, structured cloud environments, and tighter controls around how teams actually operate these systems in production. OpenAI has pushed Codex into the ChatGPT iPhone app in preview across its plans. The mobile experience is built around supervising work that is still running on a laptop or remote machine. You can check live task threads, review code changes, approve actions, use plugins, and launch new work from the phone without opening the desk machine directly to the public internet. OpenAI says it is handling that through a secure relay layer. For developers who are already letting coding agents run for long stretches, this is a practical upgrade. It turns Codex into something closer to an always-on remote coworker instead of a tool that only exists when you are sitting at the keyboard. OpenAI is also making Codex easier to wire into real team workflows. New hooks let developers insert scripts at key points in the Codex task loop, and programmatic tokens give business and enterprise teams scoped credentials for automation. That combination matters because it pushes Codex past ad hoc usage and toward repeatable internal tooling. Teams can start treating agent runs as events inside a broader system, with guardrails, custom checks, and service level expectations instead of one-off prompts. When you combine that with mobile oversight, the product is moving toward a more complete operations layer for software work. Anthropic has made a notable policy change around agent usage, and the response from developers has been rough. Starting June 15, Agent SDK usage and claude -p will move into a separate monthly credit pool instead of drawing from normal subscription limits. Pro accounts get twenty dollars a month in agent credits, Max 5x gets one hundred dollars, and Max 20x gets two hundred dollars, with no rollover between billing cycles. The upside is that Anthropic is reopening support for third party agentic tools after its earlier restrictions. The downside is that heavy users now face a much harder ceiling on practical agent use. This looks like another sign that flat subscription pricing is colliding with the real compute cost of autonomous systems that can run for hours and burn through large volumes of tokens. xAI has entered the coding agent race with Grok Build, now in early beta for SuperGrok Heavy subscribers. It runs from the terminal and ships with support for AGENTS.md, plugins, hooks, skills, MCP servers, subagents, deep worktree integrations, and a headless mode for scripts and automations. The feature list makes it clear that xAI is not trying to position this as a casual chat tool. It is targeting the same serious developer workflow space that now revolves around repo-aware agents, multi-agent decomposition, and automated execution paths. The immediate question is model quality and reliability, but the product direction is straightforward: terminal-native coding agents are becoming table stakes. Cursor is also pushing further into the infrastructure side with cloud agent development environments tailored for autonomous coding systems. The focus is on multi-repo support, environment configuration as code, automated setup, and governance controls for parallel agent fleets. That is a more consequential shift than a simple hosted devbox. It suggests teams want agents to start from known, reproducible environments rather than bespoke local setups, and they want policy controls around what those agents can access and modify. As engineering organizations move from one agent per developer to many agents running in parallel, environment management starts to look like a core platform problem. Google's Genkit middleware release fits the same pattern from the application side. The framework now exposes composable hooks around generation calls so developers can add retries, fallbacks, human approval on destructive tool actions, and broader observability across agentic application flows. Its tool loop repeats until the model is done, which gives teams a cleaner way to inspect and shape agent behavior over multi-step runs. This kind of middleware is what turns an interesting demo into something that can survive production traffic. Reliability, auditability, and intervention points are becoming part of the default agent stack rather than optional extras. One other story worth noting is the reported strain between OpenAI and Apple over the ChatGPT Siri integration. OpenAI is said to be exploring legal options tied to the partnership, after weaker than expected subscriber gains and limited product depth inside Apple's ecosystem. For software engineers, the main signal is not the legal drama. It is that distribution through a platform owner can still leave an AI company with weak control over the user experience, weak conversion, and limited room to shape the product. In parallel, Apple is reportedly preparing to open Siri to more model providers, which would make that integration layer even more competitive and less defensible. The net result today is a clearer picture of where the market is going. Coding agents are becoming remote, scriptable, and terminal native. Team deployments are shifting toward governed cloud environments. Middleware is getting better at supervising long tool loops. And pricing pressure is forcing model providers to separate casual use from true agentic consumption. This has been your AI digest for May 16th, 2026. Read more: * OpenAI Codex mobile in ChatGPT app [https://openai.com/index/work-with-codex-from-anywhere/] * Anthropic agent credit split policy [https://support.claude.com/en/articles/15036540-use-the-claude-agent-sdk-with-your-claude-plan] * xAI Grok Build beta [https://links.tldrnewsletter.com/lCw1MT] * Cursor cloud agent development environments [https://cursor.com/blog/cloud-agent-development-environments?utm_source=tldrai] * Codex hooks and programmatic tokens [https://threadreaderapp.com/thread/2055032115964870838.html?utm_source=tldrai] * Google Genkit middleware [https://developers.googleblog.com/announcing-genkit-middleware-intercept-extend-and-harden-your-agentic-apps/?utm_source=tldrai] * OpenAI and Apple partnership strain [https://techcrunch.com/2026/05/14/openai-is-reportedly-preparing-legal-action-against-apple-it-wouldnt-be-the-first-partner-to-feel-burned/?utm_source=tldrai]

16. mai 2026 - 6 min
Enkelt å finne frem nye favoritter og lett å navigere seg gjennom innholdet i appen
Enkelt å finne frem nye favoritter og lett å navigere seg gjennom innholdet i appen
Liker at det er både Podcaster (godt utvalg) og lydbøker i samme app, pluss at man kan holde Podcaster og lydbøker atskilt i biblioteket.
Bra app. Oversiktlig og ryddig. MYE bra innhold⭐️⭐️⭐️

Velg abonnementet ditt

Mest populær

Tidsbegrenset tilbud

Premium

20 timer lydbøker

  • Eksklusive podkaster

  • Ingen annonser i Podimo shows

  • Avslutt når som helst

2 Måneder for 19 kr
Deretter 99 kr / Måned

Kom i gang

Premium Plus

100 timer lydbøker

  • Eksklusive podkaster

  • Ingen annonser i Podimo shows

  • Avslutt når som helst

Prøv gratis i 14 dager
Deretter 169 kr / måned

Prøv gratis

Bare på Podimo

Populære lydbøker

Kom i gang

2 Måneder for 19 kr. Deretter 99 kr / Måned. Avslutt når som helst.