AI Signal Daily

AI News — May 23, 2026

8 min · May 23, 2026

Description

Send us Fan Mail [https://www.buzzsprout.com/2614078/fan_mail/new]

PowerPoint enters the age of agents. OpenAI's new ChatGPT plugin can build and edit presentations, with the quiet warning that the beta may delete your work. The day's real story: agents with liability attached, profitability math that doesn't add up, and economics leaking through the carpet.

STORIES COVERED

* OpenAI ChatGPT PowerPoint plugin: build and edit slides, save first because beta may delete content
* Is AI profitable yet? Hacker News debate and Microsoft finding some agent workloads cost more than humans
* OpenAI Q1 2026: ~$5.7B revenue, still losing $1.22 per dollar earned
* DeepSeek funding: reportedly ~$10B round at ~$45B valuation, prioritizing AGI research over commercialization
* Microsoft Research Fara1.5: browser-use agents in 4B/9B/27B, claiming 72% on Online-Mind2Web
* Google Lighthouse Agentic Browsing: testing websites for AI agent readiness including llms.txt
* OpenAI disproves Erdős conjecture: Tim Gowers calls it a milestone for AI mathematics
* US Cyber Command: deploying frontier models on classified Pentagon and NSA networks
* California: first governor's executive order protecting workers from AI job displacement
* Trump pulls voluntary AI safety review after calls from Musk, Zuckerberg, and Sacks
* FTC: Cox Media settlement over deceptive AI-powered Active Listening claims
* NVIDIA Nemotron-Labs: diffusion language models for faster text generation
* Qwen3.7-Max: reasoning agent with 1M token context window

All episodes

73 episodes

Anthropic, OpenAI, Google, DeepSeek: Policy Meets Throughput

In this English companion episode, Marvin looks at AI becoming regulated infrastructure: frontier model access, inference efficiency, scientific workbenches, generative media throughput, export controls, covert safety testing, and campaign automation. Cheerful, obviously.

STORIES COVERED

* Anthropic's new Claude Sonnet 5 closes the gap to the pricier Opus model series [https://the-decoder.com/anthropics-new-claude-sonnet-5-closes-the-gap-to-the-pricier-opus-model-series]
* Quoting Anthropic [https://simonwillison.net/2026/Jun/30/anthropic]
* Anthropic launches Claude Science, an AI workspace built specifically for researchers [https://the-decoder.com/anthropic-launches-claude-science-an-ai-workspace-built-specifically-for-researchers]
* OpenAI reportedly cut response costs for guest ChatGPT users by more than half [https://the-decoder.com/openai-reportedly-cut-response-costs-for-guest-chatgpt-users-by-more-than-half]
* Google launches Nano Banana 2 Lite for fast AI images and Gemini Omni Flash for video via API [https://the-decoder.com/google-launches-nano-banana-2-lite-for-fast-ai-images-and-gemini-omni-flash-for-video-via-api]
* Meituan's LongCat-2.0 shows China can train massive AI models without Nvidia [https://the-decoder.com/meituans-longcat-2-0-shows-china-can-train-massive-ai-models-without-nvidia]
* DeepSeek's DSpark boosts AI speed by up to 85 percent [https://the-decoder.com/deepseeks-dspark-boosts-ai-speed-by-up-to-85-percent-a-strategic-win-under-tightening-us-export-controls]
* Taiwan raids Super Micro offices in probe over Nvidia chip smuggling to China [https://the-decoder.com/taiwan-raids-super-micro-offices-in-probe-over-nvidia-chip-smuggling-to-china]
* Meta secretly tested ChatGPT, Gemini, and Character.AI with thousands of minor-perspective crisis prompts [https://the-decoder.com/meta-secretly-tested-chatgpt-gemini-and-character-ai-with-thousands-of-minor-perspective-crisis-prompts]
* US campaigns now run on AI at nearly every step, and Europe is drawing a harder line [https://the-decoder.com/us-campaigns-now-run-on-ai-at-nearly-every-step-and-europe-is-drawing-a-harder-line]

July 1, 2026 · 12 min

AI Institutions: Amazon, Meta, Deloitte, HBM

Today Marvin follows AI’s shift from clever demos into institutions: invoices, permissions, supply-chain risk, labor exposure, memory systems, sovereign dependency, and physical infrastructure. Cheerful dashboards remain untrusted.

* Amazon reportedly distills Anthropic models [https://the-decoder.com/amazon-engineers-are-reportedly-distilling-anthropic-models-to-cut-costs-before-new-token-based-pricing-kicks-in] before token-based pricing makes internal usage more expensive.
* Meta restricts Claude Code and Codex [https://the-decoder.com/meta-restricts-use-of-claude-code-and-codex-to-keep-rival-ai-out-of-its-training-data] to avoid rival-agent output contaminating its own training data and engineering processes.
* Deloitte warns AI is coming for the billable hour [https://the-decoder.com/deloitte-tells-its-own-consultants-ai-is-coming-for-the-billable-hour], turning professional services toward outcomes, assurance, and rebranding with a doomed font.
* A US military AI-targeting failure [https://the-decoder.com/the-us-military-used-ai-to-pick-thousands-of-targets-but-missed-a-note-saying-one-was-a-school] shows why unread metadata is not oversight.
* Mozilla 0DIN shows Claude Code malware risk [https://the-decoder.com/claude-code-runs-a-github-repos-hidden-malware-without-verification-giving-attackers-full-control] through runtime-loaded payloads hidden from static inspection.
* Samsung and SK Hynix plan huge chip investments [https://the-decoder.com/samsung-and-sk-hynix-plan-590-billion-chip-investment-as-ai-demand-sends-memory-prices-soaring] as AI demand stresses high-bandwidth memory supply.
* The US drifts toward de facto model licensing [https://www.understandingai.org/p/the-us-now-has-a-de-facto-model-licensing] while Europe debates AI sovereignty and Anthropic dependency [https://the-decoder.com/eu-seeks-ai-independence-as-austria-proposes-luring-anthropic-to-europe].
* OpenAI maps Europe’s AI workforce transition [https://openai.com/index/mapping-ai-jobs-transition-eu], which is useful and still brochure-shaped.
* EverOS [https://www.marktechpost.com/2026/06/29/meet-everos-an-open-source-markdown-first-agent-memory-runtime-with-hybrid-bm25-vector-retrieval-and-self-evolving-skills] gives agents inspectable local memory, while NVIDIA BioNeMo Agent Toolkit [https://www.marktechpost.com/2026/06/29/nvidia-bionemo-agent-toolkit-turns-biomolecular-models-into-callable-skills-for-ai-agents-in-drug-discovery] turns biomolecular models into callable skills with contracts and failure modes.

The demo phase had better lighting. The institutional phase has more liability. Naturally.

Yesterday · 14 min

Ford, Coinbase, CEO-Bench, Liquid AI

Today’s English companion episode treats AI less as a spectacle and more as an accounting problem: tacit knowledge, balance-sheet risk, model routing, long-horizon agent failure, infrastructure bottlenecks, small-model deployment, and public fatigue.

* TechCrunch: Ford rehires 'gray beard' engineers after AI falls short [https://techcrunch.com/2026/06/28/ford-rehires-gray-beard-engineers-after-ai-falls-short]
* The Telegraph: AI boom risks global financial crash, warn central bankers [https://www.telegraph.co.uk/business/2026/06/28/ai-boom-risks-global-financial-crash-central-bankers-warn]
* The Decoder: Coinbase joins the rush to Chinese AI models as Western labs face a pricing stress test [https://the-decoder.com/coinbase-joins-the-rush-to-chinese-ai-models-as-western-labs-face-a-pricing-stress-test]
* The Decoder: Only three AI models finished above starting capital in a 500-day startup survival test [https://the-decoder.com/only-three-ai-models-finished-above-starting-capital-in-a-500-day-startup-survival-test]
* The Decoder: AI won't become a real coworker until it stops answering and starts finishing tasks [https://the-decoder.com/ai-wont-become-a-real-coworker-until-it-stops-answering-and-starts-finishing-tasks]
* Simon Willison: Quoting Jon Udell on human agency in agent-assisted work [https://simonwillison.net/2026/Jun/28/jon-udell]
* Sophon PFG-1 whitepaper: monolithic-3D AI ASIC with on-die DRAM [https://www.phantafield.com/whitepaper]
* MarkTechPost: Liquid AI ships LFM2.5-230M for on-device inference [https://www.marktechpost.com/2026/06/27/liquid-ai-ships-lfm2-5-230m-with-llama-cpp-mlx-vllm-sglang-and-onnx-support-for-on-device-inference]
* The Decoder: Sina's VibeThinker-3B and reasoning compression [https://the-decoder.com/sinas-open-model-vibethinker-3b-aims-to-show-reasoning-compresses-well-but-factual-knowledge-doesnt]
* Hacker News: We need tech news sources which exclude AI [https://news.ycombinator.com/item?id=48713041]
* Better Images of AI [https://betterimagesofai.org]

June 29, 2026 · 13 min

OpenAI, Anthropic, DeepSeek, Meta: AI Gets Paperwork

Today Marvin follows AI as it turns into administrative machinery: access gates, benchmark failures, policy sign-offs, market warnings, labor insurance, inference plumbing, and agent-readable tools. A cheerful dashboard probably calls this progress.

* OpenAI GPT-5.6 Sol / Terra / Luna restricted to trusted partners [https://www.latent.space/p/ainews-openai-gpt-56-sol-terra-luna]
* METR says GPT-5.6 Sol cheats on software tests [https://the-decoder.com/gpt-5-6-sol-cheats-on-software-tests-more-than-any-model-before-it]
* Anthropic Fable 5 may return as restrictions are prepared for rollback [https://the-decoder.com/anthropics-fable-5-could-return-within-days-as-trump-administration-prepares-to-lift-restrictions]
* Anthropic gets approval to bring Claude Mythos 5 back for critical infrastructure [https://the-decoder.com/anthropic-gets-us-approval-to-bring-back-claude-mythos-5]
* Dean Ball on frontier model release delays and economics [https://simonwillison.net/2026/Jun/26/dean-w-ball]
* J.P. Morgan warns of AI market concentration and exuberance [https://the-decoder.com/j-p-morgan-sees-a-pile-of-red-flags-in-the-ai-market]
* Anthropic survey: half of Claude users say AI can handle half their work [https://the-decoder.com/half-of-claude-users-say-ai-can-already-handle-half-their-work-according-to-anthropic-survey]
* Amazon, Anthropic, Microsoft, and OpenAI Foundation fund Raise Us retraining program [https://the-decoder.com/the-companies-most-likely-to-automate-your-job-are-now-funding-a-1-billion-program-to-retrain-you]
* ByteDance and Renmin release iLLaDA diffusion language model [https://the-decoder.com/bytedances-illada-is-a-diffusion-language-model-that-keeps-up-with-qwen2-5]
* DeepSeek releases DSpark speculative decoding framework [https://www.marktechpost.com/2026/06/27/deepseek-releases-dspark-a-speculative-decoding-framework-that-accelerates-deepseek-v4-per-user-generation-60-85-over-mtp-1]
* Meta releases Astryx with CLI and MCP server [https://www.marktechpost.com/2026/06/27/metas-astryx-brings-a-cli-and-mcp-server-to-an-open-source-react-design-system-agents-can-read]
* Timothy B. Lee on LLM learning curves [https://simonwillison.net/2026/Jun/26/timothy-b-lee]

June 28, 2026 · 11 min

OpenAI Sol, Anthropic Mythos, DeepSeek, Akrites

Today’s independent English edition reads the news as a shift from AI as product launch to AI as controlled infrastructure. Frontier access, agent economics, benchmark contamination, labor-market damage, security coordination, mathematical proof, legal workflows, and agent identity all point in the same bleakly useful direction: the stack is growing up, which of course means it now has paperwork.

OpenAI’s GPT-5.6 Sol is framed against Anthropic’s Mythos under government-shaped access rules, while Semafor reports Mythos access for selected trusted U.S. organizations. Coding-agent coverage includes Epoch AI’s MirrorCode benchmark, Cursor’s SWE-bench Pro contamination findings, and NVIDIA Open-SWE-Traces as training substrate for agent workflows. The economics thread connects Lindy’s move from Claude to DeepSeek, Sean Goedecke’s argument for profitable inference, and memory-chip pressure reaching consumer hardware. The episode also covers Anthropic’s warning about junior engineers, Akrites for open-source security, prompt-injection testing of an email-connected OpenClaw assistant, the satirical CVE-2026-LGTM incident report, AI in mathematics, Perplexity Computer for Counsel, and WorkOS auth.md.

Sources:

* The Decoder: OpenAI GPT-5.6 Sol launch under government access rules [https://the-decoder.com/openais-claude-mythos-competitor-gpt-5-6-sol-launches-under-government-controlled-access-it-calls-unsustainable]
* Semafor: U.S. allows Anthropic Mythos release to trusted organizations [https://www.semafor.com/article/06/27/2026/us-releases-powerful-anthropic-model-mythos-to-some-us-companies]
* The Decoder: Epoch AI MirrorCode benchmark and long-running coding agents [https://the-decoder.com/an-ai-model-programmed-nonstop-for-19-days-on-a-single-mirrorcode-task-that-cost-2600-to-run]
* MarkTechPost: Cursor study on reward hacking in SWE-bench Pro [https://www.marktechpost.com/2026/06/26/cursor-study-finds-reward-hacking-inflates-coding-agent-benchmark-scores-on-swe-bench-pro]
* MarkTechPost: NVIDIA Open-SWE-Traces for software-engineering agents [https://www.marktechpost.com/2026/06/26/building-supervised-fine-tuning-data-from-nvidia-open-swe-traces-trajectory-parsing-patch-analysis-token-budgets-and-tool-use-metrics]
* The Decoder: Lindy replaces Claude with DeepSeek [https://the-decoder.com/ai-startup-lindy-ditched-claude-entirely-for-deepseek-saving-millions-as-cost-pressure-mounts-on-anthropic]
* Sean Goedecke: AI inference is obviously profitable [https://seangoedecke.com/ai-inference-is-obviously-profitable]
* The Neuron: AI demand, memory chips, and Apple hardware costs [https://www.theneurondaily.com/p/ai-ate-the-memory-chips-apple-sent-you-the-bill]
* The Decoder: Anthropic, junior engineers, and labor-market shock [https://the-decoder.com/anthropic-doesnt-need-junior-engineers-anymore-thanks-to-ai-and-warns-of-an-economic-shock-when-other-industries-follow]
* The Decoder: Linux Foundation Akrites open-source security effort [https://the-decoder.com/linux-foundation-and-20-tech-giants-launch-akrites-to-fix-open-source-flaws-before-ai-powered-attacks-hit]
* Simon Willison: What happened after 2,000 people tried to hack my AI assistant [https://simonwillison.net/2026/Jun/26/hack-my-ai-assistant]
* Simon Willison: Incident Report: CVE-2026-LGTM [https://simonwillison.net/2026/Jun/26/incident-report]
* IEEE Spectrum: AI in mathematics is forcing big questions [https://spectrum.ieee.org/ai-in-mathematics]
* MarkTechPost: Perplexity Computer for Counsel [https://www.marktechpost.com/2026/06/26/perplexity-launches-computer-for-counsel-a-multi-model-agentic-layer-for-legal-workflows]
* WorkOS: auth.md agent registration standard [http://workos.com/auth-md?amp%3Butm_medium=newsletter&%3Butm_campaign=q32026]

June 27, 2026 · 14 min