Devin at 89%, Opus 4.8 Price War & Illinois AI Safety Law

4 min · 29 mei 2026

Beschrijving

(00:00:00) Devin at 89%, Opus 4.8 Price War & Illinois AI Safety Law (00:00:52) Devin Writes Itself: 13% to 89% (00:01:29) Anthropic Opus 4.8 Price Pressure (00:02:04) Illinois AI Safety Law (00:02:40) OpenAI Solves 80-Year Conjecture (00:03:08) What to Watch Next Autonomous coding just crossed a threshold that most enterprise software teams weren't ready for. Cognition closed a $1 billion raise for Devin at a $26 billion valuation — up from $10.2 billion just eight months ago — backed by $492 million in annualised revenue and 50% month-over-month enterprise growth for six straight months. Clients include Mercedes-Benz, NASA, Goldman Sachs, and Santander. At Mercedes, projects that once took eight months now close in eight days. The most striking internal metric: Devin wrote 13% of Cognition's own codebase in December. Today that figure is 89%. On the model front, Anthropic released Claude Opus 4.8 with a fast mode that is 2.5× quicker and 3× cheaper than its predecessor, while matching Claude 4.7 pricing. The move applies direct cost pressure to OpenAI's agentic offerings. Anthropic's valuation has now cleared $965 billion, surpassing OpenAI's March figure of $852 billion. At the regulatory layer, Illinois passed a landmark AI safety law — including whistleblower protections — filling a gap left by the Trump administration's decision to pull back its federal frontier AI regulatory plan. The risk of state-by-state fragmentation is real and unresolved. OpenAI's model also produced the first counterexample to the Erdős planar unit distance problem, an 80-year-old mathematical conjecture — a result researchers are calling the first genuinely autonomous and interesting AI math finding. DeepMind has separately resolved nine Erdős problems. Finally, Demis Hassabis framed AI this week as a species-level transition moving ten times faster than the Industrial Revolution, with a five-to-ten year window for meaningful international coordination. This episode includes AI-generated content.

Reacties

Wees de eerste die een reactie plaatst

Meld je nu aan en word lid van de AI Daily Briefing community!

Begin hier

Alle afleveringen

36 afleveringen

Anthropic Tops $965B, Perplexity's Legal Siege & Nvidia's China Loophole Closed

(00:00:00) Anthropic Tops $965B, Perplexity's Legal Siege & Nvidia's China Loophole Closed (00:00:48) Anthropic Surpasses OpenAI Valuation (00:01:36) OpenAI EU Compliance Framework (00:02:28) US Closes Nvidia Chip Loophole (00:02:58) China AI Price Hikes Signal Cost Pressure (00:03:30) Anthropic Infrastructure and Settlement (00:04:16) What to Watch Next Anthropic has overtaken OpenAI in valuation — at least on paper. A $65 billion Series H round led by Altimeter, Dragoneer, Greenoaks, and Sequoia, with participation from Google and Amazon, places Anthropic at a $965 billion post-money valuation against OpenAI's $960 billion. The real story is the revenue engine underneath: $47 billion annualised, growing nearly 5x year-over-year, driven almost entirely by enterprise Claude deployments. Meanwhile, Perplexity's legal exposure deepens. CNN's 54-page complaint filed May 28th brings the total to nine publishers — including the New York Times, News Corp, the Washington Post, Tribune, Britannica, Merriam-Webster, Reddit, and Japan's Yomiuri Shimbun. The suits target protected expression, not just facts, making Perplexity's standard defence increasingly difficult to hold. Regulatory pressure is compounding from multiple directions. OpenAI published its Frontier Governance Framework to align with the EU AI Act ahead of the August 2nd enforcement deadline. The Commerce Department closed a Nvidia chip export loophole that had allowed hundreds of thousands of advanced chips to reach Chinese-headquartered firms via overseas subsidiaries. And China's own AI pricing model is cracking — Alibaba, Baidu, and Tencent have all raised service prices as AI agents push energy costs up to 100x higher than chatbots. Also on the radar: the Bartz v. Anthropic copyright settlement covering 120,000 authors and valued at $1.5 billion awaits final court approval, with payment expected mid-2026. Three legal regimes, two valuation leaders in a dead heat — this episode maps where AI's structural fault lines are opening up. This episode includes AI-generated content.

1 jun 20265 min

Claude's Pricing Shock, Codex on Windows & the AI Governance Gap

(00:00:00) Claude's Pricing Shock, Codex on Windows & the AI Governance Gap (00:00:25) Claude Pricing Crisis Hits CIOs (00:01:18) Codex Windows Computer Use Goes Live (00:02:07) Rosalind Biodefense Restricted Access (00:02:48) Microsoft ISO 42001 Governance Expansion (00:03:29) What to Watch Next The $700B AI infrastructure commitment is locked in — but the enterprise economics meant to justify it are under serious pressure. Today's episode examines the real-world cost and governance fallout rippling through AI adoption in mid-2026. Anthropics shift to usage-based tokenizer pricing for Claude is triggering cost alarms at major companies. CIOs are evaluating offshore AI development in India as a direct response to runaway model costs — a signal that frontier AI ROI is fragile enough that a pricing model change alone can reopen the entire business case. Abandoned projects are a baseline risk, and nine-figure AI deployment lawsuits are already setting precedent. On the capability front, OpenAI launched Codex computer use on Windows as of May 29, extending autonomous desktop control to the world's dominant enterprise OS. The feature is live — but audit logging, privilege escalation exposure, and malware surface area remain open governance questions. The EEA, UK, and Switzerland are excluded at launch. OpenAI also unveiled Rosalind Biodefense, a restricted-access program giving select U.S. government and allied partners access to GPT-Rosalind for life sciences applications. It marks a deliberate reversal of OpenAI's broad-access posture for a high-stakes domain — though whether access restrictions are enforceable in practice remains unproven. Microsoft's ISO 42001 recertification for Copilot expanded significantly in scope, now covering Copilot Studio, multi-model architectures, and admin approval workflows — a governance layer that didn't exist at enterprise scale a year ago. The throughline: infrastructure investment is done. The pricing, security, and governance infrastructure to support it is still catching up. Watch enterprise contract renewals over the next two quarters. This episode includes AI-generated content.

Gisteren4 min

GPT-5.5 Hallucinates 52% Less, Mythos Restricted & Tech's 142K Layoffs

(00:00:00) GPT-5.5 Hallucinates 52% Less, Mythos Restricted & Tech's 142K Layoffs (00:00:54) Mythos Restricted — Cybersecurity Risk (00:01:46) Tech Layoffs vs. AI Capex $700B (00:02:24) Developer Jobs Under-26 Drop 20% (00:02:54) CNN Sues Perplexity — Copyright Escalates (00:03:32) Hassabis Species-Level Warning (00:04:13) What To Watch Next Two major AI labs are racing to quantify honesty, and this episode unpacks what that really means. OpenAI's GPT-5.5 Instant is now the default ChatGPT model, with the company claiming 52.5% fewer hallucinations on medical, legal, and financial prompts — an internal figure with no independent benchmark yet. Anthropic's Opus 4.8 follows with reported gains in honesty and reduced sycophancy. One week, two labs, convergent claims: honesty is now a competitive surface. The bigger story may be what Anthropic chose not to release. The lab restricted access to a model called Mythos after flagging strikingly capable cybersecurity capabilities, launching Project Glasswing — a collaboration with Google, Microsoft, and Nvidia — focused on critical software defense. A frontier lab treating its own model as too dangerous to release openly is a genuine first. Meanwhile, 142,000 U.S. tech workers have been laid off in the first five months of 2025, up 33% year-over-year, as the same companies commit $700 billion to AI infrastructure. Developer employment for workers under 26 has dropped 20% since 2024, with entry-level roles disappearing fastest. CNN became the first TV network to sue an AI company, filing against Perplexity after failed licensing talks — adding a new media category to an already crowded copyright litigation track. And DeepMind CEO Demis Hassabis told Stanford that AI is advancing ten times faster than the Industrial Revolution, with little margin for error over the next decade. The honesty benchmarks need independent verification. The Mythos situation remains unresolved. Both will have answers — neither does yet. This episode includes AI-generated content.

30 mei 20264 min

Devin at 89%, Opus 4.8 Price War & Illinois AI Safety Law

29 mei 20264 min

Sovereign Wealth Owns AI Now: Q1's $300B Funding Shock

(00:00:00) Sovereign Wealth Owns AI Now: Q1's $300B Funding Shock (00:00:43) Three Companies, Two-Thirds of Capital (00:01:20) US Government's Nine Billion Dollar Lag (00:02:00) China Locks Down AI Talent (00:02:34) Cisco Challenges Safety Benchmarks (00:03:12) Optical Supply Chain Tightening (00:03:41) AGI Timeline and Closing Watch Points Three hundred billion dollars flowed into AI in Q1 2026 alone — and the story isn't just the number, it's who's writing the checks. Traditional venture capital has hit its ceiling. Sovereign wealth funds from Temasek, the Qatar Investment Authority, Saudi Arabia's Public Investment Fund, and Mubadala are now the dominant force at the frontier. OpenAI, Anthropic, and xAI combined to capture 67% of all global venture capital across every sector — a concentration so extreme it's compressing Series A rounds and widening the barbell between frontier labs and everyone else. Meanwhile, the US government is openly playing catch-up. The CIA and NSA have requested $9 billion for Nvidia Grace Blackwell GB10 superchips — a catch-up request, not a forward investment — while China has moved in the opposite direction, locking DeepSeek and Alibaba employees behind international travel approvals. Beijing is treating frontier AI talent as critical national infrastructure. On the safety front, Cisco's new research exposes a critical flaw in how the industry benchmarks model security. Multi-turn attack success rates across 15 frontier models ranged from 7.89% to 88.3% — meaning every model tested failed iterative safety evaluation. The threat model the industry has been using is already outdated. Elsewhere, optical components are the newest infrastructure chokepoint: demand for 800G transceivers is set to jump 2.6x, and Nvidia has locked up roughly 80% of EML laser supply. And DeepMind's Demis Hassabis moved his AGI forecast forward to 2029, citing faster-than-expected agent proliferation. All the signals, no filler. That's today's briefing. This episode includes AI-generated content.

28 mei 20264 min

Devin at 89%, Opus 4.8 Price War & Illinois AI Safety Law

Beschrijving

Reacties

2 maanden voor € 1

Alle afleveringen