AI Daily Briefing
(00:00:00) GPT-5.5 Hallucinates 52% Less, Mythos Restricted & Tech's 142K Layoffs
(00:00:54) Mythos Restricted: Cybersecurity Risk
(00:01:46) Tech Layoffs vs. AI Capex $700B
(00:02:24) Developer Jobs Under-26 Drop 20%
(00:02:54) CNN Sues Perplexity: Copyright Escalates
(00:03:32) Hassabis Species-Level Warning
(00:04:13) What To Watch Next

Two major AI labs are racing to quantify honesty, and this episode unpacks what that really means. OpenAI's GPT-5.5 Instant is now the default ChatGPT model, with the company claiming 52.5% fewer hallucinations on medical, legal, and financial prompts. That is an internal figure with no independent benchmark yet. Anthropic's Opus 4.8 follows with reported gains in honesty and reduced sycophancy. One week, two labs, convergent claims: honesty is now a competitive surface.

The bigger story may be what Anthropic chose not to release. The lab restricted access to a model called Mythos after flagging its striking cybersecurity capabilities, and launched Project Glasswing, a collaboration with Google, Microsoft, and Nvidia focused on critical software defense. A frontier lab treating its own model as too dangerous to release openly is a genuine first.

Meanwhile, 142,000 U.S. tech workers have been laid off in the first five months of 2025, up 33% year-over-year, as the same companies commit $700 billion to AI infrastructure. Developer employment for workers under 26 has dropped 20% since 2024, with entry-level roles disappearing fastest.

CNN became the first TV network to sue an AI company, filing against Perplexity after failed licensing talks and adding a new media category to an already crowded copyright litigation track. And DeepMind CEO Demis Hassabis told Stanford that AI is advancing ten times faster than the Industrial Revolution, with little margin for error over the next decade.

The honesty benchmarks need independent verification. The Mythos situation remains unresolved. Both questions will get answers; neither has one yet.
This episode includes AI-generated content.