The AI & Tech Society by Danar

Musk vs. Altman: The OpenAI Legal Battle Explained

19 min · 10 mei 2026
aflevering Musk vs. Altman: The OpenAI Legal Battle Explained artwork

Beschrijving

FOR TECH LEADERS 1. Corporate structure creates 5-10 year litigation exposure 2. Nonprofit pivots require AG negotiation, not just board approval 3. Mission-aligned structures (PBC) gain credibility advantage 4. Document founder discussions formally 5. Co-founder departure terms matter more than ever FOR INVESTORS 1. Governance risk is now diligence requirement 2. Demand mission-protection documentation 3. Monitor AG agreements and state oversight 4. Understand partner-investor risk compounding WHAT TRIAL REVEALED > "The picture that emerged is not one of villains stealing a charity, nor one of crusaders defending a mission. It is one of co-founders making consequential decisions under significant uncertainty, with informal arrangements that proved inadequate to the scale of value the technology eventually created." KEY QUOTE > "Musk will likely lose the case but is succeeding at something his lawsuit may not have intended — establishing a public record of how AI labs are actually governed, and creating durable pressure for that governance to become more formal, more transparent, and more constrained." ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

Reacties

0

Wees de eerste die een reactie plaatst

Meld je nu aan en word lid van de The AI & Tech Society by Danar community!

Probeer gratis

Probeer 14 dagen gratis

€ 9,99 / maand na proefperiode. · Elk moment opzegbaar.

  • Podcasts die je alleen op Podimo hoort
  • 20 uur luisterboeken / maand
  • Gratis podcasts

Alle afleveringen

115 afleveringen

aflevering AI Model Cost War: Claude Fable 5 vs Chinese Open Source Models artwork

AI Model Cost War: Claude Fable 5 vs Chinese Open Source Models

FABLE 5 VS CHATGPT 5.5 VS OPUS 4.8 VS KIMI 2.6 VS QWEN 3.7 THE TOKEN EFFICIENCY WRINKLE * Fable 5 uses fewer tool calls than Opus-tier models * 25-30% faster on Anthropic's spreadsheet suite * Fewer turns partially offset the 2x per-token price * Measure cost per outcome, not cost per token FABLE 5 SAFEGUARD ARCHITECTURE Novel design: Routes risky prompts to less capable model rather than refusing Classifier domains: 1. Cybersecurity 2. Biology and chemistry 3. Model distillation Fallback model: Claude Opus 4.8 Trigger rate: <5% (Anthropic) / 8-9% (Artificial Analysis) Security testing: 1,000+ hours bug bounty, no universal jailbreak found KEY QUOTES > "It's like hiring a brain surgeon to put on a band-aid." > "There is no best model. There's only the best model for this task, at this input/output ratio, with this latency tolerance." > "Everyone will have access to the smartest model. The decisive competency is knowing when not to use it." > "The first phase of enterprise AI was about access. The next phase is about allocation." ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

12 jun 202619 min
aflevering Claude Opus 4.8: Benchmark Results and Review artwork

Claude Opus 4.8: Benchmark Results and Review

CLAUDE OPUS 4.8 REVIEW AND BENCHMARK RESULTS Key insight: 10.6-point gap on SWE-bench Pro is the largest between Opus 4.8 and GPT-5.5 DYNAMIC WORKFLOWS What it is: Research preview feature letting Claude orchestrate hundreds of parallel subagents How it works: 1. Claude plans a large task 2. Writes JavaScript orchestration script 3. Spawns tens to hundreds of parallel subagents 4. Runs them simultaneously 5. Verifies results against test suite 6. Returns coordinated final answer Limits: * Up to 16 concurrent agents * Up to 1,000 agents total per run * "Meaningfully more tokens" than typical sessions * Available on Max, Team, Enterprise plans Demonstrated capability: 750,000-line codebase migrated in 11 days with 99.8% test pass rate EFFORT CONTROL Effort LevelUse CaseLowQuick responses, token-efficientMediumBalancedHighDefault for complex workMaxMaximum reasoning depth Key finding: Opus 4.8 at minimum effort matches Opus 4.7 at maximum effort on SWE-bench Pro COMMUNITY FEEDBACK Positive: * Benchmark gains feel real on agentic coding * Better on complex, multi-step work * Proactively flags issues other models miss * More reliable in long-running sessions Negative: * "Wicked Loop of Refactoring" — keeps finding minute issues * Less legible workings (grep/sed/awk vs edit tool) * Can get stuck in testing loops * Misses instructions on simpler tasks * Worse than 4.7 on some UI generation prompts ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

4 jun 202617 min
aflevering Vibe Coding Is Dead: The Rise of Agentic Engineering artwork

Vibe Coding Is Dead: The Rise of Agentic Engineering

THE THREE-PANEL FRAMEWORK Panel 1: Vibe Coding * You → Prompt → Model → Code * Fast to start * Feeling over structure * Good for prototypes * "You ask the model to solve the problem directly" Panel 2: What Changed * Stronger models are not the whole answer * The new bottleneck is context, rules, and review * Engineer writes spec → Sets rules → Lets agents work → Reviews output * "You code less. You steer the system more." Panel 3: Agentic Engineering * Agents build. The human orchestrates. * Bring together: spec, goal, constraints, history, data, rules, tools, tests * "More scalable. More repeatable. Better results." KEY QUOTES > "Many people have tried to come up with a better name for this to differentiate it from vibe coding. Personally, my current favorite is 'agentic engineering.'" — Andrej Karpathy > "The goal is to claim the leverage from the use of agents but without any compromise on the quality of the software." — Andrej Karpathy > "I think by the end of the year, everyone is going to be a product manager, and everyone codes. The title software engineer is going to start to go away." — Boris Cherny > "You can outsource your thinking but you can't outsource your understanding." — Tweet Karpathy thinks about every other day ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

28 mei 202616 min
aflevering Claude Code at the Organization Layer: What Actually Changes artwork

Claude Code at the Organization Layer: What Actually Changes

WHAT ACTUALLY CHANGES WHEN CLAUDE CODE REACHES THE WHOLE ENGINEERING ORGANIZATION METRICS THAT ACTUALLY MATTER Stop measuring: * Lines of code per developer * Token consumption * Individual productivity Start measuring: * Cycle time (Claude-assisted vs non-assisted PRs) * Time to first PR for new hires * PR throughput with quality counterweight (defect rate, rollback frequency) * Incident resolution time * Maintenance burden trajectory NON-ENGINEERS BUILDING SOFTWARE Examples from one company: * Support team: Tool surfacing relevant past tickets and customer history * Finance team: Expense categorization assistant * HR team: Onboarding checklist app pulling from live systems What engineering built: * Architecture patterns for internal apps * Plugin marketplace with pre-approved skills/MCP connections * Managed permissions (read from X, write to Y, not Z) * Audit logs for AI-generated changes The shift: Engineering didn't build the apps. Engineering built the conditions under which apps could be built safely. ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

22 mei 202619 min