The AI & Tech Society by Danar

Claude Code at the Organization Layer: What Actually Changes

19 min · 22. mai 2026
episode Claude Code at the Organization Layer: What Actually Changes cover

Beskrivelse

WHAT ACTUALLY CHANGES WHEN CLAUDE CODE REACHES THE WHOLE ENGINEERING ORGANIZATION METRICS THAT ACTUALLY MATTER Stop measuring: * Lines of code per developer * Token consumption * Individual productivity Start measuring: * Cycle time (Claude-assisted vs non-assisted PRs) * Time to first PR for new hires * PR throughput with quality counterweight (defect rate, rollback frequency) * Incident resolution time * Maintenance burden trajectory NON-ENGINEERS BUILDING SOFTWARE Examples from one company: * Support team: Tool surfacing relevant past tickets and customer history * Finance team: Expense categorization assistant * HR team: Onboarding checklist app pulling from live systems What engineering built: * Architecture patterns for internal apps * Plugin marketplace with pre-approved skills/MCP connections * Managed permissions (read from X, write to Y, not Z) * Audit logs for AI-generated changes The shift: Engineering didn't build the apps. Engineering built the conditions under which apps could be built safely. ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

Kommentarer

0

Vær den første til å kommentere

Registrer deg nå og bli medlem av The AI & Tech Society by Danar sitt community!

Prøv gratis

Prøv gratis i 14 dager

99 kr / Måned etter prøveperioden. · Avslutt når som helst.

  • Eksklusive podkaster
  • 20 timer lydbøker i måneden
  • Gratis podkaster

Alle episoder

118 Episoder

episode AI Is Now the #1 Reason for Layoffs: Reading the 2026 Workforce Data cover

AI Is Now the #1 Reason for Layoffs: Reading the 2026 Workforce Data

THREE HONEST OBSERVATIONS 1. Tech is the exception — 5.8% vs 3.8% overall; displacement invisible in macro stats 2. Regulation mobilizing — Newsom executive order; EU pressure; state legislation likely 2026-27 3. "More jobs than it destroys" is partly evasive — new roles need different skills; reskilling timeline lags; aggregate doesn't help individuals SEVEN ACTIONS FOR LEADERS 1. Be honest about what's changing (no "efficiency" euphemisms) 2. Redirect savings into upskilling, not just GPUs 3. Protect the entry-level rung (new apprenticeship paths) 4. Promote harness skill, not just prompt skill 5. Stop AI-washing organizational decisions 6. Set explicit headcount-vs-AI tradeoffs 7. Treat severance/outplacement as engineering quality FIVE ACTIONS FOR ENGINEERS 1. Build harness skill, not prompt skill 2. Get certified (e.g., Claude Certified Architect) 3. Track your skill exposure honestly 4. Build a portable, public portfolio 5. Maintain 6-12 months financial runway SEVEN KEY TAKEAWAYS 1. AI became #1 layoff reason in May 2026 (40%); 7%→40% in five months 2. AI washing is real (6 in 10 companies admit it) 3. The precise truth is capital reallocation 4. CEO statements remarkably consistent (Oracle cut while profitable) 5. Displacement is structural, not uniform (middle hollows out) 6. Tech is the exception (5.8% vs 3.8%) 7. The response defines the next decade KEY QUOTES > "Regardless of whether individual jobs are being replaced by AI, the money for those roles is." — Andy Challenger > "We're already seeing that the intelligence tools we're creating... fundamentally changes what it means to build and run a company. I think most companies are late." — Jack Dorsey, Block > "The leadership test of 2026 is whether you handle the AI workforce transition as a tactical cost-cutting opportunity — or as the defining strategic moment of the decade." ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

29. juni 202617 min
episode The State of AI Engineering: What a Thousand Companies' Telemetry Reveals cover

The State of AI Engineering: What a Thousand Companies' Telemetry Reveals

FIVE MOVES FOR LEADERS 1. Adopt a model gateway — centralize routing, failover, governance 2. Build deprecation discipline — retire models deliberately 3. Instrument agents deeply — especially with frameworks 4. Audit prompt caching — fix layout (stable first, dynamic later) 5. Implement budgets & backpressure — cap loops, build queues SEVEN KEY TAKEAWAYS 1. Multi-model is the norm (70%+ use 3+ models); use a gateway 2. LLM tech debt compounds; retire old models deliberately 3. Framework adoption doubled; observability burden doubled too 4. 69% of tokens are system prompts; only 28% use caching 5. Context windows exploded but quality beats volume 6. Rate limits are the #1 failure mode 7. Agents are still mostly monoliths; distributed shift is coming KEY QUOTES > "The gap between a good demo and a dependable system is closed by effective evaluation and operational discipline." — Datadog > "The next wave of agent failures won't be about what agents can't do. It'll be about what teams can't observe." — Guillermo Rauch, CEO, Vercel > "Context quality, not volume, is the new limiting factor for LLM agents." ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

24. juni 202619 min
episode AI Model Cost War: Claude Fable 5 vs Chinese Open Source Models cover

AI Model Cost War: Claude Fable 5 vs Chinese Open Source Models

FABLE 5 VS CHATGPT 5.5 VS OPUS 4.8 VS KIMI 2.6 VS QWEN 3.7 UPDATED ** CLAUDE FABLE JUST GOT SUSPENDED 2026-06-12 BY ANTHROPIC AND THE US GOVERNMENT. THE TOKEN EFFICIENCY WRINKLE * Fable 5 uses fewer tool calls than Opus-tier models * 25-30% faster on Anthropic's spreadsheet suite * Fewer turns partially offset the 2x per-token price * Measure cost per outcome, not cost per token FABLE 5 SAFEGUARD ARCHITECTURE Novel design: Routes risky prompts to less capable model rather than refusing Classifier domains: 1. Cybersecurity 2. Biology and chemistry 3. Model distillation Fallback model: Claude Opus 4.8 Trigger rate: <5% (Anthropic) / 8-9% (Artificial Analysis) Security testing: 1,000+ hours bug bounty, no universal jailbreak found KEY QUOTES > "It's like hiring a brain surgeon to put on a band-aid." > "There is no best model. There's only the best model for this task, at this input/output ratio, with this latency tolerance." > "Everyone will have access to the smartest model. The decisive competency is knowing when not to use it." > "The first phase of enterprise AI was about access. The next phase is about allocation." ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

12. juni 202619 min
episode Claude Opus 4.8: Benchmark Results and Review cover

Claude Opus 4.8: Benchmark Results and Review

CLAUDE OPUS 4.8 REVIEW AND BENCHMARK RESULTS Key insight: 10.6-point gap on SWE-bench Pro is the largest between Opus 4.8 and GPT-5.5 DYNAMIC WORKFLOWS What it is: Research preview feature letting Claude orchestrate hundreds of parallel subagents How it works: 1. Claude plans a large task 2. Writes JavaScript orchestration script 3. Spawns tens to hundreds of parallel subagents 4. Runs them simultaneously 5. Verifies results against test suite 6. Returns coordinated final answer Limits: * Up to 16 concurrent agents * Up to 1,000 agents total per run * "Meaningfully more tokens" than typical sessions * Available on Max, Team, Enterprise plans Demonstrated capability: 750,000-line codebase migrated in 11 days with 99.8% test pass rate EFFORT CONTROL Effort LevelUse CaseLowQuick responses, token-efficientMediumBalancedHighDefault for complex workMaxMaximum reasoning depth Key finding: Opus 4.8 at minimum effort matches Opus 4.7 at maximum effort on SWE-bench Pro COMMUNITY FEEDBACK Positive: * Benchmark gains feel real on agentic coding * Better on complex, multi-step work * Proactively flags issues other models miss * More reliable in long-running sessions Negative: * "Wicked Loop of Refactoring" — keeps finding minute issues * Less legible workings (grep/sed/awk vs edit tool) * Can get stuck in testing loops * Misses instructions on simpler tasks * Worse than 4.7 on some UI generation prompts ---------------------------------------- Hosted on Acast. See acast.com/privacy [https://acast.com/privacy] for more information.

4. juni 202617 min