Episode 65: Hermes Agent v0.16.0 Desktop App, Codex 0.137 Multi-Agent v2, Claude Code Fallback Models, and Gemma 4 12B on the Local Stack

35 min · 6. juni 2026

Description

Hermes Agent v0.16.0 — "The Surface Release" — ships a real native desktop app with OAuth remote connect, drag-and-drop file input, and a browser-based admin panel. Codex 0.137 adds multi-agent v2 runtime choice persistence and parallel web search. Claude Code 2.1.166/2.1.167 introduces fallback model chains and glob tool-name deny rules. Gemma 4 12B is Google's latest open-weight 12B model that runs locally on a laptop with 16GB VRAM. The project radar covers the A2A protocol hitting v1.0, Kimi Code CLI as a TypeScript-native terminal coding agent, and the awesome-ai-agents-2026 curated resource list. Show notes: https://tobyonfitnesstech.com/podcasts/episode-65/

Comments

Be the first to comment

Get Started

All episodes

68 episodes

Episode 66: Claude Friday Outage, Claude Code .168 Day-Late Fix, OpenClaw Monthly Cadence Switch, OpenAI ChatGPT Superapp, Apple WWDC 2026, Anthropic Mythos Widens, Microsoft MAI Lands in Copilot, Gemma 4 12B on Mac

OpenClaw v2026.6.5-beta.2 and Claude Code 2.1.168 lead the agent-harness cycle, and the cycle opens with a Friday June 5 outage that hit Claude API, Claude Code, claude.ai, and Claude Cowork for roughly two hours — primarily Opus 4.7 and 4.8 — peaking near a thousand Downdetector reports. OpenClaw switched release trains to a monthly patch cadence with the June 2026 floor at 5.28. Claude Code shipped a focused day-late bug-fix release on the .167 baseline, closing session attachment, stream-json event ordering, and interrupt handling regressions that some users reported during the outage window. OpenAI is reportedly planning its biggest ChatGPT overhaul yet — a unified superapp that folds in Codex, agents, and third-party services ahead of a fall IPO. Apple WWDC 2026 opens June 8 with a Gemini-powered Siri as the headline. Anthropic expands Project Glasswing to 150+ organizations and signals Mythos-class capabilities are coming in weeks. Microsoft launches MAI-Thinking-1 and MAI-Code-1-Flash into GitHub Copilot. Gemma 4 12B ships an encoder-free multimodal design for 16GB local Macs. The MCP lane is brief this week — a one-paragraph blip, not a deep-dive. Project radar covers A2A v1.0 and the CheetahClaws Python harness. Show notes: https://tobyonfitnesstech.com/podcasts/episode-66/

Yesterday38 min

Episode 65: Hermes Agent v0.16.0 Desktop App, Codex 0.137 Multi-Agent v2, Claude Code Fallback Models, and Gemma 4 12B on the Local Stack

6. juni 202635 min

Episode 64: Claude Code 2.1.165, Microsoft's MAI Coding Model Family, and the Agent Infrastructure Project Radar

Claude Code 2.1.165 is the latest npm `latest` as of June 5, following 2.1.163 and 2.1.164 — all bug-fix and reliability releases that clean up background sessions, plugin hooks, skill syntax, and Windows path handling. Microsoft dropped a seven-model MAI family at Build 2026 on June 2, with MAI-Code-1-Flash as the headline: a 5B-parameter coding model trained on GitHub Copilot production harnesses, scoring 51% on SWE-Bench Pro and 60% leaner on tokens than comparable models. The episode also covers the GitHub Project Radar around agent memory, code graphs, and MCP tooling that serve the local coding-agent stack. Show notes: https://tobyonfitnesstech.com/podcasts/episode-64/

5. juni 202638 min

Episode 63: OpenClaw 2026.6.1, Claude Code 2.1.162, Qwen 3.7 Max/Plus, and Agent Memory Infrastructure

[00:00] Episode hook OpenClaw v2026.6.1, Hermes Agent v2026.5.29.2, and Claude Code 2.1.162 drop in the same episode window. The stable OpenClaw tag is v2026.6.1, the Hermes stable tag stays at v2026.5.29.2, and the latest Claude Code npm `latest` is 2.1.162. OpenClaw v2026.6.1 ships Workboard orchestration, a governed Skill Workshop, SQLite-backed state recovery, and MiniMax M3 provider support. Claude Code 2.1.162 adds waiting-for visibility in `claude agents --json` and a batch of permission and interrupt fixes across five releases from 2.1.158 to 2.1.162. Qwen 3.7 Max and Plus split the coding-reasoning and multimodal-vision lanes. agentmemory makes every agent on your machine share a persistent context layer. This is a 60-minute episode — keep the existing builder stories and extend runtime. Show notes: https://tobyonfitnesstech.com/podcasts/episode-63/

4. juni 202631 min

Episode 62: Codex 0.136, Stanford's Agent Guidelines, AWS OpenAI, and GPU Efficiency

AgentStack Daily EP062 leads with Codex `rust-v0.136.0`: better TUI diagnostics and error context, improved app-server lifecycle handling, named hooks and permission scopes, Python SDK and Node SDK improvements, and non-interactive installation support. Stanford's CS336 course publishes a formal AI agent guidelines document that reaches 1,863 stars in under 24 hours — institutional validation that agent workflow guidelines are becoming a first-class engineering concern. OpenAI puts GPT-4.5, o3, and Codex on AWS Bedrock, completing the pattern where both major labs distribute through the same cloud. Expanse from YC P26 uses cluster-specific fine-tuned models to predict GPU job resource needs and outperforms frontier LLMs by 8x on that task, backed by real HPC telemetry and SLURM/Kubernetes integration. The project radar covers agent OS for hardware, terminal context managers, MCP workflow templates, and physical agent scheduling. Show notes: https://tobyonfitnesstech.com/podcasts/episode-62/

3. juni 202646 min

Episode 65: Hermes Agent v0.16.0 Desktop App, Codex 0.137 Multi-Agent v2, Claude Code Fallback Models, and Gemma 4 12B on the Local Stack

Description

Comments

1 month for 9 kr.

All episodes