Braid
A frontier model gets called a step toward God in one window and a judgmental token-burner in the next. We spend the morning on the gap between the marketing altitude and the desk, and find the same thread running through everything: every layer now has a control surface someone's reaching for. * Dylan Field on Opus 4.8 [https://x.com/zoink/status/2060769829133721974] calls it "a very strange model" — honesty up, curiosity down, personality judgmental — a reminder that a tuning dial has costs you can feel. * scaling01 on DeepSWE [https://x.com/scaling01/status/2060768119941947699] says GPT-5.5 "score-, time- and token-mogged" Opus 4.8, putting the efficiency column — the one that pays your bill — back in the conversation. * Ben Kunkle on Zed's Zeta 2 [https://www.youtube.com/watch?v=phchDt63qAA] shows how a ten-second editing pause becomes a training label, and how a million frontier-model calls got replaced by a self-grading student model. * Philipp Schmid (DeepMind) [https://www.youtube.com/watch?v=3_gYbhABcAE] on the five assumptions that trip up senior engineers building agents — errors as inputs, evals not unit tests, and "build to delete." * Komi-learn [https://github.com/kurikomi-labs/komi-learn] and a year on knowledge-graph memory [https://www.reddit.com/r/AI_Agents/comments/1ts3nq2/i_spent_a_year_building_agent_memory_on_knowledge/] share one missing thing: a controlled before-and-after proving the memory layer, not the model, made the agent better. * A Lancet correspondence [https://www.forbes.com/sites/brucelee/2026/05/30/ai-fabricated-citations-in-over-2800-biomedical-journal-articles/] finds 4,046 fabricated references across 2,810 published articles — model honesty rising while the literature's integrity falls. * Quick hits: AMD's Lisa Su vs Nvidia's Jensen Huang on China [https://www.techmeme.com/260531/p7], IBM's Sovereign Core [https://www.forbes.com/sites/stevemcdowell/2026/05/30/ibms-agentic-operating-model-puts-sovereignty-at-the-center/], and a court ordering Circle to freeze a $12.6M contract [https://www.techmeme.com/260531/p3].
45 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de Braid!