AI Papers: A Deep Dive
TRAINING A TINY MODEL TO RUN THE PLUMBING BETWEEN AN AGENT AND THE WORLD Source: HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness [https://arxiv.org/abs/2606.12882] Paper was published on June 11, 2026 This episode was AI-generated on June 13, 2026. The script was written by an AI language model and the host voices were synthesized by Eleven Labs. The producer is not affiliated with Anthropic or Eleven Labs. What if the reason your AI agent fails isn't that the model is too dumb, but that it's drowning in its own context? This paper takes a frozen model — never retrained — and just by changing what flows in and out, raises success rates while cutting token costs by up to ninety percent. We dig into the elegant design, the surprising results, and where the headline numbers quietly oversell themselves. KEY TAKEAWAYS * Why the 'harness' — the plumbing between an LLM and the world — is a third axis of optimization, distinct from the model's intelligence and the task's difficulty * How a tiny 0.8-billion-parameter model learns to make two narrow judgment calls: what context the agent sees each turn, and which proposed actions to bounce back * The single best design idea in the paper: a gatekeeper that can only reject an action if it can quote specific evidence from the trajectory — 'no quote, no veto' — and defaults to letting questionable actions through * The reframe that the same frozen model fails in 52 wandering turns under one interface and succeeds in 18 under another, recasting 'capability failures' as interface failures * How a sloppy training diet produced a trigger-happy filter that rejected 37% of actions and performed worse than no harness at all — the behavior comes from the data, not the architecture * Where the 'matches or surpasses' framing overreaches: in-domain it's actually matches-to-slightly-down, results are single-run, and the token savings shrink when the baseline model is already efficient * 00:00 — The consultant at the door An analogy introduces the harness — the software that decides what reaches the model and what it sends back — and the paper's core question: why is it still hand-engineered? * 02:58 — What the harness actually is Precisely distinguishing the harness from prompt engineering and fine-tuning, and framing it as the 'transmission' between the engine and the road. * 05:56 — The incoming side: chief of staff How the observation projection produces a curated view over an intact transcript, making three-way keep/compress/drop calls and pinning a standing memo of the agent's live state. * 08:54 — The outgoing side: the evidence-bound bouncer How the action projection can only reject a proposed command by quoting trajectory evidence, and why defaulting to 'pass' is the hard part of building a gatekeeper. * 11:52 — One tiny model, two jobs Why a 0.8-billion-parameter model can handle these narrow judgment calls, and why curating roughly 5,400 clean examples is the real engineering. * 14:50 — The trigger-happy filter that backfired A cautionary experiment in which a sloppy training recipe produced a controller that rejected 37% of actions and scored below using no harness at all. * 17:48 — The results: same engine, better transmission The gained-tasks contrast (18 turns versus 52), the out-of-domain and cross-model transfer numbers, and what the controller learned to leave uncompressed without being told. * 20:46 — Where the framing reaches A critical look at the in-domain results, single-run variance, full-system token accounting, and the open question of whether the gains shrink as models get better at managing their own context.
136 Episoder
Kommentarer
0Vær den første til å kommentere
Registrer deg nå og bli medlem av AI Papers: A Deep Dive sitt community!