AI Papers: A Deep Dive
TWO LEVERS FOR SELF-IMPROVING AI: WHEN REWRITING CODE ISN'T ENOUGH Source: SIA: Self Improving AI with Harness & Weight Updates [https://arxiv.org/abs/2605.27276] Paper was published on May 26, 2026 This episode was AI-generated on May 27, 2026. The script was written by an AI language model and the host voices were synthesized by Eleven Labs. The producer is not affiliated with Anthropic or Eleven Labs. An AI agent spent many iterations rewriting its own scaffolding to denoise genomic data and hit a wall. Then it was allowed to retrain its own weights — and on the first try, it added two trivial lines of code that any biologist would have spotted, cutting error by twenty percent. A new paper argues that scaffold edits and weight updates reach fundamentally different places, and that no self-improvement loop touching only one is going to be enough. KEY TAKEAWAYS * Why scaffold rewrites and weight updates are not interchangeable — they change different things (how the agent searches vs. what the model knows) * How SIA's Feedback-Agent reads full agent trajectories to decide which lever to pull, and even picks which RL algorithm to use * Concrete results across three deliberately different domains: Chinese legal classification, CUDA kernel optimization on H100s, and single-cell RNA-seq denoising * Why the headline 502% improvement is real but misleading — the mechanism claim is closer to a 20% gain over the harness-only ceiling * The 'coupled co-evolutionary Goodhart' failure mode the authors themselves flag: two optimizers converging on a verifier rather than the underlying problem * What the paper does and doesn't prove — a credible proof of concept, not a settled result, with clean verifiers doing more work than the framing admits * 00:00 — The two-line fix that broke a plateau An opening case study where a weight update found a trivial biological invariant that endless scaffold iteration had missed. * 03:08 — Two camps that haven't been talking Framing the field's split between scaffold-evolution work (Darwin Gödel Machine, AI Scientist) and test-time-training work, and the obvious question each camp's silence implies. * 06:17 — Inside the SIA architecture How the Meta-Agent, task agent, and Feedback-Agent fit together, and why giving the Feedback-Agent the full trajectory matters. * 09:26 — Three benchmarks, three shapes of expertise Walking through LawBench, CUDA kernel optimization, and RNA-seq denoising — and what each result implies about the harness ceiling. * 12:34 — Picking the RL algorithm on the fly Why the Feedback-Agent chooses between methods like GRPO and entropic advantage weighting based on the reward landscape, and what that automation does and doesn't prove. * 16:23 — The skeptic pass Where the ablations fall short, why the benchmark selection flatters the method, and how the abstract's biggest number answers a different question than the mechanism claim. * 18:53 — Coupled co-evolutionary Goodhart The deeper failure mode the authors themselves raise: two optimizers fitting each other rather than the underlying problem. * 22:00 — What this would mean if it generalizes Where the human role moves if specifying a task and a verifier is enough, and why that 'if' is still load-bearing. RECOMMENDED READING * Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents [https://arxiv.org/abs/2505.22954] — A leading example of the scaffold-evolution camp the episode contrasts with weight updates — the AI rewrites the code around a frozen model. * The Surprising Effectiveness of Test-Time Training for Abstract Reasoning [https://arxiv.org/abs/2411.07279] — Akyürek et al.'s test-time-training work, representing the opposite camp SIA tries to unify: leave the scaffolding alone and adapt the weights at inference. * DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models [https://arxiv.org/abs/2402.03300] — Introduces GRPO, the RL algorithm the Feedback-Agent picks for the LawBench task — useful background for the algorithm-selection discussion. * The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery [https://arxiv.org/abs/2408.06292] — Another reference point in the scaffold-iteration lineage SIA positions itself against, where an LLM orchestrates research without touching its own weights.
88 episoder
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af AI Papers: A Deep Dive-fællesskabet!