GenAI Learner

The Surprising Limits of RL in LLMs: Why Optimization Kills Deep Reasoning Capacity

14 min · 12. nov. 2025
episode The Surprising Limits of RL in LLMs: Why Optimization Kills Deep Reasoning Capacity cover

Description

The Surprising Limits of RL in LLM Reasoning Arxiv: https://arxiv.org/pdf/2504.13837The promise of RL for LLM growth hits a wall: Tsinghua University's study shows RLVR only improves efficiency but is bounded by and does not elicit novel reasoning in base models—get the non-technical scoop on the "GenAI learner" podcast.

Comments

0

Be the first to comment

Sign up now and become a member of the GenAI Learner community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

29 episodes