GenAI Learner

The Surprising Limits of RL in LLMs: Why Optimization Kills Deep Reasoning Capacity

14 min · 12. nov. 2025
episode The Surprising Limits of RL in LLMs: Why Optimization Kills Deep Reasoning Capacity cover

Beskrivelse

The Surprising Limits of RL in LLM Reasoning Arxiv: https://arxiv.org/pdf/2504.13837The promise of RL for LLM growth hits a wall: Tsinghua University's study shows RLVR only improves efficiency but is bounded by and does not elicit novel reasoning in base models—get the non-technical scoop on the "GenAI learner" podcast.

Kommentarer

0

Vær den første til å kommentere

Registrer deg nå og bli medlem av GenAI Learner sitt community!

Prøv gratis

Prøv gratis i 14 dager

99 kr / Måned etter prøveperioden. · Avslutt når som helst.

  • Eksklusive podkaster
  • 20 timer lydbøker i måneden
  • Gratis podkaster

Alle episoder

29 Episoder