Unzip
## Episode Summary

In this episode, we cover:

- **Auditing Multimodal LLM Raters: Central Tendency Bias in Clinical Ordinal Scoring** (Hugging Face Daily) - [Read more](https://huggingface.co/papers/2605.16386)
- **Evaluating Cognitive Age Alignment in Interactive AI Agents** (Hugging Face Daily) - [Read more](https://huggingface.co/papers/2605.17894)
- **DexHoldem: Playing Texas Hold'em with Dexterous Embodied System** (Hugging Face Daily) - [Read more](https://huggingface.co/papers/2605.18727)
- **SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science** (Hugging Face Daily) - [Read more](https://huggingface.co/papers/2605.18630)
- **AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs** (Hugging Face Daily) - [Read more](https://huggingface.co/papers/2605.15565)

---

*Sponsored by LimitLess AI*
82 episodes