AI News Today | Julian Goldie Podcast
LongCat 2.0 (Open Source) Tested: Benchmarks, Games, and GLM 5.2 Comparison

The episode covers the official release of LongCat 2.0, an open-source Chinese agentic model revealed as the model behind the AoAlpha free API, with features like Sparse Attention, Zero Compute Experts, and MIPD. The host reviews benchmark claims (including Terminal Bench 2.1 and SWE-Bench Pro comparisons versus GPT-5.5 and Opus 4.8) and shares hands-on tests building game demos such as Dragon Realm, a Skyrim-style open world, and VoxelCraft, noting mixed results and frequent bugs. Access issues are mentioned, including difficulty using the API without a Chinese setup, so the model is tested via the website chat. A key point is that LongCat was trained on China’s Meituan chips without NVIDIA. Overall, GLM 5.2 is judged stronger in side-by-side game benchmarks, and the host promotes the AI Profit Boardroom and Agent OS setup.

00:00 [https://www.youtube.com/watch?v=60es_aKUcBU] LongCat 2.0 Launch
00:36 [https://www.youtube.com/watch?v=60es_aKUcBU&t=36s] Benchmarks and API Hurdles
01:38 [https://www.youtube.com/watch?v=60es_aKUcBU&t=98s] Game Demos Dragon Realm
02:23 [https://www.youtube.com/watch?v=60es_aKUcBU&t=143s] Goldy Bench Verdict
02:43 [https://www.youtube.com/watch?v=60es_aKUcBU&t=163s] Trained Without NVIDIA
03:32 [https://www.youtube.com/watch?v=60es_aKUcBU&t=212s] How to Use It
03:51 [https://www.youtube.com/watch?v=60es_aKUcBU&t=231s] Eval Results vs GPT
04:17 [https://www.youtube.com/watch?v=60es_aKUcBU&t=257s] GLM 5.2 Showdown
06:13 [https://www.youtube.com/watch?v=60es_aKUcBU&t=373s] Final Take and Recommendation
06:35 [https://www.youtube.com/watch?v=60es_aKUcBU&t=395s] Agent OS and Boardroom Plug
07:37 [https://www.youtube.com/watch?v=60es_aKUcBU&t=457s] Wrap Up
535 episodes