Le Chaton Fat Just Broke the Internet...

Descripción

Why AI Benchmarks are Fake (And How to Actually Test Models) A fake French AI model recently went viral for beating the industry's top benchmarks, proving how easy it is to manipulate performance data. This video explains why you should stop chasing hype-filled charts and start evaluating AI based on your own real-world business workflows. 00:00 - Intro: The Le Chatton Fat Joke 01:08 - Why AI Benchmarks Can Lie 02:42 - The Problem with Self-Reported Tests 04:18 - Real Work is the Only Benchmark 05:20 - How to Avoid AI Overwhelm 06:34 - The New Way to Evaluate AI 07:31 - 3 Key Takeaways for AI Testing 08:45 - Testing AI Systems Yourself

Agent OS Just Changed AI Forever…

Agent Operating System Q&A: Model Restrictions, Local AI, Hermes Jarvis, SEO & Video AutomationThe episode answers community questions about the Agent Operating System and how to get the most from it, highlighting Hermes Oracle for news and SEO automation, Hermes Jarvis as a voice-activated agent, and managing all agents and client instances from one interface using separate profiles. It argues that gated frontier models like GPT 5.6 previews and the removal of Fable 5 matter less than building robust systems that can swap models in and out, including using alternatives like Fusion or Sakana Fugu and open-weight local options such as GLM 5.2. The host discusses a high-volume posting strategy with strict quality control, rising demand for local model setups for privacy and cost, automating video production via a video agent and avatar workflow training, using Paperclip for fact-checking and multi-agent teams, Windows support, SEO article generation and deployment, UI improvements to the Kanban board, and examples of AI-assisted game creation, while promoting the AI Profit Boardroom for daily Q&A, tutorials, coaching calls, and the Agent OS zip file.00:00 [https://www.youtube.com/watch?v=MnIH1VWLfpw] Agent OS Overview01:13 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=73s] Model Access Concerns03:16 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=196s] Posting Strategy Growth04:41 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=281s] Local Models for Clients05:36 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=336s] Managing Client Agents07:10 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=430s] Automating Video Creation07:57 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=477s] Fact Checking Workflow08:36 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=516s] Windows Install Support08:54 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=534s] Hermes Jarvis Voice Agent10:09 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=609s] Paperclip Social Team11:57 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=717s] Systems Over Models12:37 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=757s] Hermes Desktop vs OS13:28 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=808s] Using GLM in Claude Code15:21 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=921s] SEO Blog Automation16:15 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=975s] AI Game Development Demos18:09 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=1089s] Kanban Board Upgrade19:58 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=1198s] Join the Boardroom21:02 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=1262s] Easy Setup Testimonials22:14 [https://www.youtube.com/watch?v=MnIH1VWLfpw&t=1334s] Final Wrap Up

Ayer22 min

Le Chaton Fat Just Broke the Internet...

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios