The Immersive Lens
Welcome to The Immersive Lens - the show where emerging tech meets creativity, education, and real-world impact. Hosted by Paul Engin (Professor of New Media) and Dave Ghidiu (Professor of Computer Science) at Finger Lakes Community College, this podcast dives into the technologies and creative innovations reshaping how we live, work, and learn - from AI and augmented reality to the future of media, design, and learning.

In this episode of The Immersive Lens, Paul, Dave, and Jeff tackle LLM benchmarking. The crew dives deep into the rapidly evolving landscape of generative AI by running a series of real-time benchmark tests on leading large language models, including Google Gemini, OpenAI's ChatGPT, Anthropic's Claude, and Grok. The conversation kicks off with Dave sharing his deep-dive research methodology, which reveals how radically these tools differ when executing massive, multi-source literature syntheses. The team uses these insights to explore how foundational shifts in AI architecture are changing the way we interact with data, moving from simple text generation to proactive research agents. As the episode progresses, Paul, Dave, and Jeff put these bots through their paces with a gauntlet of complex logic puzzles, strict negative constraints, and historically inaccurate prompts. The resulting performance gaps highlight the stark contrast between models that prioritize accuracy and those that default to confident hallucinations. Ultimately, the team concludes that while AI capabilities are accelerating at breakneck speed, users must remain highly critical of the output, as even the most advanced models still struggle with basic boundaries and truth-telling.
🔗 Links & Resources:
Claude Cowork by Anthropic - https://www.anthropic.com/product/claude-cowork
Codex is becoming a productivity tool for everyone - https://openai.com/index/codex-for-knowledge-work/
Introducing TRIBE v2 - https://ai.meta.com/blog/tribe-v2-brain-predictive-foundation-model/
NVIDIA DGX Spark - https://www.nvidia.com/en-us/products/workstations/dgx-spark/
Perplexity Computer - https://www.perplexity.ai/personal-computer
Wikipedia: Gadsby (novel) - https://en.wikipedia.org/wiki/Gadsby_(novel)
Anthropic will pay xAI $1.25B per month for compute - https://techcrunch.com/2026/05/20/anthropic-will-pay-xai-1-25-billion-per-month-for-compute/

Mentions: @OpenAI @Google @AnthropicAI @X @Meta @Perplexity

FLX AI Hub: https://www.flcc.edu/ai/
FLCC New Media: https://newmedia.csc.flcc.cloud/

#LargeLanguageModels #ChatGPT #GoogleGemini #ClaudeAI #Grok #AIBenchmarks #TechHumor #SpatialReasoning #AIHallucinations #ChatbotShowdown #AI #EdTech #GenerativeAI #HigherEd #Podcast #FLCC #NewMedia #TechTrends #TechHistory #TheImmersiveLens

✅ Subscribe so you never miss an episode, and share with a friend who loves tech, creativity, or the future of education.

Recorded at Finger Lakes Community College in the heart of New York's Finger Lakes region.
https://www.flcc.edu/
https://www.theimmersivelens.com/

Stay curious. Stay connected. Thanks for looking through The Immersive Lens.
26 Episodes