Agents of Tomorrow

Ep4: AI Agents in Creativity and Design: Minecraft as a 3D Playground, Creating Dream Spaces and 3D Modeling

37 min · 18 nov 2024
aflevering Ep4: AI Agents in Creativity and Design: Minecraft as a 3D Playground, Creating Dream Spaces and 3D Modeling artwork

Beschrijving

In this episode, we dive into the groundbreaking world of AI agents transforming creativity and design. We start with navigating 3D environments in Minecraft [https://venturebeat.com/games/airis-is-a-learning-ai-teaching-itself-how-to-play-minecraft/], setting the stage for more complex real-world tasks. Then, we explore how AI agents are revolutionizing 3D modeling in Blender, bringing intricate designs to life. Finally, we delve into the fascinating applications in interior design, where spatial reasoning is used to create dream spaces. Subscribe and tune in to discover new agentic applications every week. Papers covered: SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code [https://arxiv.org/abs/2403.01248] by Ziniu Hu et. al. I-Design: Personalized LLM Interior Designer [https://arxiv.org/abs/2404.02838] by Ata Çelen et. al

Reacties

0

Wees de eerste die een reactie plaatst

Meld je nu aan en word lid van de Agents of Tomorrow community!

Probeer gratis

Probeer 14 dagen gratis

€ 9,99 / maand na proefperiode. · Elk moment opzegbaar.

  • Podcasts die je alleen op Podimo hoort
  • 20 uur luisterboeken / maand
  • Gratis podcasts

Alle afleveringen

5 afleveringen

aflevering Ep5: Multimodal AI Agents: Benchmarking, Adapting, and Adversarial attacks artwork

Ep5: Multimodal AI Agents: Benchmarking, Adapting, and Adversarial attacks

In this episode, we dive into the multimodal AI agents, starting with the recent release of runner H [https://x.com/hcompany_ai/status/1858907025436205278] and diving into groundbreaking research, including: 04:15 VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks by [https://arxiv.org/abs/2401.13649]Jing Yu Koh et. al [https://arxiv.org/search/cs?searchtype=author&query=Koh,+J+Y] 19:18 AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations by [https://arxiv.org/abs/2411.13451]Gaurav Verma et. al. [https://arxiv.org/search/cs?searchtype=author&query=Verma,+G] 32:32 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast by Xiangming Gu et. al. [https://arxiv.org/abs/2402.08567]

25 nov 202448 min
aflevering Ep4: AI Agents in Creativity and Design: Minecraft as a 3D Playground, Creating Dream Spaces and 3D Modeling artwork

Ep4: AI Agents in Creativity and Design: Minecraft as a 3D Playground, Creating Dream Spaces and 3D Modeling

In this episode, we dive into the groundbreaking world of AI agents transforming creativity and design. We start with navigating 3D environments in Minecraft [https://venturebeat.com/games/airis-is-a-learning-ai-teaching-itself-how-to-play-minecraft/], setting the stage for more complex real-world tasks. Then, we explore how AI agents are revolutionizing 3D modeling in Blender, bringing intricate designs to life. Finally, we delve into the fascinating applications in interior design, where spatial reasoning is used to create dream spaces. Subscribe and tune in to discover new agentic applications every week. Papers covered: SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code [https://arxiv.org/abs/2403.01248] by Ziniu Hu et. al. I-Design: Personalized LLM Interior Designer [https://arxiv.org/abs/2404.02838] by Ata Çelen et. al

18 nov 202437 min
aflevering Ep2: How AI Agents Are Shaping Hiring, Healthcare and Knowledge Work artwork

Ep2: How AI Agents Are Shaping Hiring, Healthcare and Knowledge Work

In this episode, we dive into LinkedIn’s use of AI agents for hiring, Oracle’s clinical AI agent, and review three papers on AI agents in knowledge work, including applications in machine learning and software engineering. Papers covered: * (03:56 - 07:07) WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks [https://arxiv.org/abs/2407.05291] by Léo Boisvert et al. [https://arxiv.org/search/cs?searchtype=author&query=Boisvert,+L] * (07:08 - 10:12) SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning [https://arxiv.org/abs/2410.17238] by Yizhou Chi et al. [https://arxiv.org/search/cs?searchtype=author&query=Chi,+Y] * (10:13 - 19:30) MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework [https://arxiv.org/abs/2308.00352] by Sirui Hong et al. [https://arxiv.org/search/cs?searchtype=author&query=Hong,+S]

4 nov 202419 min
aflevering Ep1 - Claude3.5, AgentForce, Copilot Studio, AgentC, Mixture-of-Agents, Agent-as-a-Judge, On the limits of agency artwork

Ep1 - Claude3.5, AgentForce, Copilot Studio, AgentC, Mixture-of-Agents, Agent-as-a-Judge, On the limits of agency

Applications covered: (0:00 - 5:15) Claude 3.5 - Anthropic AgentForce - Salesforce Copilot Studio - Microsoft AgentC - Celonis Papers covered: (5:16 - 19:28) Mixture-of-Agents Enhances Large Language Model Capabilities [https://arxiv.org/abs/2406.04692] by Junlin Wang [https://arxiv.org/search/cs?searchtype=author&query=Wang,+J] et al. (19:29 - 32:10) Agent-as-a-Judge: Evaluate Agents with Agents [https://arxiv.org/abs/2410.10934] by Mingchen Zhuge [https://arxiv.org/search/cs?searchtype=author&query=Zhuge,+M] et al. (32:11 - 42:44) On the limits of agency in agent-based models [https://arxiv.org/abs/2409.10568] by Ayush Chopra [https://arxiv.org/search/cs?searchtype=author&query=Chopra,+A] et al.

26 okt 202442 min