M365.FM - Modern work, security, and productivity with Microsoft 365

The Latency Wall: Why Your Cloud Strategy Fails at the Edge

1 h 20 min · 12. Juni 2026
Episode The Latency Wall: Why Your Cloud Strategy Fails at the Edge Cover

Beschreibung

For years, organizations have followed a simple rule: move everything to the cloud.The strategy worked brilliantly for collaboration, analytics, business intelligence, and productivity workloads. Microsoft 365, Azure, Power BI, Teams, and modern cloud platforms transformed how organizations operate.But a growing number of industries are discovering a hard reality.Physics doesn't care about your cloud strategy.When robots, autonomous vehicles, computer vision systems, industrial sensors, healthcare devices, and critical infrastructure require responses measured in milliseconds, traditional cloud architectures hit an unavoidable barrier: the Latency Wall.In this episode, we explore why centralized cloud architectures struggle at the edge, why bandwidth isn't the answer, and how organizations are redesigning their technology platforms around private 5G, Multi-Access Edge Computing (MEC), Azure Stack Edge, Azure Arc, and sovereign edge architectures.If your future includes AI, automation, robotics, manufacturing, logistics, healthcare, energy, or industrial IoT, this episode explains why the next phase of digital transformation is happening closer to the data than ever before. WHY THE CLOUD BREAKS WHEN MILLISECONDS MATTER Most enterprise systems were designed around humans.Humans tolerate delay.A dashboard that loads in a few seconds feels fast.A chatbot that responds in under a second feels instant.An analytics report that refreshes in a minute is perfectly acceptable.Machines don't think that way.A robotic arm operating on a production line may require updates every few milliseconds.A computer vision system inspecting defects has fractions of a second to react.An autonomous guided vehicle navigating a warehouse cannot wait hundreds of milliseconds for instructions from a distant cloud region.The challenge isn't cloud performance.The challenge is physics.This episode explores the science of latency, jitter, determinism, and why distance creates a hard limit that no cloud provider can eliminate. THE PHYSICS OF LATENCY Every cloud strategy ultimately runs into the same constraint.Data must travel.Even at the speed of light, distance creates delay.As organizations connect factories, warehouses, hospitals, ports, mines, energy grids, and autonomous systems to cloud platforms, latency becomes an architectural problem rather than a networking problem.We discuss: * Why latency and jitter matter more than bandwidth * Deterministic versus best-effort networking * Real-world control loop requirements * The impact of packet loss and network variability * Why cloud optimization cannot overcome physical distance Understanding these concepts is critical for modern architects designing real-time systems. INDUSTRIES HITTING THE LATENCY WALL The edge is no longer a niche concept.Across every sector, organizations are discovering workloads that cannot depend on centralized cloud architectures.This episode examines real-world examples from: * Manufacturing and industrial automation * Logistics and warehouse robotics * Healthcare and patient telemetry * Energy and utilities * Mining operations * Smart ports and maritime logistics * Retail automation * Autonomous transportation Each industry faces different challenges, but the underlying problem remains the same: critical decisions must happen locally. THE OLD CLOUD MODEL VS THE NEW EDGE MODEL For decades, enterprise architecture followed a hub-and-spoke model.Data flowed to the cloud.The cloud made decisions.The edge executed instructions.That model is changing.The modern edge architecture places intelligence closer to the source of the data.Instead of sending every sensor reading, image, and event to a distant cloud region, organizations process information locally and send only insights, exceptions, and analytics upstream.We explore: * Edge-first architectures * Distributed intelligence * Local decision-making * Autonomous operations * Resilient offline systems * Real-time control loops The result is a fundamental inversion of traditional cloud thinking. PRIVATE 5G EXPLAINED Many organizations think 5G is simply faster wireless networking.Enterprise private 5G is something very different.It provides deterministic connectivity designed specifically for industrial and mission-critical environments.In this episode, we explain: * Private 5G architecture * Network slicing * Ultra-Reliable Low-Latency Communications (URLLC) * SIM-based security * Mobility management * Quality of Service (QoS) * Deterministic networking You'll learn why private 5G is becoming a foundational technology for modern industrial environments. AZURE PRIVATE 5G CORE AND AZURE STACK EDGE Microsoft's answer to the edge challenge combines networking, compute, AI, and cloud management into a unified platform.We take a deep dive into: * Azure Private 5G Core * Azure Stack Edge * Azure Arc * Azure Network Function Manager * Edge AI * Local inference * Sovereign deployments * Hybrid cloud architectures Discover how Microsoft enables organizations to run cloud services locally while maintaining centralized governance and management. MULTI-ACCESS EDGE COMPUTING (MEC) Private 5G alone doesn't solve the problem.Applications still need compute resources close to the workload.This is where Multi-Access Edge Computing comes in.We explore how MEC enables: * Real-time AI inference * Computer vision workloads * Predictive maintenance * Digital twins * Autonomous systems * Edge analytics * Low-latency application hosting The combination of MEC and private 5G creates a platform capable of supporting next-generation industrial applications. THE EVENT-REASONING-ORCHESTRATION MODEL One of the most important concepts in this episode is a new way of thinking about intelligence at the edge.Instead of sending every event to the cloud, the edge becomes responsible for:Event DetectionCapturing data directly from sensors, cameras, machines, and devices.Local ReasoningRunning AI models and analytics locally.Immediate OrchestrationTaking action in real time without waiting for cloud responses.The cloud remains essential for governance, reporting, model training, and enterprise-wide intelligence, but the milliseconds that matter stay local. THE BUSINESS CASE FOR THE EDGE Edge computing isn't just about performance.It's also about economics.We explore real-world research showing how organizations achieve measurable returns through: * Reduced downtime * Predictive maintenance * Automated quality inspection * Energy optimization * Autonomous logistics * Flexible manufacturing * Reduced networking costs You'll learn why some organizations are seeing extraordinary returns from private 5G and edge computing investments. DATA SOVEREIGNTY AND REGULATORY COMPLIANCE Latency isn't the only reason organizations are moving workloads closer to the edge.Data sovereignty is becoming equally important.This episode explores: * GDPR * NIS2 * The EU AI Act * The Data Act * DORA * National data residency requirements * Sovereign cloud architectures Learn why compliance requirements are reshaping enterprise architecture and accelerating investment in local processing capabilities. SECURITY AT THE EDGE Edge environments introduce new security challenges and opportunities.We discuss: * Zero Trust architectures * SIM-based authentication * Identity-driven networking * IEC 62443 * Operational Technology (OT) security * Microsoft Defender integration * Edge security monitoring * Secure AI deployments Security must evolve alongside edge infrastructure. THE CONVERGED FUTURE OF WI-FI 7 AND PRIVATE 5G The future isn't Wi-Fi versus 5G.The future is both.Organizations are increasingly adopting converged networking strategies where: * Wi-Fi 7 supports knowledge workers * Private 5G supports operational technology * Azure Arc provides unified management * Applications automatically use the best network available This converged model is rapidly becoming the standard architecture for enterprise environments. BUILDING YOUR EDGE STRATEGY For architects, technology leaders, and decision-makers, the question is no longer whether edge computing matters.The question is where the latency wall exists within your organization.We provide a practical roadmap covering: * Pilot projects * Platform selection * Governance models * Data foundations * Organizational change * Edge Centers of Excellence * Scaling strategies * Operational readiness Understanding these principles is essential for the next generation of cloud and AI architectures. Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

Kommentare

0

Sei die erste Person, die kommentiert

Melde dich jetzt an und werde Teil der M365.FM - Modern work, security, and productivity with Microsoft 365-Community!

Loslegen

2 Monate für 1 €

Dann 4,99 € / Monat · Jederzeit kündbar.

  • Podcasts nur bei Podimo
  • 20 Stunden Hörbücher / Monat
  • Alle kostenlosen Podcasts

Alle Folgen

648 Folgen

Episode The Latency Wall: Why Your Cloud Strategy Fails at the Edge Cover

The Latency Wall: Why Your Cloud Strategy Fails at the Edge

For years, organizations have followed a simple rule: move everything to the cloud.The strategy worked brilliantly for collaboration, analytics, business intelligence, and productivity workloads. Microsoft 365, Azure, Power BI, Teams, and modern cloud platforms transformed how organizations operate.But a growing number of industries are discovering a hard reality.Physics doesn't care about your cloud strategy.When robots, autonomous vehicles, computer vision systems, industrial sensors, healthcare devices, and critical infrastructure require responses measured in milliseconds, traditional cloud architectures hit an unavoidable barrier: the Latency Wall.In this episode, we explore why centralized cloud architectures struggle at the edge, why bandwidth isn't the answer, and how organizations are redesigning their technology platforms around private 5G, Multi-Access Edge Computing (MEC), Azure Stack Edge, Azure Arc, and sovereign edge architectures.If your future includes AI, automation, robotics, manufacturing, logistics, healthcare, energy, or industrial IoT, this episode explains why the next phase of digital transformation is happening closer to the data than ever before. WHY THE CLOUD BREAKS WHEN MILLISECONDS MATTER Most enterprise systems were designed around humans.Humans tolerate delay.A dashboard that loads in a few seconds feels fast.A chatbot that responds in under a second feels instant.An analytics report that refreshes in a minute is perfectly acceptable.Machines don't think that way.A robotic arm operating on a production line may require updates every few milliseconds.A computer vision system inspecting defects has fractions of a second to react.An autonomous guided vehicle navigating a warehouse cannot wait hundreds of milliseconds for instructions from a distant cloud region.The challenge isn't cloud performance.The challenge is physics.This episode explores the science of latency, jitter, determinism, and why distance creates a hard limit that no cloud provider can eliminate. THE PHYSICS OF LATENCY Every cloud strategy ultimately runs into the same constraint.Data must travel.Even at the speed of light, distance creates delay.As organizations connect factories, warehouses, hospitals, ports, mines, energy grids, and autonomous systems to cloud platforms, latency becomes an architectural problem rather than a networking problem.We discuss: * Why latency and jitter matter more than bandwidth * Deterministic versus best-effort networking * Real-world control loop requirements * The impact of packet loss and network variability * Why cloud optimization cannot overcome physical distance Understanding these concepts is critical for modern architects designing real-time systems. INDUSTRIES HITTING THE LATENCY WALL The edge is no longer a niche concept.Across every sector, organizations are discovering workloads that cannot depend on centralized cloud architectures.This episode examines real-world examples from: * Manufacturing and industrial automation * Logistics and warehouse robotics * Healthcare and patient telemetry * Energy and utilities * Mining operations * Smart ports and maritime logistics * Retail automation * Autonomous transportation Each industry faces different challenges, but the underlying problem remains the same: critical decisions must happen locally. THE OLD CLOUD MODEL VS THE NEW EDGE MODEL For decades, enterprise architecture followed a hub-and-spoke model.Data flowed to the cloud.The cloud made decisions.The edge executed instructions.That model is changing.The modern edge architecture places intelligence closer to the source of the data.Instead of sending every sensor reading, image, and event to a distant cloud region, organizations process information locally and send only insights, exceptions, and analytics upstream.We explore: * Edge-first architectures * Distributed intelligence * Local decision-making * Autonomous operations * Resilient offline systems * Real-time control loops The result is a fundamental inversion of traditional cloud thinking. PRIVATE 5G EXPLAINED Many organizations think 5G is simply faster wireless networking.Enterprise private 5G is something very different.It provides deterministic connectivity designed specifically for industrial and mission-critical environments.In this episode, we explain: * Private 5G architecture * Network slicing * Ultra-Reliable Low-Latency Communications (URLLC) * SIM-based security * Mobility management * Quality of Service (QoS) * Deterministic networking You'll learn why private 5G is becoming a foundational technology for modern industrial environments. AZURE PRIVATE 5G CORE AND AZURE STACK EDGE Microsoft's answer to the edge challenge combines networking, compute, AI, and cloud management into a unified platform.We take a deep dive into: * Azure Private 5G Core * Azure Stack Edge * Azure Arc * Azure Network Function Manager * Edge AI * Local inference * Sovereign deployments * Hybrid cloud architectures Discover how Microsoft enables organizations to run cloud services locally while maintaining centralized governance and management. MULTI-ACCESS EDGE COMPUTING (MEC) Private 5G alone doesn't solve the problem.Applications still need compute resources close to the workload.This is where Multi-Access Edge Computing comes in.We explore how MEC enables: * Real-time AI inference * Computer vision workloads * Predictive maintenance * Digital twins * Autonomous systems * Edge analytics * Low-latency application hosting The combination of MEC and private 5G creates a platform capable of supporting next-generation industrial applications. THE EVENT-REASONING-ORCHESTRATION MODEL One of the most important concepts in this episode is a new way of thinking about intelligence at the edge.Instead of sending every event to the cloud, the edge becomes responsible for:Event DetectionCapturing data directly from sensors, cameras, machines, and devices.Local ReasoningRunning AI models and analytics locally.Immediate OrchestrationTaking action in real time without waiting for cloud responses.The cloud remains essential for governance, reporting, model training, and enterprise-wide intelligence, but the milliseconds that matter stay local. THE BUSINESS CASE FOR THE EDGE Edge computing isn't just about performance.It's also about economics.We explore real-world research showing how organizations achieve measurable returns through: * Reduced downtime * Predictive maintenance * Automated quality inspection * Energy optimization * Autonomous logistics * Flexible manufacturing * Reduced networking costs You'll learn why some organizations are seeing extraordinary returns from private 5G and edge computing investments. DATA SOVEREIGNTY AND REGULATORY COMPLIANCE Latency isn't the only reason organizations are moving workloads closer to the edge.Data sovereignty is becoming equally important.This episode explores: * GDPR * NIS2 * The EU AI Act * The Data Act * DORA * National data residency requirements * Sovereign cloud architectures Learn why compliance requirements are reshaping enterprise architecture and accelerating investment in local processing capabilities. SECURITY AT THE EDGE Edge environments introduce new security challenges and opportunities.We discuss: * Zero Trust architectures * SIM-based authentication * Identity-driven networking * IEC 62443 * Operational Technology (OT) security * Microsoft Defender integration * Edge security monitoring * Secure AI deployments Security must evolve alongside edge infrastructure. THE CONVERGED FUTURE OF WI-FI 7 AND PRIVATE 5G The future isn't Wi-Fi versus 5G.The future is both.Organizations are increasingly adopting converged networking strategies where: * Wi-Fi 7 supports knowledge workers * Private 5G supports operational technology * Azure Arc provides unified management * Applications automatically use the best network available This converged model is rapidly becoming the standard architecture for enterprise environments. BUILDING YOUR EDGE STRATEGY For architects, technology leaders, and decision-makers, the question is no longer whether edge computing matters.The question is where the latency wall exists within your organization.We provide a practical roadmap covering: * Pilot projects * Platform selection * Governance models * Data foundations * Organizational change * Edge Centers of Excellence * Scaling strategies * Operational readiness Understanding these principles is essential for the next generation of cloud and AI architectures. Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

12. Juni 20261 h 20 min
Episode Infrastructure as Code, DevOps & the Future of Azure with Maik van der Gaag [MVP] Cover

Infrastructure as Code, DevOps & the Future of Azure with Maik van der Gaag [MVP]

What does it really take to build secure, scalable, and automated cloud environments in Microsoft Azure? In this episode of M365 FM, Mirko Peters sits down with Microsoft Azure MVP Maik van der Gaag to explore Infrastructure as Code, DevOps culture, Terraform, Bicep, GitHub, Azure automation, cloud governance, and the growing impact of AI on modern platform engineering. Drawing from more than 15 years of experience helping organizations modernize their technology landscapes, Maik shares practical lessons from real-world cloud transformations, enterprise Azure deployments, and large-scale automation projects. The conversation starts with Maik's journey from traditional software development and SharePoint projects into Azure cloud architecture, eventually becoming CTO at 3fifty and later Head of Technology for the Microsoft business at Data Balance. Along the way, he reflects on building technical communities, organizing user groups, and what he has learned from years of helping professionals navigate the rapidly changing cloud landscape. THE STATE OF AZURE, CLOUD & HYBRID INFRASTRUCTURE As organizations continue to evaluate cloud-first strategies, Maik discusses the shift he is seeing toward hybrid cloud and sovereign cloud models. While many organizations remain committed to Microsoft Azure, others are balancing public cloud investments with private datacenters and local infrastructure. The discussion explores how geopolitical concerns, compliance requirements, and business continuity planning are influencing modern cloud architecture decisions. Key takeaways: * Why hybrid cloud is growing again * The rise of sovereign cloud discussions * Azure versus on-premises infrastructure * Cloud transformation challenges * Enterprise cloud strategy trends * Security considerations for modern workloads INFRASTRUCTURE AS CODE EXPLAINED  Infrastructure as Code (IaC) has become one of the most important practices in cloud engineering. Maik breaks down the concept in simple terms, explaining how infrastructure can be represented as code, version-controlled, automated, and deployed consistently across environments. Rather than manually creating virtual machines, databases, networking components, and cloud resources, organizations can define their entire environment through reusable code. This approach reduces human error, improves consistency, accelerates deployments, and creates repeatable infrastructure patterns across development, testing, and production environments. Topics covered: * What Infrastructure as Code actually means * Why manual deployments create problems * Reducing configuration drift * Version control for infrastructure * Automation and repeatability * Cost savings through standardization TERRAFORM VS BICEP One of the most practical parts of the discussion focuses on Terraform and Microsoft Bicep. Maik explains the strengths and weaknesses of both approaches and why the right choice depends heavily on organizational requirements. While Bicep offers a streamlined Azure-focused experience and serves as an abstraction layer for ARM templates, Terraform provides multi-cloud flexibility across Azure, AWS, Google Cloud, Cloudflare, and many other platforms. The conversation also explores state management, extensibility, and the growing capabilities of modern Infrastructure as Code tooling. Key takeaways: * Terraform vs Bicep * ARM templates and Azure deployments * State management concepts * Multi-cloud infrastructure strategies * Infrastructure extensibility * Choosing the right tool for your organization DEVOPS IS NOT A TOOL One of the strongest messages from this episode is Maik's belief that DevOps is fundamentally about culture, processes, and collaboration rather than technology alone. Many organizations mistakenly focus on tools while ignoring the organizational changes required to achieve DevOps success. Maik explains why successful DevOps teams combine developers, operations professionals, security experts, and business stakeholders into integrated teams focused on delivering value. The discussion also covers Azure DevOps, GitHub Enterprise, GitOps, DevSecOps, and how organizations can build more effective engineering cultures.  Topics discussed: * DevOps as culture versus technology * Why organizations struggle with DevOps * Azure DevOps vs GitHub * GitOps explained * DevSecOps principles * Building self-organizing teams SECURITY, GOVERNANCE & SECRETS MANAGEMENT Security remains a recurring theme throughout the conversation. Maik highlights one of the most common mistakes organizations make when moving to Azure: assuming cloud environments are automatically secure. The episode explores identity management, Microsoft Entra ID, MFA, Key Vault, managed identities, federated credentials, GitHub Actions, governance strategies, and best practices for protecting enterprise cloud environments. Key takeaways: * Azure security fundamentals * Managing secrets securely * Microsoft Entra ID considerations * Key Vault best practices * Federated identity credentials * Cloud governance and compliance AI, GITHUB COPILOT & THE FUTURE OF CLOUD ENGINEERING Artificial Intelligence is impacting every area of technology, including cloud engineering and Infrastructure as Code. Maik shares how GitHub Copilot and AI-assisted development have dramatically accelerated his daily work. Rather than writing every Terraform or Bicep template manually, AI can generate infrastructure code in seconds. However, Maik stresses a critical point: engineers must still understand, validate, and review every line of AI-generated code. Organizations that blindly trust AI outputs risk introducing security issues, configuration errors, and operational challenges. The discussion covers practical AI adoption, prompt engineering, code validation, AI governance, and how engineers can use AI responsibly without losing critical technical expertise.  Topics covered: * GitHub Copilot for Infrastructure as Code * AI-assisted cloud engineering * Validating AI-generated code * Prompt engineering techniques * Responsible AI adoption * Future skills for cloud professionals CAREER ADVICE FOR CLOUD ENGINEERS The episode concludes with practical advice for professionals looking to start their Infrastructure as Code journey. Maik explains why understanding the "why" behind automation matters more than simply learning a tool and shares recommendations for choosing between Terraform and Bicep based on organizational needs. His final message is simple but powerful: do the things you love, stay engaged with the community, continue learning, and never assume technology is as easy as it first appears. Whether you're a Cloud Architect, Azure Administrator, DevOps Engineer, Platform Engineer, Security Professional, Infrastructure Engineer, IT Consultant, Microsoft MVP, or technology leader, this episode delivers valuable insights into the technologies, practices, and mindsets shaping the future of cloud computing. Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

Gestern52 min
Episode How to Architect Low-Cost AI Agents in the Microsoft Cloud Cover

How to Architect Low-Cost AI Agents in the Microsoft Cloud

Most organizations think their AI costs are driven by model pricing.They're wrong.The biggest cost problems in Microsoft AI environments often have nothing to do with GPT-5, Azure OpenAI, or Copilot licensing. Instead, they come from hidden architectural decisions that quietly multiply costs behind the scenes.In this episode, we break down the real economics of building AI agents in Microsoft Azure, Microsoft 365, Copilot Studio, and Azure AI Foundry. You'll learn why some organizations spend thousands of dollars per month on AI while others deliver the same business outcomes for a fraction of the cost.We explore the three hidden taxes affecting nearly every enterprise AI deployment: the Context Tax, the Reasoning Tax, and the Autonomous Tax. Together, these invisible costs can turn a successful proof-of-concept into a budget crisis.More importantly, you'll learn how to eliminate them. THE PROMISE VS THE INVOICE Microsoft has made AI easier to deploy than ever before.Copilot appears inside Teams, Outlook, Word, PowerPoint, and Microsoft 365. Azure AI Foundry simplifies model deployment. Copilot Studio allows low-code agent development. Power Platform integrates AI into business processes.But simplicity often hides complexity.The moment you build a custom Copilot Studio agent, connect SharePoint knowledge sources, invoke Azure OpenAI models, or trigger autonomous workflows, you enter a world of consumption billing where every token, action, and retrieval operation has a cost.In this episode, we uncover how Microsoft's AI billing layers actually work and why understanding them is the foundation of any successful AI architecture. THE THREE HIDDEN TAXES OF ENTERPRISE AI Most organizations unknowingly pay three separate AI taxes.The Context TaxPoor retrieval design floods prompts with irrelevant content.Instead of retrieving only the information needed to answer a question, many RAG implementations pull dozens of documents into the prompt, dramatically increasing token consumption while often reducing answer quality.The Reasoning TaxMany organizations route every request to their most expensive model.Simple FAQ requests, classifications, and summarizations frequently run on frontier models when smaller and cheaper models could deliver identical outcomes.The Autonomous TaxAutonomous agents never sleep.Background workflows, Graph grounding, Power Automate actions, and event-driven agents continue consuming credits long after employees have logged off.When these three taxes combine, AI spending can spiral out of control. UNDERSTANDING COPILOT STUDIO COSTS Copilot Studio has become one of the most powerful tools in the Microsoft ecosystem.It also introduces new consumption models that many organizations underestimate.We discuss: * Copilot Credits * Capacity Packs * Pay-As-You-Go billing * Graph Grounding costs * Agent actions * Autonomous triggers * AI Builder transitions * The November 2026 licensing changes Understanding these mechanics is essential before deploying large-scale business agents. THE NOVEMBER 2026 AI BUILDER DEADLINE One of the most important dates in Microsoft's AI roadmap arrives on November 1st, 2026.On that date, seeded AI Builder credits disappear.Organizations currently relying on included AI Builder capacity may discover that previously "free" AI workloads suddenly become billable.We explain: * What changes in November 2026 * Which workloads are affected * How to prepare before the deadline * Why many organizations could face unexpected costs * How to build a transition strategy today THE COST ARCHITECTURE FRAMEWORK Reducing AI costs isn't about buying cheaper models.It's about designing better architectures.The framework discussed in this episode focuses on four core engineering principles:Semantic CachingAvoid generating answers that already exist.Using Azure API Management and vector similarity search, organizations can dramatically reduce repeat LLM calls while improving response times.Prompt CompressionMost prompts are larger than they need to be.We explore Microsoft's LLMLingua framework and how prompt compression can reduce token consumption without reducing answer quality.Model RoutingNot every request deserves GPT-5.Azure AI Foundry's Model Router enables intelligent routing between GPT-5 Nano, GPT-5 Mini, and larger frontier models based on task complexity.Capacity OptimizationLearn when Pay-As-You-Go pricing makes sense and when Provisioned Throughput Units (PTUs) become financially attractive. AZURE AI FOUNDRY AND MODEL ROUTING One of the most exciting developments in Microsoft's AI stack is model routing.Instead of selecting a single model for every task, organizations can allow the platform to automatically choose the most cost-effective model for each request.We explore: * GPT-5 Global * GPT-5 Mini * GPT-5 Nano * Azure AI Foundry Model Router * Multi-model architectures * Cost optimization strategies * Enterprise deployment patterns The result is often substantial cost reductions with little or no impact on user experience. AZURE COST MANAGEMENT FOR AI You can't optimize what you can't measure.This episode walks through practical techniques for monitoring AI costs using: * Azure Cost Management * Azure Monitor * Log Analytics * Kusto Query Language (KQL) * Azure Copilot * Resource Tagging * Cost Classification Frameworks Learn how to identify cost anomalies before they become budget problems. BUILDING A GOVERNANCE MODEL FOR AI Technology alone won't solve cost challenges.Organizations need governance.We discuss: * Cost Classes (Gold, Silver, Bronze) * Chargeback Models * Platform Team Responsibilities * Citizen Developer Governance * Budget Controls * Consumption Caps * AI Service Catalogs * Quarterly Review Processes Without governance, cost optimization efforts rarely survive long-term. THE 90-DAY IMPLEMENTATION ROADMAP To help organizations move from theory to execution, this episode presents a practical 90-day roadmap.Days 1–30: AuditGain visibility into your AI costs.Days 31–60: Quick WinsDeploy caching, retrieval optimization, and budget controls.Days 61–90: Architecture TransformationImplement compression, model routing, governance, and long-term optimization.The roadmap provides a practical path toward sustainable AI economics. REAL-WORLD CASE STUDY We conclude with a detailed case study showing how a support agent architecture was redesigned using the techniques discussed throughout the episode.The results demonstrate how: * Retrieval optimization reduced prompt size * Semantic caching eliminated redundant requests * Model routing lowered inference costs * Governance prevented future cost drift The outcome was a dramatic reduction in operating costs while maintaining service quality and user satisfaction. WHO SHOULD LISTEN? This episode is designed for: * Microsoft 365 Administrators * Copilot Administrators * Azure Architects * Enterprise Architects * IT Leaders * CIOs * CTOs * AI Engineers * Platform Engineers * Power Platform Professionals * Copilot Studio Developers * FinOps Teams * Cloud Financial Management Teams * Security & Governance Professionals If you're building AI solutions on Microsoft technologies, this episode provides a practical blueprint for controlling costs without sacrificing innovation. Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

Gestern1 h 23 min
Episode Copilot Studio, Dataverse MCP & The Future of Agentic AI in Microsoft 365 with Nathan Rose [MVP] Cover

Copilot Studio, Dataverse MCP & The Future of Agentic AI in Microsoft 365 with Nathan Rose [MVP]

The Microsoft AI landscape is evolving at an incredible pace, and few people are closer to the transformation than Microsoft Business Applications MVP Nathan Rose. In this episode of M365 FM, host Mirko Peters welcomes Nathan for an in-depth conversation about Copilot Studio, Dataverse MCP (Model Context Protocol), Business Skills, Agentic AI, Microsoft 365 Copilot, and the future of intelligent business applications across the Microsoft ecosystem.Nathan shares his journey from the early Microsoft Dynamics CRM 2011 days to becoming a leading Power Platform Solution Architect and community voice. Along the way, he explains how the transition from traditional low-code development to AI-powered application development is reshaping careers, organizations, and enterprise software architecture. For anyone working with Microsoft 365, Power Platform, Dynamics 365, Azure AI, Copilot Studio, Dataverse, or Microsoft Copilot, this episode provides valuable insights into where the industry is heading. THE EVOLUTION FROM LOW-CODE TO AGENTIC AI The conversation begins with Nathan's experience in the Microsoft Power Platform community and how low-code tools such as Power Apps, Power Automate, Dataverse, and Power Virtual Agents opened the door for people from non-traditional technical backgrounds. As AI becomes increasingly integrated into Microsoft's platform strategy, Nathan explains why organizations are moving beyond traditional workflows and into a new era of Agentic AI.Rather than simply automating predefined processes, modern AI agents can reason, make decisions, discover tools, interact with business data, and perform complex actions autonomously. Nathan discusses why Copilot Studio is becoming one of the most important platforms in the Microsoft ecosystem and how natural language is rapidly replacing traditional development approaches.Key topics include: * Low-code vs Agentic AI * Copilot Studio evolution * Microsoft Power Platform innovation * AI-powered business applications * Prompt engineering and AI workflows * Future skills for Microsoft professionals WHAT IS DATAVERSE MCP AND WHY DOES IT MATTER? One of the most valuable parts of the discussion focuses on Dataverse MCP (Model Context Protocol), one of Microsoft's most exciting new technologies for enterprise AI solutions.Nathan explains why MCP should not simply be viewed as "the new API." Instead, MCP enables AI agents to understand context, discover capabilities, reason about data, and dynamically select the tools needed to complete a task. Using a memorable comparison, Nathan describes APIs as Spotify playlists while MCP acts more like a live DJ that continuously adapts to the environment and audience.The conversation explores how Dataverse MCP allows AI agents to interact with Microsoft Dataverse, Dynamics 365, customer records, business processes, opportunities, support cases, and enterprise data without requiring the extensive custom integrations organizations traditionally needed.Key takeaways: * Understanding Model Context Protocol (MCP) * MCP vs traditional APIs * Context-aware enterprise AI * Dataverse integration strategies * Intelligent tool discovery * Microsoft AI architecture DATAVERSE: MORE THAN JUST A DATABASE Many organizations still view Dataverse as simply another database. Nathan explains why this perspective misses the bigger picture.Dataverse serves as Microsoft's intelligent business data platform, providing a unified data layer that connects Power Apps, Power Automate, Dynamics 365, Copilot Studio, Microsoft 365 Copilot, and AI agents. Instead of managing disconnected systems and endless integrations, organizations can leverage Dataverse as a common data foundation that simplifies development, governance, security, and AI adoption.The discussion highlights why Dataverse is becoming increasingly important as organizations deploy AI agents that require access to customer data, operational information, business processes, and enterprise knowledge.Topics covered: * Dataverse architecture * Unified business data platforms * Dynamics 365 integration * Enterprise data management * AI-ready data foundations * Modern application development BUSINESS SKILLS: THE NEXT GENERATION OF ENTERPRISE AUTOMATION Nathan also introduces Dataverse Business Skills, one of the most promising emerging capabilities for Copilot Studio and AI agents.Business Skills allow organizations to define reusable business logic and procedures that agents can discover and execute dynamically. Rather than modifying, testing, and redeploying entire agents every time a process changes, organizations can update individual skills that become immediately available to AI systems through Dataverse MCP.This creates a more scalable architecture for enterprise AI, reduces deployment complexity, and enables business teams to contribute directly to automation initiatives.Key discussion points: * What Business Skills are * Microservices for AI agents * Scalable enterprise automation * Business-user driven AI development * Dynamic agent capabilities * Future Microsoft AI architecture GOVERNANCE, COMPLIANCE AND SHADOW AI No AI discussion is complete without addressing governance, compliance, security, and risk management.Mirko and Nathan discuss the growing challenge of Shadow AI, where employees use external AI tools such as ChatGPT, Claude, Perplexity, and other generative AI platforms outside corporate governance frameworks. Rather than attempting to block AI adoption completely, Nathan argues that organizations should focus on education, visibility, governance, and responsible AI implementation.The conversation also explores Microsoft's growing investments in AI governance, agent management, security controls, compliance frameworks, and enterprise oversight capabilities.Key takeaways: * AI governance best practices * Managing Shadow AI * Enterprise AI security * Responsible AI adoption * Microsoft governance capabilities * Compliance in the age of AI THE FUTURE OF COPILOT STUDIO AND MICROSOFT AI Looking toward the future, Nathan predicts that organizations will eventually operate hundreds or even thousands of specialized AI agents. These agents will handle repetitive work, automate business processes, surface insights, manage customer interactions, and support employees across departments.The discussion explores how Copilot Studio, Microsoft 365 Copilot, Dataverse MCP, Business Skills, AI orchestration, and emerging technologies from Microsoft Build are creating the foundation for this future. Nathan also shares why he believes human expertise, creativity, relationships, and strategic thinking will become even more valuable as AI takes over routine administrative tasks.Whether you are a Microsoft 365 administrator, Dynamics 365 consultant, Power Platform developer, Solution Architect, AI strategist, business leader, or technology enthusiast, this episode offers practical insights into the technologies that will define the next generation of enterprise software. IN THIS EPISODE YOU'LL LEARN * How Copilot Studio is transforming enterprise AI * Why Dataverse MCP is a game changer for business applications * The role of Business Skills in scalable agent architectures * How Agentic AI differs from traditional automation * Why governance and Shadow AI matter more than ever * The future of Microsoft 365 Copilot and AI agents * How organizations can prepare for an AI-first future * Why Dataverse is becoming the foundation of Microsoft's AI strategy * Emerging trends from Microsoft Build * Skills Microsoft professionals should focus on next Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

10. Juni 202657 min
Episode The SLM Revolution: How Small Models Are Fixing Copilot’s Biggest Flaw Cover

The SLM Revolution: How Small Models Are Fixing Copilot’s Biggest Flaw

What if Microsoft's biggest AI breakthrough isn't a larger model?What if the future of Microsoft Copilot, enterprise AI, and Microsoft 365 productivity isn't powered by trillion-parameter frontier models at all?What if the real innovation is happening in the opposite direction?In this deep-dive episode, we explore one of the most important shifts happening in artificial intelligence today: the rise of Small Language Models (SLMs) and why they may be the key to solving Copilot's most significant architectural challenge.For years, the AI industry operated under a simple assumption: bigger models are better models. More parameters meant more intelligence, more capability, and better outcomes. That assumption helped fuel the rise of GPT-4, Claude, Gemini, and other frontier AI systems that transformed how organizations think about productivity and automation.But enterprise reality is revealing a different story.Most Microsoft 365 users are not asking AI to solve theoretical physics problems or write novels. They're summarizing email threads in Outlook. They're extracting action items from Teams meetings. They're generating document summaries in Word. They're classifying files in SharePoint. They're asking simple questions about company information, policies, procedures, and project documentation.These are narrow, repetitive, high-volume tasks.And increasingly, organizations are discovering that using the world's largest AI models for every single request may be the wrong architecture entirely.In this episode, we unpack why enterprises are rethinking their AI strategy and why Small Language Models are emerging as one of the most important developments in the Microsoft ecosystem. WHY COPILOT'S BIGGEST PROBLEM ISN'T THE LICENSE PRICE When organizations evaluate Microsoft 365 Copilot, most discussions begin with licensing costs.The conversation typically focuses on per-user pricing, deployment budgets, and ROI calculations.But in reality, the license is only the beginning.Behind every Copilot interaction sits an AI inference engine processing prompts, generating responses, and consuming computational resources. Every email summary, every meeting recap, every generated draft, and every document analysis triggers an AI workload.Multiply those requests across thousands of employees, hundreds of departments, and millions of interactions each month, and a hidden cost begins to emerge.The challenge isn't simply licensing.It's architecture.We explore how large-scale AI deployments create operational costs that most organizations fail to anticipate and why enterprises are beginning to adopt model portfolios rather than relying on a single AI model for every workload. THE HIDDEN COST OF FRONTIER MODELS Enterprise AI spending isn't just growing.It's becoming unpredictable.As AI adoption increases, organizations are seeing inference costs, compute requirements, and cloud consumption expand far beyond original expectations.In this episode, we examine: * Why AI costs scale differently than traditional software licensing * The economics of AI inference and token consumption * How routine Microsoft 365 tasks create massive AI workloads * Why enterprise AI budgets are becoming increasingly difficult to forecast * How organizations are reducing costs through hybrid model strategies You'll learn why some enterprises are achieving dramatic cost reductions by routing routine tasks to smaller models while reserving premium models for high-complexity scenarios. THE LATENCY PROBLEM NOBODY TALKS ABOUT Cost is only part of the story.Speed matters.Users expect AI to feel instant.If an employee clicks "Summarize this email thread" and waits several seconds for a response, the experience quickly becomes frustrating. When delays become common, adoption slows. When adoption slows, ROI disappears.We explore how Small Language Models dramatically reduce latency and why response times measured in milliseconds rather than seconds can fundamentally change how employees interact with AI-powered tools.The discussion covers: * User adoption psychology * Real-world Copilot usage patterns * Why latency kills productivity gains * Edge AI deployments * Local inference strategies * The relationship between performance and user trust THE DATA SOVEREIGNTY CHALLENGE For many organizations, the biggest concern isn't cost or performance.It's control.Where is your data actually processed?Who has access to it?What happens when AI workloads cross geographic boundaries?What does compliance look like in a world where AI systems may process information across multiple regions and multiple providers?This episode takes a detailed look at: * Microsoft Copilot Flex Routing * EU Data Boundary considerations * GDPR implications for AI workloads * Cross-border processing concerns * Sovereign AI strategies * Regulatory requirements in healthcare, finance, government, and critical infrastructure We explain why data sovereignty is rapidly becoming one of the most important conversations in enterprise AI and why local AI processing is gaining momentum across regulated industries. INTRODUCING MICROSOFT'S PHI FAMILY Microsoft isn't simply talking about Small Language Models.They're building them.The Phi family represents Microsoft's strategic investment in efficient, highly capable AI models designed for real-world deployment scenarios.We take a deep dive into: * Phi-3 Mini * Phi-3 Small * Phi-3 Medium * Phi-3.5 * Phi-3 Vision * Mixture-of-Experts architectures * On-device AI * Edge AI workloads You'll discover why these models are attracting so much attention and how Microsoft is positioning them as a core component of the future AI stack. CAN SMALL MODELS REALLY COMPETE? One of the biggest misconceptions in AI is that smaller models automatically mean lower quality.The reality is far more nuanced.In this episode, we examine benchmark results, real-world workloads, enterprise deployment scenarios, and the growing evidence that Small Language Models can outperform expectations when applied to the right tasks.We discuss: * MMLU performance * Instruction-following benchmarks * Summarization workloads * Document processing * Email drafting * Meeting recap generation * Knowledge retrieval * Enterprise search The goal isn't replacing frontier models.The goal is using the right model for the right job.AZURE LOCAL AND THE SOVEREIGN AI FUTUREAzure Local may become one of the most important platforms in Microsoft's AI strategy.As organizations demand greater control over where AI runs and how data is processed, local AI infrastructure is becoming increasingly attractive.We explore how Azure Local enables organizations to: * Run AI workloads closer to their data * Reduce latency * Improve compliance * Support disconnected environments * Enable edge AI deployments * Build sovereign AI architectures Whether you're operating in manufacturing, healthcare, government, defense, finance, or energy, this section provides practical insights into the future of local AI infrastructure. THE RISE OF MODEL ROUTING Perhaps the most important idea discussed in this episode is the concept of model routing.The future isn't GPT-4 versus Phi.The future is GPT-4 and Phi working together.Instead of asking which model is best, organizations are beginning to ask which model is best for each specific task.This shift introduces a new architectural pattern where: * Small models handle routine requests * Large models handle complex reasoning * Routing engines determine the optimal destination * Costs decrease * Performance improves * Governance becomes easier We explain why many experts believe this model portfolio approach represents the next evolution of enterprise AI. BUILDING A MICROSOFT 365 AI STRATEGY Technology alone is not enough.Successful AI adoption requires governance, architecture, operating models, security frameworks, and long-term planning.In the final section, we outline practical guidance for IT leaders, architects, Microsoft 365 administrators, security professionals, and business decision-makers who want to prepare for the next generation of AI-powered workplaces.You'll learn how to: * Identify suitable SLM workloads * Build hybrid AI architectures * Evaluate deployment options * Improve governance controls * Reduce AI operating costs * Increase employee adoption * Prepare for Microsoft's evolving AI roadmap Become a supporter of this podcast: https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support [https://www.spreaker.com/podcast/m365-fm-modern-work-security-and-productivity-with-microsoft-365--6704921/support?utm_source=rss&utm_medium=rss&utm_campaign=rss].

10. Juni 20261 h 27 min