Ai Change Desk

AI Change Desk | EP024: Delegation Quality Check

14 min · May 6, 2026

Description

Episode date: 2026-05-06
Format: Wednesday brief
Runtime target: 9-12 minutes

Agents are moving from answering questions to taking assignments. This episode connects Microsoft Copilot Cowork, Microsoft Agent 365, and Anthropic's financial-services agent templates into one operating question: when AI does the assignment, who owns the review?

* Delegation is becoming embedded inside everyday work surfaces, not just chat windows.
* Agent control planes help with inventory and governance, but dashboards do not replace workflow ownership.
* Vertical agents in finance make review, source lineage, evidence, and fallback ownership more urgent.
* Useful output is not the same as ready output.

By Wednesday, May 13, 2026, run a 30-minute delegation-quality review for one AI-assisted workflow. Capture the task, approved users, sources, output artifact, reviewer, evidence, fallback owner, stop condition, and user message.

* Microsoft: Copilot Cowork: From conversation to action across skills, integrations, and devices - https://www.microsoft.com/en-us/microsoft-365/blog/2026/05/05/copilot-cowork-from-conversation-to-action-across-skills-integrations-and-devices/
* Microsoft: Microsoft Agent 365, now generally available, expands capabilities and integrations - https://www.microsoft.com/en-us/security/blog/2026/05/01/microsoft-agent-365-now-generally-available-expands-capabilities-and-integrations/
* Anthropic: Agents for financial services - https://www.anthropic.com/news/finance-agents
* Microsoft: Microsoft 365 Copilot, human agency, and the opportunity for every organization - https://www.microsoft.com/en-us/microsoft-365/blog/2026/05/05/microsoft-365-copilot-human-agency-and-the-opportunity-for-every-organization/
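
The nine review fields named in the action item above could be kept as one small record per workflow. What follows is a minimal Python sketch, not something from the episode: the names `DelegationReview` and `missing_fields`, and the example values, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, fields


@dataclass
class DelegationReview:
    """One delegation-quality review. Field names mirror the episode's list."""
    task: str = ""
    approved_users: str = ""
    sources: str = ""
    output_artifact: str = ""
    reviewer: str = ""
    evidence: str = ""
    fallback_owner: str = ""
    stop_condition: str = ""
    user_message: str = ""


def missing_fields(review: DelegationReview) -> list[str]:
    """Return the names of fields that are still blank."""
    return [f.name for f in fields(review) if not getattr(review, f.name).strip()]


# Hypothetical example: only two of the nine fields are filled in so far.
review = DelegationReview(task="Draft monthly variance summary", reviewer="Finance lead")
print(missing_fields(review))
```

A review could be treated as complete once `missing_fields` returns an empty list. That threshold is an assumption of this sketch, not a rule stated in the episode.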

All episodes

28 episodes

AI Change Desk | EP029: Agent Reliability Evidence Check

Date: 2026-06-01

Agents are getting longer leashes: remote work sessions, stronger coding/workflow behavior, and practical observability/test tooling are all moving at the same time. This episode turns that into an operator question: when an agent can do more, what proof do you require before you trust the work?

Run one agent reliability evidence check this week:

1. Scope receipt: what can it reach?
2. Effort receipt: how long, how hard, and how expensively can it work before a checkpoint?
3. Quality receipt: what tests or reviews prove the output is usable?
4. Drift receipt: what changed since the last good run?
5. Fallback receipt: who stops, reroutes, or explains it when it fails?

* OpenAI ChatGPT release notes: https://help.openai.com/en/articles/6825453-chatgpt-release-notes
* OpenAI Codex cloud documentation: https://developers.openai.com/codex/cloud/
* Anthropic Claude Opus 4.8: https://www.anthropic.com/news/claude-opus-4-8
* AWS LLM observability: https://aws.amazon.com/blogs/machine-learning/comprehensive-observability-for-amazon-sagemaker-ai-llm-inference-from-gpu-utilization-to-llm-quality/
* AWS deep-agent evaluations: https://aws.amazon.com/blogs/machine-learning/evaluating-deep-agents-using-langsmith-on-aws/
* AWS agent test-suite datasets: https://aws.amazon.com/blogs/machine-learning/build-a-test-suite-that-grows-with-your-agent-with-dataset-management-in-amazon-bedrock-agentcore/
* OpenAI May 28 model lifecycle note: https://help.openai.com/en/articles/6825453-chatgpt-release-notes

AI-assisted tools were used in parts of the research and production workflow. Final editorial judgment, risk posture, and release approval stayed human-led. This is operational guidance, not legal advice. These are my opinions and are not representative of any organization.

Yesterday · 27 min

AI Change Desk | EP027: No-New-Delta Verification Discipline Check

Today is Memorial Day in the United States, and there will be no Wednesday AI Change Desk episode this week. This Monday episode keeps the feed useful without forcing novelty into a quiet official-news cycle.

The core operating point: no-new-delta days are not skip days. They are verification days. The May 25 source check did not find a newer relevant OpenAI release-note date displacing the May 21 Codex update. That means the operating frame should stay date-bounded: the Codex execution signal remains current, while provenance, creator distribution, and community pulse remain supporting context.

Teams get into trouble when they confuse "checked today" with "changed today." A refreshed page, community chatter, or a useful trade report can create pressure to say something new. The discipline is to separate confirmed change, continuity context, and directional signal.

Memorial Day is observed on the last Monday of May and honors those who died in service to the country. The episode includes a brief respectful segment acknowledging the day and the value of restraint before returning to the operational topic.

Before locking any AI release note, script, stakeholder update, or internal status memo this week, add three fields:

* net new official delta: yes or no
* latest official date seen: source and date
* carry-forward justification: why the prior frame still stands

There will be no Wednesday episode this week. AI Change Desk returns with the next Monday main episode.

* OpenAI ChatGPT release notes: https://help.openai.com/en/articles/6825453-release-notes
* OpenAI provenance post: https://openai.com/index/advancing-content-provenance/
* Podnews Report Card 2026 Results: https://podnews.net/article/report-card-2026-results
* YouTube news from Google I/O 2026: https://blog.youtube/news-and-events/youtube-news-google-io-2026/
* U.S. Census Bureau Memorial Day 2026: https://www.census.gov/newsroom/stories/memorial-day.html

AI-assisted tools were used in parts of the research and production workflow. Final editorial judgment, risk posture, and release approval stayed human-led. This is operational guidance, not legal advice. These are Michael's opinions and are not representative of any organization.
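
The three verification fields from this episode's action item could be attached to a status memo as a small record. The Python sketch below is illustrative only. The class name `DeltaCheck`, its attribute names, and the `ready_to_lock` rule are assumptions, not something the episode prescribes. The example values reuse the dates stated above (a May 25 check, with May 21 as the latest official date seen).

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class DeltaCheck:
    """The three fields to add before locking a note, script, or memo."""
    net_new_official_delta: bool           # yes or no
    latest_official_source: str            # where the latest date was seen
    latest_official_date: date             # the latest official date seen
    carry_forward_justification: str = ""  # why the prior frame still stands

    def ready_to_lock(self) -> bool:
        # Assumed rule: with no new delta, a written justification is required.
        if self.net_new_official_delta:
            return True
        return bool(self.carry_forward_justification.strip())


check = DeltaCheck(
    net_new_official_delta=False,
    latest_official_source="OpenAI release notes",
    latest_official_date=date(2026, 5, 21),
    carry_forward_justification="May 25 check found no newer relevant release-note date.",
)
print(check.ready_to_lock())
```

Keeping the source and the date as separate attributes makes "checked today" (when the record was filled in) harder to confuse with "changed today" (the official date itself).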

May 25, 2026 · 29 min

AI Change Desk | EP026: Agent Toolchain Ownership Check

AI agents are moving from chat windows into toolchains: managed execution environments, SDKs, MCP servers, mobile approvals, workspace integrations, search agents, shopping agents, and enterprise platforms. This episode translates the week of announcements into one operator question: who owns the toolchain when the agent starts acting?

* Google I/O 2026 pushed agentic Gemini deeper into developer tools, Search, Workspace, shopping, app development, and personal agent surfaces.
* Anthropic announced it is acquiring Stainless, an SDK and MCP server tooling company that has generated official Anthropic SDKs.
* Anthropic and KPMG announced a global alliance to embed Claude into KPMG Digital Gateway and make Claude available to more than 276,000 employees.
* OpenAI Codex mobile and ChatGPT personal finance remain active control signals from the prior week: approvals and sensitive data context are moving closer to always-on workflows.

Do not treat agent access as a one-time tool approval. Treat it as a toolchain lifecycle: owner, connector, permission boundary, evidence, fallback, and shutdown authority.

By Wednesday, May 27, 2026, complete one agent-toolchain ownership review for your highest-impact AI workflow. Fields to capture: workflow name, agent surface, SDK/API/tool dependencies, connector owner, permission boundary, human approval point, evidence trail, fallback route, shutdown owner, and next review date.

* Anthropic: Anthropic acquires Stainless - https://www.anthropic.com/news/anthropic-acquires-stainless
* Anthropic: KPMG integrates Claude across its core business and workforce of more than 276,000 - https://www.anthropic.com/news/anthropic-kpmg
* Google: I/O 2026 collection - https://blog.google/innovation-and-ai/technology/developers-tools/google-io-2026-collection/
* Google: I/O 2026 developer highlights - https://blog.google/innovation-and-ai/technology/developers-tools/google-io-2026-developer-highlights/
* Google: I/O 2026 opening keynote - https://blog.google/innovation-and-ai/sundar-pichai-io-2026/
* OpenAI: Work with Codex from anywhere - https://openai.com/index/work-with-codex-from-anywhere/
* OpenAI: ChatGPT release notes - https://help.openai.com/en/articles/6825453-chatgpt-release-notes

AI-assisted tools were used in parts of the research and production workflow. Final editorial judgment, risk posture, and release approval stayed human-led. This is operational guidance, not legal advice. These are Michael's opinions and are not representative of any organization.

May 20, 2026 · 15 min

AI Change Desk | EP025: Away-Mode Control Check

Michael is back after a week away handling personal things and camping by the beach with family and dogs. The timing created the perfect operating question: while people are away from the desk, AI work keeps moving.

This episode turns fresh AI workflow-surface announcements into a practical control check for operators. The core issue is not whether teams can work from anywhere. They already can. The issue is whether the organization knows what can move, what must wait, what creates evidence, and who can stop a workflow when the normal owner is offline.

* OpenAI Codex moving into mobile and remote task oversight.
* OpenAI personal finance in ChatGPT as a signal for sensitive connected-account workflows.
* Google Gemini Intelligence across Android devices and browser contexts.
* Anthropic and PwC expanding Claude deployment across professional workflows.
* OpenAI launching The OpenAI Deployment Company as a deployment-layer signal.

Run a 45-minute Away-Mode Control Check on one live AI workflow:

1. Map which surfaces can trigger it.
2. Classify the action state it can reach.
3. Define what happens when the owner is away.
4. Confirm what evidence is created.
5. Identify the data class touched.
6. Set final-confirmation rules.
7. Name who can stop it.

* OpenAI, “Work with Codex from anywhere,” May 14, 2026.
* OpenAI, “Simplify your personal finances with ChatGPT,” May 15, 2026.
* Google, “A smarter, more proactive Android with Gemini Intelligence,” May 12, 2026.
* Anthropic, “Expanding our partnership with PwC,” May 14, 2026.
* OpenAI, “OpenAI launches The OpenAI Deployment Company,” May 11, 2026.

AI-assisted tools were used in parts of the research and production workflow. Final editorial judgment, risk posture, and release approval stayed human-led. This is operational guidance, not legal advice. These are Michael’s opinions and are not representative of any organization.

May 18, 2026 · 21 min

AI Change Desk | EP024: Delegation Quality Check

May 6, 2026 · 14 min