Better Incidents Podcast

Alerting, Incident Response, and the SDLC

32 min · 5. okt. 2023
episode Alerting, Incident Response, and the SDLC cover

Description

In this episode we chat with veteran cloud architect Masaru Hoshi about the challenges of alert fatigue, the importance of effective alerting systems, and fostering ownership in software teams. Masaru shares insights from his 30-year career, emphasizing the need for balance, trust, and collaboration in incident response.

Comments

0

Be the first to comment

Sign up now and become a member of the Better Incidents Podcast community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

9 episodes

episode Focus on Assembly Time with Great Circle's Brent Chapman artwork

Focus on Assembly Time with Great Circle's Brent Chapman

When it comes to resolving an incident there are a number of metrics that can be misleading. Resolution time, for example, can fluctuate wildly. However, there’s one that we have a significant amount of influence over. Today, I’m talking to Brent Chapman, Founder at Great Circle, about how engineering teams should ditch metrics like MTTR and instead focus on what we can control; assembly time. Brent's Information: Website: https://greatcircle.com/ [https://greatcircle.com/] LinkedIn: https://www.linkedin.com/in/brentchapman/ [https://www.linkedin.com/in/brentchapman/] Twitter: https://twitter.com/brent_chapman [https://twitter.com/brent_chapman] ⁠https://esd.burningman.org/⁠ WW2 plane improvements [https://www.inc.com/minda-zetlin/steve-jobs-boeing-b-17-wwii-paul-fitts-alphonse-chapanis-code-shaping-ergonomics-design.html] Book - The Checklist Manifesto [https://amzn.to/3IcwJy2] https://slack.com/events/resolve-incidents-faster-in-slack [https://slack.com/events/resolve-incidents-faster-in-slack] https://slack.com/blog/collaboration/engineers-netflix-pagerduty-slack [https://slack.com/blog/collaboration/engineers-netflix-pagerduty-slack] https://slack.com/resources/using-slack/the-modern-incident-response [https://slack.com/resources/using-slack/the-modern-incident-response] https://slack.com/resources/using-slack/slack-for-incident-management [https://slack.com/resources/using-slack/slack-for-incident-management] https://slack.com/blog/transformation/incident-management-slack [https://slack.com/blog/transformation/incident-management-slack] https://slack.com/intl/en-in/events/minimize-incident-response-times [https://slack.com/intl/en-in/events/minimize-incident-response-times]

18. maj 202349 min