Reliability

Incident reviews that change behaviour

Blameless post-mortems are table stakes. The harder problem is making the actions stick — here's a structure that consistently does.

Brian18 January 20265 min read

Every mature engineering org runs blameless post-mortems. Fewer have the discipline to make the resulting actions stick. The structural fixes below are what consistently move the needle.

One owner, one date, one acceptance test

Every action item must have all three. No 'team to investigate'. No 'sometime next quarter'. No 'we'll know it when we see it'. If you can't write the acceptance test, the action isn't ready to commit to.

Track actions where work actually happens

Action items in the post-mortem doc are theatre. Action items in the team backlog with a label that lets you query 'all open incident actions older than 60 days' is real.

Review the actions, not just the incidents

Monthly reliability review: how many incident actions opened, how many closed, how many slipped. The trend matters more than any individual ticket.

More insights

Let's talk

Ready to build a platform that scales?

Book a free 30-minute discovery call to review your infrastructure and map out clear recommendations.

  • 30-minute discovery call, no obligation
  • Architecture review with concrete clear recommendations
  • Independent consultancy, direct, hands-on advice