# Kate Matsudaira - Software Managers' Guide to Operational Excellence (Highlights)

## Metadata
**Review**:: [readwise.io](https://readwise.io/bookreview/34572475)
**Source**:: #from/readwise #from/reader
**Zettel**:: #zettel/fleeting
**Status**:: #x
**Authors**:: [[Kate Matsudaira]]
**Full Title**:: Software Managers' Guide to Operational Excellence
**Category**:: #articles #readwise/articles
**Category Icon**:: 📰
**URL**:: [queue.acm.org](https://queue.acm.org/detail.cfm?id=3631176)
**Host**:: [[queue.acm.org]]
**Highlighted**:: [[2023-11-26]]
**Created**:: [[2023-11-25]]
## Highlights
- One of the hardest things about being the manager is owning responsibility for everything but having no direct control. ([View Highlight](https://read.readwise.io/read/01hg4ek3q7rhzzvagrjys2b2mx)) ^631771588
- *Does the team have monitoring and dashboards?* It is not enough to have the instrumentation; you need to verify it is working and know how to use it (and find it). ([View Highlight](https://read.readwise.io/read/01hg4etfdn9702f160mxansv3t)) ^631773142
- *Runbook (or playbooks)?* Has the team planned through what to do when things go wrong? Is there enough documentation for someone less familiar with the project to build the code and deploy it? Make sure it is clear how to restart, reboot, clear the cache, warm up the cache, deploy clean, etc. ([View Highlight](https://read.readwise.io/read/01hg4ewbac6bmcx5xyym8yfgc4)) ^631773385
- *Disaster recovery plans?* What happens when everything goes wrong? Are there backups? How do you restore from the backup? Have you thought about failover and redundancy? ([View Highlight](https://read.readwise.io/read/01hg4ewn84ea0r25mz1z8phn45)) ^631773435