[cloudplatform.googleblog.com] Incident management at Google — adventures in SRE-land

July 21, 2017

Have you ever wondered what happens at Google when something goes wrong? Our industry is fond of using colorful metaphors such as “putting out fires” to describe what we do.

Of course, unlike those that actual firefighters face, our incidents don’t normally involve risk to life and limb. Despite the imperfect metaphor, Google Site Reliability Engineers (SREs) have a lot in common with first responders in other fields.

Read the full article at: cloudplatform.googleblog.com