Soft, contained failures are good. For those in an oncall rotation, this means fewer interrupted sleep cycles and dinner dates. However, there’s a drawback to building these more resilient, coordinated services:
there are exponentially more ways for these systems to fail. Configuring alerts to handle every possible failure isn’t maintainable.
Read the full article at: blog.scoutapp.com