Monitoring Distributed Systems: Case Studies from Google's SRE Teams

Author(s): Rob Ewaschuk; Betsy Beyer
Publisher: O'Reilly
Year: 2016

Language: English
Pages: 22

Cover
Web Ops
Copyright
Table of Contents
Chapter 1. Monitoring Distributed Systems
Definitions
Why Monitor?
Setting Reasonable Expectations for Monitoring
Symptoms Versus Causes
Black-Box Versus White-Box
The Four Golden Signals
Worrying About Your Tail (or, Instrumentation and Performance)
Choosing an Appropriate Resolution for Measurements
As Simple as Possible, No Simpler
Tying These Principles Together
Monitoring for the Long Term
Bigtable SRE: A Tale of Over-Alerting
Gmail: Predictable, Scriptable Responses from Humans
The Long Run
Conclusion
About the Author and Editor