Designing Data-Intensive Applications by Martin Kleppmann

Think about (latency) percentiles rather than averages; as Amazon puts it, the tail end is probably the part you need to optimize for. The 99.9th percentile, for example, only concerns the slowest 1 out of 1,000 requests, but those requests often belong to the clients with the most data, who are usually also the ones paying the most, so they are probably the ones who most need this kind of sustained performance.
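To make the difference concrete, here is a minimal Python sketch comparing the average with a few percentiles. It is not from the book: the `percentile` helper is a hypothetical name, it uses the simple nearest-rank definition (monitoring tools often interpolate or approximate instead), and the latency numbers are made up for illustration.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Made-up data: 997 fast requests (10 ms) and 3 very slow ones (2000 ms).
latencies_ms = [10] * 997 + [2000] * 3

mean_ms = sum(latencies_ms) / len(latencies_ms)
p50 = percentile(latencies_ms, 50)
p99 = percentile(latencies_ms, 99)
p999 = percentile(latencies_ms, 99.9)

print(f"mean={mean_ms:.2f} ms  p50={p50} ms  p99={p99} ms  p99.9={p999} ms")
```

With this made-up data the mean and even the 99th percentile look healthy, and only the 99.9th percentile exposes the 2-second requests, which is the kind of tail the note above is talking about.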