TechEcho

11 comments

baqabout 6 years ago

IMHO plotting the distribution should be the first step before trying to compute its statistics. If you know the shape, you can understand the values - otherwise it's guesswork.

评论 #19561863 未加载

评论 #19557836 未加载

评论 #19560818 未加载

评论 #19562341 未加载

评论 #19559648 未加载

评论 #19559655 未加载

Rafuinoabout 6 years ago

This topic always leads me to think about this great talk from Gil Tene on how NOT to measure latencies (basically, don't use averages!).<a href="https://www.youtube.com/watch?v=lJ8ydIuPFeU" rel="nofollow">https://www.youtube.com/watch?v=lJ8ydIuPFeU</a>I'm also a huge fan of how Dormando showed latency distributions in one of his recent Memcached Extstore posts. The default is 95th percentile but you can change the percentile to what matters to you (i.e. 99th percentile if you ask me!). Scroll down to see what he did and play with it.<a href="https://memcached.org/blog/nvm-multidisk/" rel="nofollow">https://memcached.org/blog/nvm-multidisk/</a>

评论 #19559421 未加载

cromulentabout 6 years ago

There's a great story on 99% Invisible about averages, particularly when used to design cockpits for the average pilot.<a href="https://99percentinvisible.org/episode/on-average/" rel="nofollow">https://99percentinvisible.org/episode/on-average/</a>

sohkamyungabout 6 years ago

Check out this comic on "Why Not to Trust Statistics" [1]. His book, "Math With Bad Drawings" [2] has a chapter on statistics and why not to trust a single statistical measure only.[1] <a href="https://mathwithbaddrawings.com/2016/07/13/why-not-to-trust-statistics/" rel="nofollow">https://mathwithbaddrawings.com/2016/07/13/why-not-to-trust-...</a>[2] <a href="https://mathwithbaddrawings.com/2018/05/23/math-with-bad-drawings-the-book/" rel="nofollow">https://mathwithbaddrawings.com/2018/05/23/math-with-bad-dra...</a>

camel_gopherabout 6 years ago

Percentiles can be misleading, try a histogram - <a href="https://www.circonus.com/2018/11/the-problem-with-percentiles-aggregation-brings-aggravation/" rel="nofollow">https://www.circonus.com/2018/11/the-problem-with-percentile...</a>

novaleafabout 6 years ago

My own solution, which might be useful to those using javascript (nodejs or browser):I use mathjs.quantileSeq() and log 0%, 25%, 50%, 75%, and 100%. This seems to be good for "casual metric logs".I've found that this gives a good shape of the data, as well as the absolute min/max values. If you use 1% or 99% you'll miss the absolute worst performers, and I want to be at least aware of what the worst performance numbers are.<a href="https://mathjs.org/" rel="nofollow">https://mathjs.org/</a><a href="https://mathjs.org/docs/reference/functions/quantileSeq.html" rel="nofollow">https://mathjs.org/docs/reference/functions/quantileSeq.html</a>

LiamPaabout 6 years ago

Site Reliabilty Engineering goes over this in a lot more detail.<a href="https://landing.google.com/sre/books/" rel="nofollow">https://landing.google.com/sre/books/</a>

评论 #19559279 未加载

phosfoxabout 6 years ago

Reminds me of “Don’t cross a river if it is four feet deep on average.” — Nassim Nicholas Taleb

评论 #19564026 未加载

mikorymabout 6 years ago

I've used Elasticsearch + Kibana for agricultural data and similarly "expanded" the view out from averages to time series.People in agriculture love averages and it makes a lot of sense in financial data since averages preserve totals e.g.:50 ton / ha average over 100 ha = 5 000 tonsAt the same time summing each individual ha gives you 5 000 tons total.But once you realise that you can expand on this, things get really interesting. I don't know of other people working on the same problems that I am working on, but they are relevant both economically (in the sense of making money) and environmentally (in the sense of improving efficiency and managing climate).

SketchySeaBeastabout 6 years ago

More knowledge is always better, but percentiles are a little misleading as well - the 99% at 867 ms latency makes you have a moment of panic, but when you see that 95% is 60 ms, then you really realize how few of your visitors are experiencing the slow response. Might it be a problem? Possibly, and I has brought awareness to that potential, but it also has the possibility to blow it out of proportion if you don't look at the rest of the data.Edit: I'm not saying Averages are better, but that Percentiles can be misleading as well.

评论 #19558223 未加载

评论 #19558065 未加载

评论 #19557692 未加载

评论 #19557400 未加载

评论 #19557961 未加载

Lightbodyabout 6 years ago

One of my favorite (short) talks on this topic. Well worth a few minutes of your time:<a href="https://www.youtube.com/watch?v=coNDCIMH8bk" rel="nofollow">https://www.youtube.com/watch?v=coNDCIMH8bk</a>

评论 #19562500 未加载

11 comments

baqabout 6 years ago

IMHO plotting the distribution should be the first step before trying to compute its statistics. If you know the shape, you can understand the values - otherwise it's guesswork.

评论 #19561863 未加载

评论 #19557836 未加载

评论 #19560818 未加载

评论 #19562341 未加载

评论 #19559648 未加载

评论 #19559655 未加载

Rafuinoabout 6 years ago

评论 #19559421 未加载

cromulentabout 6 years ago

sohkamyungabout 6 years ago

camel_gopherabout 6 years ago

novaleafabout 6 years ago

LiamPaabout 6 years ago

Site Reliabilty Engineering goes over this in a lot more detail.<a href="https://landing.google.com/sre/books/" rel="nofollow">https://landing.google.com/sre/books/</a>

评论 #19559279 未加载

phosfoxabout 6 years ago

Reminds me of “Don’t cross a river if it is four feet deep on average.” — Nassim Nicholas Taleb

Averages Can Be Misleading: Try a Percentile (2014)

11 comments

Averages Can Be Misleading: Try a Percentile (2014)

11 comments