Understanding Percentiles (2021)

47 points by subomi over 1 year ago

3 comments

gumby over 1 year ago
It’s just the generalization of the median to more than two buckets (in this case 100 of them).
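To make the median analogy concrete, here is a minimal sketch using the nearest-rank definition of a percentile. That definition is one of several in common use and is an assumption here, as are the function name `percentile_nearest_rank` and the sample latencies. Under this definition, p50 of an odd-length sample is exactly the median.

```python
import math
import statistics


def percentile_nearest_rank(values, p):
    """Return the smallest value with at least p percent of the data at or below it.

    Nearest-rank definition (an assumption; other definitions interpolate).
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(values)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank into the sorted data
    return ordered[rank - 1]


# Hypothetical request latencies in milliseconds (11 samples).
latencies_ms = [12, 15, 11, 40, 13, 14, 250, 16, 12, 13, 18]

p50 = percentile_nearest_rank(latencies_ms, 50)
p90 = percentile_nearest_rank(latencies_ms, 90)
p99 = percentile_nearest_rank(latencies_ms, 99)

print(p50, statistics.median(latencies_ms))  # both are the middle value
print(p90, p99)
```

With only 11 samples, p99 is simply the largest value, which is why high percentiles need a reasonably large sample to be meaningful.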
andreasha over 1 year ago
https://webcache.googleusercontent.com/search?q=cache:k3QOarK_7X8J:https://blog.shalvah.me/posts/understanding-percentiles&cd=8&hl=sv&ct=clnk&gl=se

archive.is WIP https://archive.is/wip/vLSWG
Xcelerate over 1 year ago
Percentiles become painful when you try to aggregate them. Suppose you record the p99 latency of some service, but you collect these metrics at the rack or data center level. Now you ask: what is the overall p99 latency of the service? Not an easy question to answer, especially if you automatically subsample older time series data in order to store more of it. (I've seen people try to take a weighted average of subsampled percentile metrics; it turns into a mess.)

We need an efficient way to compactly represent the entire distribution of a metric over time so that arbitrary aggregations can be performed accurately. There is some research on this topic, but nothing really production-ready that I'm aware of.
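A small sketch of why this is hard, using made-up data for two racks. It compares the average of per-rack p99 values with the p99 of the combined raw data. It then shows one simple mergeable representation: a fixed-bucket histogram whose counts can be added across racks. The bucket bounds, rack data, and function names are all assumptions for illustration. The histogram is a coarse approximation, not a recommendation of a specific production technique.

```python
import math
from bisect import bisect_left
from collections import Counter


def p_nearest_rank(values, p):
    """Nearest-rank percentile of raw values (one of several definitions)."""
    ordered = sorted(values)
    return ordered[math.ceil(p * len(ordered) / 100) - 1]


# Hypothetical bucket upper bounds in ms; values above the last bound are not handled.
BOUNDS = [10, 20, 50, 100, 200, 500, 1000]


def to_histogram(values):
    """Count values per bucket; bucket i holds values <= BOUNDS[i] and > BOUNDS[i-1]."""
    return Counter(bisect_left(BOUNDS, v) for v in values)


def histogram_percentile(hist, p):
    """Upper bound of the bucket that contains the nearest-rank p-th percentile."""
    total = sum(hist.values())
    if total == 0:
        raise ValueError("empty histogram")
    rank = math.ceil(p * total / 100)
    seen = 0
    for i in sorted(hist):
        seen += hist[i]
        if seen >= rank:
            return BOUNDS[i]


# Made-up latencies (ms): a busy, mostly fast rack and a quiet, often slow one.
rack_a = [10] * 980 + [40] * 20
rack_b = [10] * 50 + [400] * 50

p99_a = p_nearest_rank(rack_a, 99)
p99_b = p_nearest_rank(rack_b, 99)
naive_average = (p99_a + p99_b) / 2
weighted_average = (p99_a * len(rack_a) + p99_b * len(rack_b)) / (len(rack_a) + len(rack_b))
true_p99 = p_nearest_rank(rack_a + rack_b, 99)

# Histograms merge by adding counts, so no raw data has to be kept.
merged = to_histogram(rack_a) + to_histogram(rack_b)
estimated_p99 = histogram_percentile(merged, 99)

print(p99_a, p99_b, naive_average, round(weighted_average, 1), true_p99, estimated_p99)
```

In this example neither the plain nor the request-weighted average of the per-rack p99 values equals the p99 of the combined data. The merged histogram gives an upper bound whose accuracy is limited by the bucket width, which is the trade-off any compact representation of the distribution has to manage.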