TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Do not confuse a random variable with its distribution

61 点作者 reqo11 个月前

14 条评论

eru11 个月前
&gt; A random variable measures a numerical quantity which depends on the outcome of a random phenomenon.<p>Hmm, that sentence at the beginning is already wrong. Random variables can measure anything, not just numbers. Heads or Tails of a coin, or colours of cars etc.<p>It&#x27;s fine to restrict yourself to numeric random variables only. But if you are writing a rant telling other people to be more careful in their analysis, you better dot your i&#x27;s and cross your t&#x27;s yourself.
评论 #40796192 未加载
评论 #40797051 未加载
评论 #40796146 未加载
评论 #40797098 未加载
评论 #40796247 未加载
auraai11 个月前
This is pretty important in mathematical finance, where one moves from a real-world measure to a risk-neutral measure to make computations feasible.<p><a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Girsanov_theorem" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Girsanov_theorem</a> <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Risk-neutral_measure" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Risk-neutral_measure</a>
btown11 个月前
For the code-minded out there, a &quot;random variable&quot; is something of a lazily evaluated value that can be &quot;sampled&quot; and emit a quantity (or a vector&#x2F;tensor thereof) each time. And the OP article boils down to the fact that it&#x27;s generally incorrect to assume that any random variable can be represented solely by its unconditional probability distribution; a distribution is more of a visualization than a sufficient definition. Rather, one must track the entire graph of other random variables that may feed the current one (e.g. that the current one is conditional on), akin to how an Excel spreadsheet models all the dependencies of a cell.<p>The fun part comes when you can ask this computation graph: &quot;what parameters for a random variable early on in the chain would be the ones that optimize some function of variables later in the chain?&quot; And, handwaving a ton of nuance here, when those parameters are weights in a neural network, the function is a loss function on the training data, and the optimization is done by automatic differentiation (e.g. <a href="https:&#x2F;&#x2F;pytorch.org&#x2F;tutorials&#x2F;beginner&#x2F;introyt&#x2F;autogradyt_tutorial.html" rel="nofollow">https:&#x2F;&#x2F;pytorch.org&#x2F;tutorials&#x2F;beginner&#x2F;introyt&#x2F;autogradyt_tu...</a>), you have modern AI.<p>If you&#x27;re interested in the theoretical underpinnings here, Bishop&#x27;s PRML is perhaps the classic starting point: <a href="https:&#x2F;&#x2F;www.microsoft.com&#x2F;en-us&#x2F;research&#x2F;uploads&#x2F;prod&#x2F;2006&#x2F;01&#x2F;Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf" rel="nofollow">https:&#x2F;&#x2F;www.microsoft.com&#x2F;en-us&#x2F;research&#x2F;uploads&#x2F;prod&#x2F;2006&#x2F;0...</a>
panic11 个月前
There’s an interesting connection here to another article on the front page: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40794786">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40794786</a><p>In that article, squaring a number in interval arithmetic is different from multiplying two independent numbers with the same interval. Here, squaring a random variable is different from multiplying two independent random variables with the same distribution.
condwanaland11 个月前
Love to see things built with bookdown, which is such an awesome R package (although it&#x27;s successor, Quarto, is much better and simpler)
jhrmnn11 个月前
In quantum mechanics, the measurement and observation are two sides of the same coin, and the sample space is _defined_ by the random variable (observable) of interest, so it makes a little less sense to separate the two. (There is no hidden observation-independent sample space.)
评论 #40796945 未加载
kazinator11 个月前
Who confuses a random variable with its distribution, and what does that mistake look like? I don&#x27;t get it.
dwqwdqd11 个月前
Does this work?<p>X = 1 with probability 0.5, 0 with probability 0.5 Y = 0 when X = 1, 1 when X = 0 (for the \omega for which X(\omega) = 1, Y(\omega) = 0).<p>They&#x27;re both bernoulli distributions with p=0.5 (i.e. they follow the same distribution) and P(X=Y) = 0
评论 #40795790 未加载
baking11 个月前
Why make it complicated? One coin flip, X = Heads and Y = Tails. P(X = Y) = 0.
评论 #40795654 未加载
评论 #40795803 未加载
Davidzheng11 个月前
Honestly it&#x27;s fine to confuse a random variable with its distribution if you only are working with a single RV. Changing probability space without changing distribution doesn&#x27;t really matter much, probability space is more of an abstraction it&#x27;s not really measurable
tpoacher11 个月前
I&#x27;ve often felt that one of the reasons such warnings are even necessary, is because the notation we use to denote probabilities in the first place is atrocious, and clearly an abuse of notation.<p>A better convention would make clear the distinction between the set of possible outcomes, the act of obtaining a (range of) samples from that set, and the probability that those events match a value range of interest. p(x=X) is not enough to capture all that information. let alone p(x) vs p(X).
clircle11 个月前
Whoops, you posted the wrong page. The statistics page that Hacker News needs to read is the one about how the Central Limit Theorem doesn&#x27;t apply to everything damn thing.
评论 #40796626 未加载
评论 #40796911 未加载
评论 #40796583 未加载
评论 #40795956 未加载
dinobones11 个月前
These types of explanations are the reason I dislike school. This is such a stuffy and contrived way to explain things.<p>I’m so glad I have ChatGPT now, I always ask for applied examples and ask it to explain things intuitively. I would’ve been a 4.0 student if I would’ve had ChatGPT as my personal tutor when I was in school.
glitchc11 个月前
This article is overly complicated. The random variable X is a function mapping the outcome to its probability. The distribution, or the probability density function, or pdf, is the integral of that function. The cumulative density function, the cdf, is in turn the integral of the pdf.
评论 #40797052 未加载
评论 #40797047 未加载