TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Understanding Stein's Paradox (2021)

103 pointsby robertvcabout 1 year ago

10 comments

rssoconnorabout 1 year ago
I think the part on &quot;How arbitrary is the origin, really?&quot; is not correct. The origin <i>is</i> arbitrary. As the Wikipedia article points you you can pick <i>any</i> point, whether or not it is the origin, and use the James-Stein estimator to push your estimate towards that point and it will improve one&#x27;s mean squared error.<p>If you pick a point to the left of your sample, then moving your estimate to the left will improve your mean squared error on average. If you pick a point to the right of your sample, then moving your estimate to the right will improve your mean squared error as well.<p>I&#x27;m still trying to come to grips with this, and below is conjecture on my part. Imagine sampling many points from a 3-D Gaussian distribution (with identity covariance), making a nice cloud of points. Next choose any point P. P could be close to the cloud or far away, it doesn&#x27;t matter. No matter which point P you pick, if you adjust all the points from your cloud of samples in accordance to this James-Stein formula, moving them all towards your chosen point P by various amounts, then, on average they will move closer to the center of your Gaussian distribution. This happens no matter where P is.<p>The cloud is, of course, centered around the center of the Gaussian distribution. As the points are pulled towards this arbitrary point P some will be pulled away from the the center of Gaussian, some are pulled towards the center, and some are squeezed so that they are pulled away from the center in the paralled direction, but squeezed closer in the perpendicular direction. Anyhow, apparently everything ends up, on average, closer to the center of the Gaussian in the end.<p>I&#x27;m not entirely sure what to make of this result. Perhaps it means that mean squared error is a silly error metric?
评论 #40266182 未加载
评论 #40275744 未加载
评论 #40267596 未加载
mitthrowaway2about 1 year ago
Sorry, I&#x27;m siding with the physicists here. If you&#x27;re going to declare that your seemingly arbitrary choice of coordinate system is actually not arbitrary and part of your prior information about where the mean of the distribution is suspected to be, you have to put that in the initial problem statement.
评论 #40263372 未加载
评论 #40264139 未加载
credit_guyabout 1 year ago
Stein&#x27;s paradox is bogus. Somebody needs to say that.<p>Here&#x27;s one wikipedia example:<p><pre><code> &gt; Suppose we are to estimate three unrelated parameters, such as the US wheat yield for 1993, the number of spectators at the Wimbledon tennis tournament in 2001, and the weight of a randomly chosen candy bar from the supermarket. Suppose we have independent Gaussian measurements of each of these quantities. Stein&#x27;s example now tells us that we can get a better estimate (on average) for the vector of three parameters by simultaneously using the three unrelated measurements. </code></pre> Here&#x27;s what&#x27;s bogus about this: the &quot;better estimate (on average)&quot; is mathematically true ... for a certain definition of &quot;better estimate&quot;. But whatever that definition is, it is irrelevant to the real world. If you believe you get a better estimate of the US wheat yield by estimating also the number of Wimbledon spectators and the weight of a candy bar in a shop, then you probably believe in telepathy and astrology too.
评论 #40264460 未加载
评论 #40266204 未加载
评论 #40265230 未加载
jpreteabout 1 year ago
My intuition is that the problem is in using squares for the error. The volume of space available for a given distance of error in 3-space is O(N^3) the magnitude of the error, so an error term of O(N^2) doesn&#x27;t grow fast enough compared to the volume that can contain that magnitude of error.<p>But I really don&#x27;t know, it&#x27;s just an intuition with no formalism behind it.
hyperbovineabout 1 year ago
Had a bit of a chuckle at the very-2024 definition of the Stein shrinkage estimator:<p>\hat{mu} = ReLU(…)
评论 #40267512 未加载
pfortunyabout 1 year ago
I do not get it: if variance is too large, a random sample is very little representative of the mean. As simple as that?<p>Now the specific formula may be complicated. But otherwise I do not understand the “paradox”? Or am I missing something?
评论 #40264316 未加载
zeroonetwothreeabout 1 year ago
I don’t understand the picture with the shaded circle. Sure the area to the left is smaller, but it also is more likely to be chosen because in a Gaussian values closer to the mean are more likely. So the picture alone doesn’t prove anything.
评论 #40266992 未加载
fromMarsabout 1 year ago
Can someone confirm the validity of the section called ”Can we derive the James-Stein estimator rigorously?&quot;?<p>The claim that the best estimator must be smooth seemed surprising to me.
moi2388about 1 year ago
What a great read!
TibbityFlandersabout 1 year ago
I&#x27;m horrible at stats, but is this saying that if I have 5 jars of pennies, and I guess the amount in each one. Then I find the average of all my guesses, and the variance between the guesses, then I can adjust each guess to a more likely answer with this method?
评论 #40263522 未加载
评论 #40263294 未加载