Simpson’s Paradox (2016)

370 pointsby mromniaabout 6 years ago

20 comments

mcguireabout 6 years ago

I'd like to say that the author has been reading The Book of Why, but it seems that he hasn't because he missed the punch line of the section on the paradox: you need a causal model to separate the two branches of the paradox. It's as easy to construct examples where the overall view is correct as it is so construct examples where the separate views are.

评论 #19290378 未加载

评论 #19314288 未加载

knappaabout 6 years ago

The sex-discrimination lawsuit against UC Berkley seems to be a kind of academic urban myth; the administration was apparently afraid of such a lawsuit and the study was done in response to those administrative fears.

评论 #19290460 未加载

评论 #19289611 未加载

gokabout 6 years ago

The last example of software optimization causing mean slowdown because users actually use the software is so true. Another example I've seen is better ML models causing accuracy to go down; users try harder things.

freddexabout 6 years ago

I like the way this is written. Very clear and to the point, with a tone of "Hey, check out this cool thing".

评论 #19289214 未加载

IngoBlechschmidabout 6 years ago

An explorable explanation of Simpson's Paradox, neatly complementing the article, is here: <a href="https://pwacker.com/simpson.html" rel="nofollow">https://pwacker.com/simpson.html</a>

currymjabout 6 years ago

Judea Pearl’s explanations of this in terms of causality are the only way it really makes sense, in my view.<a href="https://ftp.cs.ucla.edu/pub/stat_ser/r414.pdf" rel="nofollow">https://ftp.cs.ucla.edu/pub/stat_ser/r414.pdf</a>

评论 #19291214 未加载

YeGoblynQueenneabout 6 years ago

So, this is the data that the wikipedia page on Simpson's Paradox cites for the Berkeley study, and that the author of the article has quoted:<pre><code> Men Women Department Applied Admitted Applied Admitted A [825] 62% 108 [82%] B [560] 63% 25 [68%] C 325 [37%] [593] 34% D [417] 33% 375 [35%] E 191 [28%] [393] 24% F [373] 6% 341 [7%] </code></pre> Above, I've bracketed in each pair of columns a) the sex with the most applicants and b) the sex with the most admissions, in a department. If that data is really the Berkeley data, then it's clear that the bias is against the sex with the most applicants, rather than either men or women.I can propose a mechanism for this kind of (with some abuse of terminology) selection bias. A department accepts some applications, then realises they've admitted too many applicants of one sex and start rejecting applicants from the dominant sex in an attempt to redress the balance. They make a mess of it and end up biased too far in the opposite direction than they originally started.Also note that in 4 out of 6 departments, more men applied than women, explaining why more departments appear biased against men (provided my observation holds).However, I can't be sure whether this is actually the original data because it's nowhere to be found on my pdf copy of the study (Sex bias in graduate admission) which I believe I got from here: <a href="https://homepage.stat.uiowa.edu/~mbognar/1030/Bickel-Berkeley.pdf" rel="nofollow">https://homepage.stat.uiowa.edu/~mbognar/1030/Bickel-Berkele...</a>. If anyone knows where this data actually comes from, I'd welcome a pointer.

评论 #19292586 未加载

评论 #19292571 未加载

sreanabout 6 years ago

Simpson's Paradox is one of the many phenomena that shows how different applied ML is from regular software engineering. Another one is feedback loops between decomposed subproblems.In ML encapsulation, shielding away of inner details often does not work. One needs to know what is happening on the other side of the abstraction boundary. This is a problem for managers and PM coning to ML from a purely software engineering background. They are used to encapsulation and decomposition serving them well and they expect the same.

评论 #19290352 未加载

评论 #19289783 未加载

jzlabout 6 years ago

Observation #2: the paradox is essentially describing statistical gerrymandering. :)

评论 #19292681 未加载

esquire_900about 6 years ago

This is the exact feeling I've been having for years, nicely described in an easy to understand language. At least in data science and (god forbid) behavioral psychology, you can answer any question any way you like - statistically valid - by slightly shifting the level of focus (as described here), definitions or angle of attack. The more data, the easier.Thanks for putting it in such a clear way :)

throway88989898about 6 years ago

Neatly phrased:Trends which appear in slices of data may disappear or reverse when the groups are combined.

评论 #19289635 未加载

sopooneoabout 6 years ago

In simples case at least, such as with the kidney stones, can we reduce our risk of reaching wrong conclusions by increasing our sample size of patients and randomizing which receive each treatment?

评论 #19289602 未加载

评论 #19289342 未加载

jzlabout 6 years ago

Cool article. My knowledge of statistics is really rusty, but isn't this another way approaching the topic of "Bayesian Thinking"? If you think about the scenarios in the article from the standpoint of predicting any given outcome in advance, male vs. female and hard department vs. easy department should be treated as "priors". Or to put it another way, Bayesian thinking means asking the question "What is the chance of X happening given Y?"A nice intro to the topic: <a href="https://betterexplained.com/articles/an-intuitive-and-short-explanation-of-bayes-theorem/" rel="nofollow">https://betterexplained.com/articles/an-intuitive-and-short-...</a>Which explains why a positive test on a mammogram means you only have an 8% chance of having breast cancer:>The chance of getting a real, positive result is .008. The chance of getting any type of positive result is the chance of a true positive plus the chance of a false positive (.008 + 0.09504 = .10304).>So, our chance of cancer is .008/.10304 = 0.0776, or about 7.8%.>Interesting — a positive mammogram only means you have a 7.8% chance of cancer, rather than 80% (the supposed accuracy of the test). It might seem strange at first but it makes sense: the test gives a false positive 9.6% of the time (quite high), so there will be many false positives in a given population. For a rare disease, most of the positive test results will be wrong.

评论 #19290464 未加载

评论 #19290943 未加载

TicklishTigerabout 6 years ago

That is not a paradox. It's just the fact that a theory about something might not hold when you take a closer look at that something.In the articles example, the admission rates of a university seemed to indicate that there is a bias against women.Zooming in and looking at the admission rates of the individual departments seem to indicate that there is a bias against men.The article makes it sound like the first theory was wrong. And the second theory - the bias against men - is the real truth.Zooming in further might indicate the opposite again.Take two boxers. So far, one of them has won 86% of his fights and the other one has won 100%. According to the article, "The data is clear".Now we add more data:One fighter is Mike Tyson. He won 50 of his 58 fights. The other one is me. I did one fight in kindergarden and won it. But to be honest: I would not want to fight Tyson. As paradox as it sounds.

评论 #19289473 未加载

评论 #19289334 未加载

评论 #19289490 未加载

评论 #19289337 未加载

评论 #19289347 未加载

air7about 6 years ago

This is one of my favorite paradoxes too. Here's why:"... given the same table, one should sometimes follow the partitioned and sometimes the aggregated data, depending on the story behind the data, with each story dictating its own choice. Pearl considers this to be the real paradox behind Simpson's reversal." [0][0]<a href="https://en.wikipedia.org/wiki/Simpson%27s_paradox" rel="nofollow">https://en.wikipedia.org/wiki/Simpson%27s_paradox</a>

评论 #19298131 未加载

emmelaichabout 6 years ago

I sometimes wonder why people expect there to be any fixed, categorical semantic relationship between any set of numbers and set of natural language statements.Very rarely do the words or the numbers cover even a tiny amount of the possible interpretations.

_bxg1about 6 years ago

This is basically how gerrymandering works, isn't it?

jdhzzzabout 6 years ago

I am reminded of this XKCD comic <a href="https://xkcd.com/2080/" rel="nofollow">https://xkcd.com/2080/</a>.

clircleabout 6 years ago

Iirc, you can guard against simpson's paradox by designing/collecting balanced data

评论 #19289619 未加载

lettergramabout 6 years ago

Idk why the 2016 needs to be in the title here. I understand for date relevant content, but this is not.

评论 #19289860 未加载

评论 #19289823 未加载

20 comments

mcguireabout 6 years ago

评论 #19290378 未加载

评论 #19314288 未加载

knappaabout 6 years ago

评论 #19290460 未加载

评论 #19289611 未加载

gokabout 6 years ago

freddexabout 6 years ago

I like the way this is written. Very clear and to the point, with a tone of "Hey, check out this cool thing".

评论 #19289214 未加载

IngoBlechschmidabout 6 years ago

An explorable explanation of Simpson's Paradox, neatly complementing the article, is here: <a href="https://pwacker.com/simpson.html" rel="nofollow">https://pwacker.com/simpson.html</a>

currymjabout 6 years ago

评论 #19291214 未加载

YeGoblynQueenneabout 6 years ago

评论 #19292586 未加载

评论 #19292571 未加载

sreanabout 6 years ago

评论 #19290352 未加载

评论 #19289783 未加载

jzlabout 6 years ago

Observation #2: the paradox is essentially describing statistical gerrymandering. :)

评论 #19292681 未加载

esquire_900about 6 years ago

throway88989898about 6 years ago

Neatly phrased:Trends which appear in slices of data may disappear or reverse when the groups are combined.

评论 #19289635 未加载

sopooneoabout 6 years ago

In simples case at least, such as with the kidney stones, can we reduce our risk of reaching wrong conclusions by increasing our sample size of patients and randomizing which receive each treatment?

评论 #19289602 未加载

评论 #19289342 未加载

jzlabout 6 years ago

评论 #19290464 未加载

评论 #19290943 未加载

TicklishTigerabout 6 years ago

评论 #19289473 未加载

评论 #19289334 未加载

评论 #19289490 未加载

评论 #19289337 未加载

评论 #19289347 未加载

air7about 6 years ago

评论 #19298131 未加载

emmelaichabout 6 years ago

_bxg1about 6 years ago

This is basically how gerrymandering works, isn't it?

jdhzzzabout 6 years ago

I am reminded of this XKCD comic <a href="https://xkcd.com/2080/" rel="nofollow">https://xkcd.com/2080/</a>.

clircleabout 6 years ago

Iirc, you can guard against simpson's paradox by designing/collecting balanced data

评论 #19289619 未加载

lettergramabout 6 years ago

Idk why the 2016 needs to be in the title here. I understand for date relevant content, but this is not.

评论 #19289860 未加载

评论 #19289823 未加载