>Controlling for a collider leads to correlation<p>This is a big one that most people are not aware of. Quite often, in economics, medicine, and epidemiology, you'll see researchers adjust for everything in their regression model: income, physical activity, education, alcohol consumption, BMI, ... without realizing that they could easily be inducing collider bias.<p>A much better, but rare, approach is to sit down with some subject matter experts and draft up a DAG - directed acyclic graph - that makes your assumptions about the causal structure of the problem explicit. Then determine what needs to be adjusted for in order to get a causal estimate of the effect. When you're explicit about your causal assumptions, it makes it easier for other researchers to propose different causal structures, and see if your results still hold up under alternative causal structures.<p>The DAGitty tool [1] has some cool examples.<p>[1] <a href="https://www.dagitty.net/dags.html" rel="nofollow">https://www.dagitty.net/dags.html</a>
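To make the collider point concrete, here is a minimal R sketch (my own toy example, not the article's code; the variable names are invented for illustration). Two independent causes of a common effect become correlated once you select on, or adjust for, that effect:<p><pre><code>  set.seed(1)
  n <- 1e5
  talent <- rnorm(n)                     # independent cause 1
  looks  <- rnorm(n)                     # independent cause 2
  fame   <- talent + looks + rnorm(n)    # collider: caused by both

  cor(talent, looks)                          # ~0: marginally independent
  cor(talent[fame > 1], looks[fame > 1])      # negative: selecting on the collider

  # "Adjusting" for the collider in a regression does the same thing:
  coef(lm(talent ~ looks))["looks"]           # ~0
  coef(lm(talent ~ looks + fame))["looks"]    # clearly negative
</code></pre>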
At the bottom, the author mentions that by "correlation" they don't mean "linear correlation", but all their diagrams show the presence or absence of a clear linear correlation, and code examples use linear functions of random variables.<p>They offhandedly say that "correlation" means "association" or "mutual information", so why not just do the whole post in terms of mutual information? I <i>think</i> the main issue with that is just that some of these points become tautologies -- e.g. the first point, "independent variables have zero mutual information" ends up being just one implication of the definition of mutual information.
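For what it's worth, the "association / mutual information" reading is easy to make concrete in R (a rough sketch of my own, using a crude histogram plug-in estimator rather than anything from the article):<p><pre><code>  set.seed(1)
  n <- 1e5
  x <- runif(n, -1, 1)
  y <- x^2                    # dependent on x, but linearly uncorrelated

  # crude plug-in estimate of mutual information (in nats) by binning
  mi <- function(a, b, bins = 20) {
    pxy <- table(cut(a, bins), cut(b, bins)) / length(a)
    px  <- rowSums(pxy)
    py  <- colSums(pxy)
    sum(pxy * log(pxy / outer(px, py)), na.rm = TRUE)
  }

  cor(x, y)          # ~0
  mi(x, y)           # clearly > 0: dependence that correlation misses
  mi(x, runif(n))    # ~0 for genuinely independent variables
</code></pre>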
Can these seven be reduced to three basic rules?<p>- controlling for a node increases correlation among pairs where both are ancestors<p>- controlling for a node does not affect (the lack of) correlation among pairs where at least one is categorically unrelated (shares no ancestry with that node)<p>- controlling for a node decreases correlation among pairs where both are related but at least one is not an ancestor
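As a quick sanity check on the third one, here is a small R simulation (mine, not from the article): Z is a common cause of X and Y, so the pair is related but neither is an ancestor of Z, and controlling for Z removes their correlation:<p><pre><code>  set.seed(1)
  n <- 1e5
  z <- rnorm(n)
  x <- z + rnorm(n)
  y <- z + rnorm(n)    # x and y share the ancestor z, neither causes the other

  cor(x, y)                                          # ~0.5: related via z
  cor(residuals(lm(x ~ z)), residuals(lm(y ~ z)))    # ~0 after controlling for z
</code></pre>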
Rule 2 (“causation creates correlation”) would be strongly disputed by a lot of people. It relies on the assumption of “faithfulness”, which is not discussed until the bottom of the article.<p>This is a very innocent-sounding assumption, but it's actually quite strong. In particular, it may be violated when there are control systems or strategic agents as part of the system you want to study, which is often the case for causal inference. In such scenarios (e.g. the famous thermostat example) you could have strong causal links which are invisible in the data.
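Here is a toy version of the thermostat point in R (my own sketch, not from the article): the outdoor temperature and the heater both causally affect the room, but because the controller reacts to the outdoor reading and cancels the disturbance, the causal link leaves (almost) no trace in the data:<p><pre><code>  set.seed(1)
  n <- 1e4
  outdoor <- rnorm(n, mean = 10, sd = 5)
  heater  <- 20 - outdoor                            # controller cancels the disturbance
  indoor  <- outdoor + heater + rnorm(n, sd = 0.1)   # both cause the indoor temperature

  cor(outdoor, indoor)    # ~0 despite outdoor -> indoor being a strong causal link
  cor(heater, indoor)     # ~0 as well
</code></pre>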
> Independent variables are not correlated<p>But it's important to remember that <i>dependent</i> variables can also be <i>not correlated</i>. That is, <i>no correlation</i> does <i>not</i> imply independence.<p>Consider this trivial case:<p>X ~ Uniform(-1,1)<p>Y = X^2<p>Here Cor(X,Y) = 0, despite the fact that Y's value is completely determined by the value of X.
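Easy to check numerically in R (a quick sketch of the example above):<p><pre><code>  set.seed(1)
  x <- runif(1e5, -1, 1)
  y <- x^2
  cor(x, y)    # ~0, even though y is a deterministic function of x
</code></pre>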
Humble reminder of how easy R is to use. Download and install R for your operating system: <a href="https://cran.r-project.org/bin/" rel="nofollow">https://cran.r-project.org/bin/</a><p>Start it in the terminal by typing:<p><pre><code> R
</code></pre>
Copy/paste the code from the article to see it run!
Are the assumptions "No spurious correlation", "Consistency", and "Exchangeability" ever actually true? If a dataset's big enough you should generally be able to find at least one weird correlation, and the others are limits of doing statistics in the real world.
I'm keeping this link, taking a backup, and handing it out whenever I can. It is succinct and effective.<p>These are concepts I find myself constantly having to explain and teach, and they are critical to problem solving.
I highly suggest this paper here for a more complete view of causality that nests do-calculus (at least in economics):<p>Heckman, JJ and Pinto, R. (2024): “Econometric causality: The central role of thought experiments”, Journal of Econometrics, v.243, n.1-2.
> Rule 8: Controlling for a causal descendant (partially) controls for the ancestor<p>perhaps this is a quaint or wildly off-base question, but an honest one, so please forgive any ignorance:<p>Isn't this essentially defining the partial derivative? Would one arrive at the calculus definition of a partial derivative by following this?
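For concreteness, here is a small R sketch (my own toy example, not the article's) of what the rule claims in regression terms: z confounds x and y, d is a noisy descendant of z, and adjusting for d removes only part of the confounding, with noisier d removing less:<p><pre><code>  set.seed(1)
  n <- 1e5
  z <- rnorm(n)
  x <- z + rnorm(n)
  y <- z + rnorm(n)             # x has no direct effect on y
  d <- z + rnorm(n, sd = 0.5)   # descendant of z, i.e. a noisy proxy for it

  coef(lm(y ~ x))["x"]          # ~0.5: confounded by z
  coef(lm(y ~ x + d))["x"]      # between 0 and 0.5: partial control via the descendant
  coef(lm(y ~ x + z))["x"]      # ~0: controlling for the ancestor itself
</code></pre>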
This is brilliant. The whole causal inference thing is something I only came across after university; either I missed it or it is a hole in the curriculum, because it seems incredibly fundamental to our understanding of the world.<p>The thing that made me read into it was a quite interesting sentence from LessWrong, saying that the common idea that correlation does not imply causation is actually wrong. Now, it's not wrong in the face-value sense; it's wrong in the sense that you actually can use correlations to learn something about causation, and there turns out to be a whole field of study here.