TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Misleading Graph Generator

68 pointsby gulbrandrabout 11 years ago

12 comments

jereabout 11 years ago
This isn&#x27;t a graph generator. It&#x27;s just a graph. You can plug data into it and get a graph, I guess, but the same goes for Excel.<p>A misleading graph generator that automatically matched concurrent data sets based on correlation would be quite interesting, but this isn&#x27;t it.
评论 #7287225 未加载
mrtksnabout 11 years ago
How is this graphs fault? The only fault the graph has is that it clearly displays the data, the problem is in the idea that is represented. It&#x27;s formally called &quot;Correlation does not imply causation&quot; and it&#x27;s fault of the person who is suggesting it.<p>There is a famous satirical version of this too: <a href="http://en.wikipedia.org/wiki/File:PiratesVsTemp(en).svg" rel="nofollow">http:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;File:PiratesVsTemp(en).svg</a><p>It has nothing to do with graphs, graphs are are just visual tools and people reading data from them are supposed to evaluate it just like reading data from any other tool and not jump into conclusions.<p>What the graph in question suggests is that both &quot;transistor count&quot; and &quot;average life expectancy in germany&quot; have risen trough time and if you are reading this as &quot;rising transistor count increases life expectancy&quot; it&#x27;s your fault. Why not read it like &quot;from 1971 to 2011 both transistor count and life expectancy increased steadily, maybe because of the advances in technology - we should look into it, it&#x27;s too early to say anything&quot;
评论 #7285689 未加载
评论 #7286175 未加载
评论 #7285972 未加载
评论 #7286032 未加载
评论 #7288161 未加载
评论 #7287760 未加载
Homunculiheadedabout 11 years ago
I hear more and more chanting of &quot;correlation does not equal causation!&quot; which is great if your goal is to form a causal model of the world, but there are plenty of insights you can arrive at from correlation alone.<p>For starters in the world of machine learning and predictive analytics, it doesn&#x27;t really matter if X causes Y so long as X is a consistently good predictor of Y. Maybe powerlines being over someone&#x27;s home are not the cause of cancer, but if their presence can be used to predict cancer rates that&#x27;s a good thing.<p>More important imho is the idea of latent or hidden variables. Two things that are clearly correlated but also seem to not have a causal relationship (just as transistors and longevity) may share a latent variable, that may be either non-quantifiable or completely unobservable. For either case measuring outputs that share a common latent variable and thus correlate with each other might be the only way to attempt to measure hidden, non-quantifiable causes.<p>For example employee happiness might be the cause of employee retention. However you can&#x27;t currently measure or observe &#x27;happiness&#x27;, but there may be many, seemingly, unrelated employee activities that correlate with retention because they are also driven by this same latent variable. Studying them is the only way to get a quantifiable understanding of this latent cause.<p>tl;dr somethimes correlation is just as important as causation.
robert_tweedabout 11 years ago
Ironically, this graph does an excellent job of showing a correlation that <i>really exists</i> between the two data sets, albeit a non-linear one.<p>The only misleading thing about this graph is the title, which states a causal link with no evidence of one.
评论 #7285854 未加载
zealonabout 11 years ago
Maybe a bit off-topic here, but I think there is a cause-effect relationship between number of transistors and life expectancy. More transistors implies more computing power. More computing power leads to better&#x2F;faster information processing, including medical information. This leads to faster patient diagnostics, better treatments (pharmaceutical innovations), earlier and more precise health warnings (lab tests an medical equipment), and so on.<p>Faster and better information processing leads also to higher food quality (food processing plants), higher life quality (environmental temperature and humidity control), etc.<p>Germany is a highly industrialized country, so information processing power causes a big social impact.
评论 #7286321 未加载
NPMaxwellabout 11 years ago
For causation, you need A&#x2F;B testing -- real A&#x2F;B testing (randomized controlled experiments).<p>Not everyone understands the need for A&#x2F;B testing and not everyone understands how to do A&#x2F;B testing.<p>I recently worked on modeling to forecast revenues for a large multinational. One of the variables that was essential for an accurate model was whether prior data came from before or after the company hired their current A&#x2F;B testing guru. With the new guru, revenues rose every week. With the previous &quot;guru&quot;, changes by Engineering had no impact on revenues.
jkremsabout 11 years ago
Everyone&#x27;s talking about &quot;correlation vs. causation&quot; but I&#x27;m pretty sure the page explicitly states that that&#x27;s not what this is about:<p>&gt; if its axes are chosen unwisely<p>The graph is misleading because it makes it look like the transistor count and the life expectancy grow at the exact same rate. It pretends there&#x27;s a linear correlation. And that&#x27;s the misleading part.
IvyMikeabout 11 years ago
Those who have not read &quot;How to Lie With Statistics&quot; are doomed to reinvent it.<p><a href="https://archive.org/details/HowToLieWithStatistics" rel="nofollow">https:&#x2F;&#x2F;archive.org&#x2F;details&#x2F;HowToLieWithStatistics</a>
评论 #7287247 未加载
squigs25about 11 years ago
This is only one misleading graph, which, to be honest, I find misleading.
breunigsabout 11 years ago
I feel that many missed my point: It’s not to say causation never implies correlation, that there’s no common source or that latent factors are fiction.<p>In my experience many people believe something is true, just because of “math” or “data”. So, this is basically a variation of the joke that 73.37% of people put more trust into statistics, if the value isn’t rounded. Since you all critically discussed the topic, you are far beyond of what this can little project can teach you.<p>If you have suggestions on how to express this more clearly, please let me know.
ehsanu1about 11 years ago
<a href="http://www.correlated.org/" rel="nofollow">http:&#x2F;&#x2F;www.correlated.org&#x2F;</a>
yohaabout 11 years ago
I think this is a very good idea. Lots of people still mix up correlation and causation. Either this tool will make them understand what this is wrong, either they will enjoy finding lots of weird relations between data sources.