TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Appreciating R: The Ease of Testing Linear Model Assumptions

41 点作者 jtloong超过 5 年前

3 条评论

triska超过 5 年前
R (and its predecessor S) is so great, thank you for writing and sharing this!<p>In a way, R is very similar to APL and J in that many operations automatically scale from scalars to vectors, and you can therefore conveniently express applications of operations to many elements at once. In fact, even its assignment operator is chosen to syntactically resemble that of APL:<p><a href="https:&#x2F;&#x2F;blog.revolutionanalytics.com&#x2F;2008&#x2F;12&#x2F;use-equals-or-arrow-for-assignment.html" rel="nofollow">https:&#x2F;&#x2F;blog.revolutionanalytics.com&#x2F;2008&#x2F;12&#x2F;use-equals-or-a...</a><p>Regarding the quote:<p><i>&quot;All statistical models and tests have underlying mathematical assumptions on the types of conditions upon we can generate reliable results (Hoekstra et al., 2012).&quot;</i><p>Indeed! There is hence also a very close connection between statistics and logic.
wjnc超过 5 年前
Have I gone overboard in remembering that Gauss-Markov is overrated? It&#x27;s nice that some assumptions guarantee BLUE, but then there is real world data and modelling. Understand where your data come from, what distributions they have, what the relationship is with your dependent variable. Testing for these assumptions might be helpful as a teaching tool, but please get to the level to be able to demonstrate that your model can fail a few of these (like heteroskedacity) and still work (or recognize you need more than a linear model).
ivan_ah超过 5 年前
It&#x27;s not surprising that R has good tools for statistical assumptions checking. It has <i>everything</i> with an API perfectly suited for stats professionals. In the Python world scipy and statsmodels implement most of the important tests, but probably not as many as the ones available in R.<p>One recent project that I looks very promising in the Python world is called Tea: a high-level language for expressing statistical analysis questions. Basically the user describes the characteristics of the data they have, their assumptions, and their hypothesis, and then the tea runtime checks all the assumptions and figures out which test can be applied to test the hypothesis. You still have to know some stats jargon, but the user-interface between human and machine is revolutionary! Here is a bunch of links I collected about tea: <a href="https:&#x2F;&#x2F;www.one-tab.com&#x2F;page&#x2F;aUF1eWnDT8CIyrwScWD2uA" rel="nofollow">https:&#x2F;&#x2F;www.one-tab.com&#x2F;page&#x2F;aUF1eWnDT8CIyrwScWD2uA</a>