Principles of good data analysis

108 points by gjreda, about 11 years ago

4 comments

westurner, about 11 years ago
Helpful; thanks!

"Ten Simple Rules for Reproducible Computational Research" http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003285

* Rule 1: For Every Result, Keep Track of How It Was Produced
* Rule 2: Avoid Manual Data Manipulation Steps
* Rule 3: Archive the Exact Versions of All External Programs Used
* Rule 4: Version Control All Custom Scripts
* Rule 5: Record All Intermediate Results, When Possible in Standardized Formats
* Rule 6: For Analyses That Include Randomness, Note Underlying Random Seeds
* Rule 7: Always Store Raw Data behind Plots
* Rule 8: Generate Hierarchical Analysis Output, Allowing Layers of Increasing Detail to Be Inspected
* Rule 9: Connect Textual Statements to Underlying Results
* Rule 10: Provide Public Access to Scripts, Runs, and Results

Sandve GK, Nekrutenko A, Taylor J, Hovig E (2013) Ten Simple Rules for Reproducible Computational Research. PLoS Comput Biol 9(10): e1003285. doi:10.1371/journal.pcbi.1003285
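
A rough sketch of how Rules 1 and 6 (and, partially, Rule 3) might look in practice, using only the Python standard library. The function names `run_analysis` and `run_with_provenance` and the toy "analysis" itself are made up for illustration; they do not come from the paper or the linked post.

```python
import json
import platform
import random
import sys


def run_analysis(seed):
    """Toy analysis: mean of 1,000 Gaussian draws, fully determined by the seed."""
    rng = random.Random(seed)  # local generator, so no hidden global state
    draws = [rng.gauss(0.0, 1.0) for _ in range(1000)]
    return sum(draws) / len(draws)


def run_with_provenance(seed):
    """Return the result together with the details needed to reproduce it."""
    return {
        "result": run_analysis(seed),
        "seed": seed,                                  # Rule 6: note the random seed
        "python_version": platform.python_version(),  # Rule 3 (partial): record versions
        "command": sys.argv,                           # Rule 1: how the result was produced
    }


record = run_with_provenance(seed=42)
print(json.dumps(record, indent=2))
```

Writing the record next to each output file means a result can later be regenerated from the same seed and compared against the stored value.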
ethikal, about 11 years ago
Great post! You mention that it is important to "be skeptical" - I concur and would add that it's helpful to approach the analysis from a non-biased standpoint. Even if you are going into your analysis with certain goals in mind, it is not only more ethical, but also more persuasive, to indicate any inconsistencies in your findings.
zengr, about 11 years ago
I think for "Profile your data", some tools like OpenRefine really help. http://openrefine.org
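
For a quick look before reaching for a dedicated tool, a per-column profile can be sketched in plain Python. This is not OpenRefine's API; `profile_rows` and the sample data are hypothetical, and the summary is deliberately minimal (row count, missing values, distinct values, most common value).

```python
import csv
import io
from collections import Counter


def profile_rows(rows):
    """Summarise each column: rows, missing, distinct, and the most common value."""
    rows = list(rows)
    columns = list(rows[0].keys()) if rows else []
    profile = {}
    for col in columns:
        values = [row[col] for row in rows]
        present = [v for v in values if v not in ("", None)]  # treat blanks as missing
        counts = Counter(present)
        profile[col] = {
            "rows": len(values),
            "missing": len(values) - len(present),
            "distinct": len(counts),
            "most_common": counts.most_common(1)[0] if counts else None,
        }
    return profile


# Made-up sample with an inconsistent spelling and a missing value.
SAMPLE_CSV = """city,temp_c
Oslo,4
oslo,5
Bergen,
Oslo,4
"""

profile = profile_rows(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for column, stats in profile.items():
    print(column, stats)
```

Even this small summary surfaces the kind of problem profiling is meant to catch: "Oslo" and "oslo" count as separate values, and one temperature is missing.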
gjreda, about 11 years ago
I forgot to mention reproducibility. Show your work (share the code).