TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Lean, mean data science machine

13 点作者 jeroenjanssens超过 11 年前

1 comment

gjreda超过 11 年前
I&#x27;m super interested in the chapter on creating reusable command line tools.<p>I&#x27;ve found the command line to be ideal for performing a lot of simple, memory-intensive tasks (filtering&#x2F;munging&#x2F;sorting&#x2F;etc. a massive text file).<p>However, after data collection (and munging), data science is typically A LOT of _exploratory_ analysis. I think it&#x27;s extremely important that all practitioners approach analysis with the mindset of making it easily reproducible (and if possible, flexible - don&#x27;t hard code date ranges, file paths, etc.).<p>I tend to stick with IPython Notebook (and heavily recommend it). I fear that heavy analysis at the command line would consist of too many one-liners and thus be difficult to read and maintain.
评论 #6867358 未加载