The Forgotten Job of a Data Scientist: Editing

22 points by rouli about 11 years ago

2 comments

novum about 11 years ago
Data Scientist (noun): A statistician who lives in San Francisco.

(only half joking)
michaelochurch about 11 years ago
"Data scientist" is a mess of a job title. It seems to be as much of a reaction against the commoditization of software engineering (which leaves the smartest, and by correlation, usually the most mathematically literate, 10% of programmers ill-suited for the average software job) as it is a real distinction.

There are plenty of "data scientists" who use canned tools and play around with parameters because that's all "the business" thinks it needs.

You want to trim complexity for a reason that any data scientist worth his salt (and there are plenty of celebrity engineers in SF making $500k who aren't worth their salt and don't know this) should already know: bias-variance tradeoff (see also: underfitting and overfitting). If your model is too flexible/complex, it will begin absorbing noise. That leads to a model that performs extremely well on training data but fails miserably on unseen data. There are well-studied techniques for preventing this, but I'd guess that fewer than 20% of self-described or titled "data scientists" are familiar with them.
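To make the overfitting point concrete, here is a minimal sketch in plain Python. It is not from the article or the comment: the data (a noisy sine curve), the model (k-nearest-neighbor regression), and the helper names `make_data`, `knn_predict`, and `mse` are all assumptions chosen for illustration. With k=1 the model is as flexible as it can be and memorizes the training noise; with k=15 it averages over neighbors and is less flexible.

```python
import math
import random


def make_data(rng, n, noise=0.5):
    """Sample n points from y = sin(x) + Gaussian noise (an illustrative choice)."""
    data = []
    for _ in range(n):
        x = rng.uniform(0.0, 2.0 * math.pi)
        data.append((x, math.sin(x) + rng.gauss(0.0, noise)))
    return data


def knn_predict(train, x, k):
    """Predict by averaging the targets of the k training points nearest to x."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in nearest) / k


def mse(train, data, k):
    """Mean squared error of the k-NN model (fit on train) evaluated on data."""
    return sum((knn_predict(train, x, k) - y) ** 2 for x, y in data) / len(data)


rng = random.Random(0)          # fixed seed so the run is repeatable
train = make_data(rng, 200)
test = make_data(rng, 500)      # unseen data from the same distribution

# Map each k to (training error, test error).
results = {k: (mse(train, train, k), mse(train, test, k)) for k in (1, 15)}
for k, (train_err, test_err) in results.items():
    print(f"k={k:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")
```

With k=1 every training point is its own nearest neighbor, so the training error is zero by construction, yet the error on unseen data should come out noticeably higher than for k=15. Choosing k by held-out error, as done informally here, is one of the standard ways to control model complexity.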
评论 #7729662 未加载