The Forgotten Job of a Data Scientist: Editing

22 points by rouli about 11 years ago

2 comments

novum about 11 years ago
Data Scientist (noun): A statistician who lives in San Francisco.

(only half joking)
michaelochurch about 11 years ago
"Data scientist" is a mess of a job title. It seems to be as much of a reaction against the commoditization of software engineering (which leaves the smartest, and by correlation, usually the most mathematically literate, 10% of programmers ill-suited for the average software job) as it is a real distinction.

There are plenty of "data scientists" who use canned tools and play around with parameters because that's all "the business" thinks it needs.

You want to trim complexity for a reason that any data scientist worth his salt (and there are plenty of celebrity engineers in SF making $500k who aren't worth their salt and don't know this) should already know: bias-variance tradeoff (see also: underfitting and overfitting). If your model is too flexible/complex, it will begin absorbing noise. That leads to a model that performs extremely well on training data but fails miserably on unseen data. There are well-studied techniques for preventing this, but I'd guess that fewer than 20% of self-described or titled "data scientists" are familiar with them.
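
To make the overfitting point concrete, here is a rough sketch rather than anything taken from the comment itself. It assumes a toy dataset (a noisy sine curve) and uses k-nearest-neighbors regression purely as a stand-in for "a model whose flexibility you can dial up". The function names (`make_data`, `knn_predict`, `compare`) and all parameters are made up for illustration. With k=1 the model memorizes every training point, noise included; a larger k averages the noise away. Comparing error on training data against error on held-out data is one standard way to spot the problem.

```python
import math
import random


def make_data(n, rng, noise=0.5):
    """Sample n points from y = sin(x) + Gaussian noise on [0, 2*pi] (toy data)."""
    xs = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    ys = [math.sin(x) + rng.gauss(0.0, noise) for x in xs]
    return xs, ys


def knn_predict(train_x, train_y, x, k):
    """Predict y at x as the mean target of the k nearest training points."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / k


def mse(train_x, train_y, xs, ys, k):
    """Mean squared error of the k-NN model on the points (xs, ys)."""
    errors = [(knn_predict(train_x, train_y, x, k) - y) ** 2 for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)


def compare(ks=(1, 9), seed=0):
    """Return {k: (train_mse, test_mse)} for each k, using a fixed seed."""
    rng = random.Random(seed)
    train_x, train_y = make_data(200, rng)
    test_x, test_y = make_data(500, rng)  # held-out data the model never sees
    return {
        k: (mse(train_x, train_y, train_x, train_y, k),
            mse(train_x, train_y, test_x, test_y, k))
        for k in ks
    }


if __name__ == "__main__":
    for k, (train_err, test_err) in compare().items():
        print(f"k={k}: train MSE={train_err:.3f}, test MSE={test_err:.3f}")
```

The k=1 model scores a training error of zero because each training point is its own nearest neighbor, which is exactly the "performs extremely well on training data" trap. The held-out error is the number that tells you whether the extra flexibility bought anything.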