TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Statistics vs. Machine Learning, fight

46 点作者 zjj超过 15 年前

3 条评论

tel超过 15 年前
Statistics has a pretty interesting stigma in science. <i>It's something you're not allowed to question.</i> Generally.<p>You can question if someone's models are right. You can question if their controls narrowed the experiment well enough. You can question if their interpretation makes any sense. What you can't question is if something like a t-test, LLS, or ANOVA is the best way to pull meaningful parameters out of the data. Just look at how much resistance Bayesian methods face in publication.<p>This is a fundamental friction that statistics has to overcome as long as its still called statistics. I think of ML as the parts of statistical research that escaped through the window opened by Shannon back when he invented information theory. It's a bird now, free to invent its words for the world and try crazy stuff that the religion of "Statistics" could never accept.<p>This isn't to say that Statistics isn't growing. The article itself does a good job pointing out just how similar recent Stat has been to ML. However, if you see a research paper in some of the more core, less data intensive sciences that dares to drop "SVM", "Ridge Regression", "Clustering", "Bayes", or god forbid "Machine Learning" itself you see scowls: isn't your data <i>normal enough</i>? Why do you need to do something fancy when I can work out the z-score of that result right here on my pocket calculator/slide rule.<p>(<i>Lets go ahead and concede that ML certainly has a lot of broken yet overhyped parts which helps form the nucleus of an argument not to infect scientific knowledge with some untested infrastructure. Growing pains.</i>)<p>It's a classic fight between tradition and innovation with all the usual arguments available, but what makes this different is that such a huge community of people who thrive off the image of really, <i>really</i> knowing things pretty much take the frequentist methodologies as an unimpeachable gold standard. Things can get dicey when you start to ask what the actual meaning of a p-value is, how we really know anything about "estimators", why people work so hard not to use computers. <i>It works! Stop asking questions.</i>
jibiki超过 15 年前
"What differs most is the teaching style. CS has far better lecture notes. Of course, the stats people wrote a very good book; but better lecture notes win because I can access them later and send them to people for free."<p>So very true. When I was a freshman in high school, we had mandatory "study hall" periods where we just had to sit in a room and be quiet. I spent most of them doing crosswords and sleeping, but I also spent a lot of time looking at printouts of these notes:<p><a href="http://rutherglen.ics.mq.edu.au/wchen/lnentfolder/lnent.html" rel="nofollow">http://rutherglen.ics.mq.edu.au/wchen/lnentfolder/lnent.html</a><p>It's really important to realize that not everyone who is interested in a field has access to a library full of relevant publications.
patio11超过 15 年前
And AI is the red-headed stepchild caught in the crossfire, where any approach that actually produces worthwhile results is promptly excommunicated from the field.<p>"If I create a program which successfully predicts which humans are trustworthy and which are untrustworthy, would that be AI?"<p>"Are you kidding?! That would revolutionize the field! That's harder than winning the Turing Prize! That's so far ahead of anything ever done it is hard to even imagine what it would look like!"<p>"The algorithm exists. Its output is called a FICO score."<p>"Bah, that isn't AI."
评论 #850376 未加载