科技回声 (Tech Echo)
Technical Debt in Machine Learning Systems (2015) [pdf]

107 points by earino, more than 5 years ago

3 comments

alfalfasprout, more than 5 years ago
The combination of glue code and pipeline jungles is, along with feature engineering, one of the biggest pain points we've observed in users. This stuff gets copied and pasted everywhere, turns unmaintainable, and then is next to impossible to optimize.

It's as if a lot of ML framework authors believe that most users are researchers... in reality, data is rarely clean, rarely in the right format, and usually needs to be intermingled and transformed with other data before it can be useful.
joe_the_user, more than 5 years ago
This seems like a very useful framework in which to consider ML systems.

The thing about an ML system as such is that it is intended to turn big mounds of data into predictions/classifications without a human having to directly consider the multitude of questions otherwise addressed in large-scale software design. I.e., a multitude of boundaries and criteria are replaced by one criterion: "it works". The thing is that this set of boundaries and criteria still exists even if the individual setting up the system considers the situation solved. This manifests both as the world changing over time and as other people perhaps not being as satisfied with the results of the system as those who created it, these being just two potential gotchas.
mistrial9, more than 5 years ago
YNews https://news.ycombinator.com/item?id=17341128