TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

The New AI Moats

6 点作者 allenleein大约 2 年前

3 条评论

scottlawson大约 2 年前
To say there is a moat or there is no moat is a bit silly because it&#x27;s not a binary thing &quot;moat or no moat&quot;. It&#x27;s an oversimplification to a point where it obscures what is really happening, which is that some moats are getting smaller, sometikes significantly smaller.<p>The author correctly points out recent trends where things that were significant moats, like large text and image datasets are now more accessible to individuals.<p>But to conclude as a main point that therefore data is not a moat anymore is just absurd. Datasets serve different problem domains. We now have accessible datasets for internet text and code, and images from the internet. Talking about whether data is a moat only makes sense with context of what problem domains you are talking about.<p>Everyone can download the entire text corpus of wikipedia. That&#x27;s not a moat.<p>Can everyone download an enormous high quality self driving car dataset, comparable to what Google has constructed from Google maps and many human labelers? Obviously that data is a huge moat.<p>How about a comprehensive protein dataset? That&#x27;s something that&#x27;s gotten much more accessible especially with Alpha Fold, which is great.<p>But there are still other problem domains where data is a huge moat because collecting the data is super duper expensive and&#x2F;or requires an expert team of biologists, for example.<p>To make an analogy, data is like drugs. There are different classes of drugs. Different drugs are used to solve different problems. Some cure you, some harm you, some are well understood and some are not. Some are more accessible than others. But it would be folly to talk about drugs as being good or bad or cheap or expensive without context of problem domains.
version_five大约 2 年前
Data is still a moat. I understand the point, that it&#x27;s not a big differentiator among foundation or demonstration models. But most nontrivial commercial ML applications are not going to work with public datasets. What you can scrape off the web is just not representative.
theage大约 2 年前
Moat madness! Might as well be asking what&#x27;s our strategy to sit on our laurels when we get there?<p>The winner will be the one who wanted winning enough to set sail without a destination, not the one who had dreams of hoarding the current year&#x27;s tech interactions forever.