Introducing the Open Images Dataset

168 points by hurrycane, over 8 years ago

9 comments

imh, over 8 years ago
Lawyers are funny:

> Today, we introduce Open Images, a dataset consisting of ~9 million URLs ... having a Creative Commons Attribution license*.

Then the footnote below:

> * While we tried to identify images that are licensed under a Creative Commons Attribution license, we make no representations or warranties regarding the license status of each image and you should verify the license for each image yourself.

I think this might be the most blatant instance I've ever seen of, "We have to write this even though it's essentially impossible for you to actually follow our directions."
transcranial, over 8 years ago
Interesting that the base data consists of URLs. I guess it makes sense given copyright issues. Does anybody know the ballpark expected half-life of such URLs?
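The thread gives no measured figure for link rot. As a rough sketch of how one might estimate such a half-life from a spot check, assuming dead links accumulate at a constant exponential rate (an assumption, not something stated here), the arithmetic looks like this. The function name `estimate_half_life` and the sample numbers are hypothetical, chosen only for illustration:

```python
import math


def estimate_half_life(fraction_alive: float, elapsed_years: float) -> float:
    """Estimate the URL half-life in years under an exponential-decay assumption.

    fraction_alive: share of sampled URLs that still resolve (strictly between 0 and 1).
    elapsed_years:  time since the URL list was published.
    """
    if not 0.0 < fraction_alive < 1.0:
        raise ValueError("fraction_alive must be strictly between 0 and 1")
    if elapsed_years <= 0:
        raise ValueError("elapsed_years must be positive")
    # Exponential decay: fraction_alive = exp(-decay_rate * elapsed_years)
    decay_rate = -math.log(fraction_alive) / elapsed_years
    # The half-life is the time at which exp(-decay_rate * t) equals 0.5
    return math.log(2) / decay_rate


if __name__ == "__main__":
    # Hypothetical spot check, NOT a real measurement of Open Images:
    # 80% of a random sample of URLs still resolves one year after release.
    print(f"{estimate_half_life(0.8, 1.0):.2f} years")
```

Real link rot need not be exponential (hosts can disappear in bulk), so a result like this is only a ballpark figure.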
diyseguy, over 8 years ago
Any guesses on how large the resulting dataset would be if you actually downloaded all the images? I imagine the URLs will get removed in a hurry as everybody starts automating it.
devindotcom, over 8 years ago
First video, now images - wonder if speech and others are on the way?

It's nice that they're doing this, helps advance the art I think. But it also puts a lot of smaller operations in unis sort of under the Google system, in that they're best compared to Google's ML work and others using these datasets. It's a small way of stacking the deck to make Google and DeepMind more embedded in the community.

That said, its utility for others surely outweighs the strategic advantage gained here, so I for one welcome these libraries. A lot of work goes into them. Hopefully others will release theirs as well.
zappo2938, over 8 years ago
I'm glad I'm getting a return on all the effort clicking street signs and store fronts on reCaptcha.
pilooch, over 8 years ago
I've put an efficient downloader here for the interested crowd: https://github.com/beniz/openimages_downloader It's a fork of the script I used to grab ImageNet.
dharma1, over 8 years ago
Is there a link to the trained model somewhere?
rocky1138, over 8 years ago
Are there any other libraries that are similar?
Omnipresent, over 8 years ago
Looking forward to someone trying a TensorFlow CNN on this.