I think their model should take a second pass on the words and probabilities, independent of the video.

Look at their example:

    Animal: 97.76%
    Tiger: 90.11%
    Terrestrial animal: 68.17%
So we are 90% sure it is a tiger but only 68% sure it is a land animal? That doesn't make sense: every tiger is a terrestrial animal, so the probability of "Terrestrial animal" should be at least as high as the probability of "Tiger".

It could be that this is a weakness of seeding AI training data with human labels. I can believe that 90% of people who saw the video would agree that it is a tiger, while fewer would agree it is a terrestrial animal, simply because they don't know what "terrestrial" means.
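As a rough sketch of what that second pass could look like: if the label taxonomy is known, one pass over the scores can raise every ancestor label to at least the score of its most probable descendant. The `parents` map below is a hypothetical stand-in; I have no idea what hierarchy (if any) their system actually uses.

    # Minimal sketch of a consistency pass over label scores, assuming a
    # known label hierarchy. The taxonomy here is hypothetical, not taken
    # from their system.
    parents = {
        "Tiger": "Terrestrial animal",
        "Terrestrial animal": "Animal",
    }

    def enforce_hierarchy(scores: dict[str, float]) -> dict[str, float]:
        """Raise each ancestor's score to at least its descendants' scores,
        since evidence for "Tiger" is also evidence for its ancestors."""
        fixed = dict(scores)
        for label in scores:
            ancestor = parents.get(label)
            while ancestor is not None:
                # An ancestor can never be less probable than a descendant.
                fixed[ancestor] = max(fixed.get(ancestor, 0.0), fixed[label])
                ancestor = parents.get(ancestor)
        return fixed

    scores = {"Animal": 0.9776, "Tiger": 0.9011, "Terrestrial animal": 0.6817}
    print(enforce_hierarchy(scores))
    # {'Animal': 0.9776, 'Tiger': 0.9011, 'Terrestrial animal': 0.9011}

Taking the max is just one choice; the point is only that some post-hoc step could keep the reported numbers consistent with the label hierarchy, whatever the raw classifiers output.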