TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Is It a Duck or a Rabbit? For Google Cloud Vision, Depends on Image Rotation

260 pointsby _0nacabout 6 years ago

22 comments

ratelabout 6 years ago
I quite surprised at the comments on HN so far as nobody seems to see the significance of this. Yes, the image is ambiguous. The point is that Google Cloud Vision gives an unambiguous answer of that image based on the rotation. Transformations of an image are regularly used to improve the results of image recognition. That process fails quit dramatically if in the course of a transformation the answer given is presented with higher confidence than should be.
评论 #19338337 未加载
评论 #19341743 未加载
评论 #19336985 未加载
评论 #19338405 未加载
评论 #19338393 未加载
minimaxirabout 6 years ago
Creator of the animation here. Most of the relevant information&#x2F;context behind the animation (including a link to the repo) is in this Reddit comment: <a href="https:&#x2F;&#x2F;reddit.com&#x2F;r&#x2F;dataisbeautiful&#x2F;comments&#x2F;aydqig&#x2F;_&#x2F;ehzyozr&#x2F;?context=1" rel="nofollow">https:&#x2F;&#x2F;reddit.com&#x2F;r&#x2F;dataisbeautiful&#x2F;comments&#x2F;aydqig&#x2F;_&#x2F;ehzyo...</a>
评论 #19338327 未加载
评论 #19339511 未加载
评论 #19335972 未加载
abhisuri97about 6 years ago
When the output switches to rabbit the picture actually resembles a rabbit. I am unsure if this experiment was supposed to be a “haha look how stupid AI is” type thing or not, but it seems like the cloud vision api is performing as intended.
评论 #19336604 未加载
评论 #19336005 未加载
评论 #19336467 未加载
WhuzzupDomalabout 6 years ago
It&#x27;s a seagull not a duck. Don&#x27;t confuse the dumb AI even more by not knowing what a duck doesn&#x27;t look like in the first place. Jeez.
评论 #19335982 未加载
Illniyarabout 6 years ago
That image is a visual illusion. I find it hard myself to detect that it&#x27;s a rabbit when it&#x27;s ears are horizontal like a mouth.<p>Not sure what is the purpose of it, is it to show that even computers vision algorithms can get confused by visual illusions?
评论 #19336158 未加载
Felzabout 6 years ago
Is it concerning that there are short, sudden drops in prediction in the middle of a block otherwise solidly classified as rabbit&#x2F;duck? I don&#x27;t know much ML, does anyone know why it&#x27;d be so discontinuous?
评论 #19338338 未加载
评论 #19345139 未加载
staredabout 6 years ago
While the title is clickbaity (as in adversarial examples for fooling neural networks e.g. by adding a baseball ball to a whale to make it shark), I think it shows a nice phenomenon. I.e. a given illusion works similarily for humans and AI alike.<p>Vide &quot;dirty mind&quot; pictures like posting <a href="https:&#x2F;&#x2F;images.baklol.com&#x2F;13_jpegbd9cb76b39e925881bdb2956fd32ac91.jpeg" rel="nofollow">https:&#x2F;&#x2F;images.baklol.com&#x2F;13_jpegbd9cb76b39e925881bdb2956fd3...</a> to Clarifai <a href="https:&#x2F;&#x2F;clarifai.com&#x2F;models&#x2F;nsfw-image-recognition-model-e9576d86d2004ed1a38ba0cf39ecb4b1" rel="nofollow">https:&#x2F;&#x2F;clarifai.com&#x2F;models&#x2F;nsfw-image-recognition-model-e95...</a> gives 88% for NSFW.
评论 #19336691 未加载
Criper1Tookusabout 6 years ago
It would be cool to visualize this as a kind of pie chart, based on where the ears&#x2F;beak is pointing. Blue for directions where it sees duck, red for rabbit, and empty for neither.
dustedabout 6 years ago
Looks like proof to me, that the classification works correctly.
footaabout 6 years ago
I wonder whether it would stay consistent if you gave it a solid background line
miguelmotaabout 6 years ago
This seems like a serious concern. What&#x27;s a possible solution to this problem? Should all orientations be considered valid types? so in this case the image should be both a duck and a rabbit as the response?
EugeneOZabout 6 years ago
On still (not animated, not rotated) preview I saw rabbit first, then in a second I found it can be a duck also, and now it takes efforts to see rabbit again (but I can do it).
Gunstig2Snathabout 6 years ago
I was ONLY seeing clockwise in all images until the counter-clockwise one went about 8 rotations and all of a sudden I saw it counter-clockwise. Now I can’t unsee it.
ChlorophZekabout 6 years ago
When I look at the anticlockwise one I can see it as going either direction. When I look at the clockwise one I can only see it going clockwise
iscrewyouabout 6 years ago
Does Google Cloud like Duck or Rabbit? That’s where the answer lies.<p>In addition, if Cloud could taste one, it would really help itself with the answer.
AlphaWeaverabout 6 years ago
I wonder if this was hardcoded&#x2F;specifically trained to do this for this image?
评论 #19336845 未加载
mrashesabout 6 years ago
There is a children&#x27;s book about this pairing: <a href="https:&#x2F;&#x2F;www.amazon.com&#x2F;Duck-Rabbit-Amy-Krouse-Rosenthal&#x2F;dp&#x2F;0811868656" rel="nofollow">https:&#x2F;&#x2F;www.amazon.com&#x2F;Duck-Rabbit-Amy-Krouse-Rosenthal&#x2F;dp&#x2F;0...</a>
Seldaekabout 6 years ago
But does it see a blue or a white dress?
MoD411about 6 years ago
it cannot be a rabbit because there is no nose nor mouth
评论 #19337259 未加载
zaviabout 6 years ago
WAI
randomsearchabout 6 years ago
It’s a drawing of a creature that looks a bit like a rabbit or a duck from different angles but is very clearly neither, at best a bad drawing. That’s the failure here - it’s classifying into one of its categories when it shouldn’t be classifying at all.
评论 #19336002 未加载
laichzeit0about 6 years ago
This is the infamous Duck-Rabit illusion, right? The classifier seems to be doing a good job.<p><a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Rabbit%E2%80%93duck_illusion" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Rabbit%E2%80%93duck_illusion</a>