TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Mobile Visual Search

27 点作者 kljensen大约 15 年前

12 条评论

showerst大约 15 年前
I gave this a try too with some borrowed items around the office; here are my results:<p><pre><code> Blackberry - "Blackberry Cell Phone" Flat Screen TV - No match (but there were tons of reflections) Motorola Cell phone - "Cell Phone" Ikea French Press - "Glass Coffee Press"</code></pre> Extra impressive since it was transparent!<p><pre><code> Klean Kanteen metal water bottle, non logo side - "Silver Metal Water Bottle" Klean Kanteen metal water bottle, logo side - "Kleen Kanteen" Black Stapler, odd angle - "Black Stapler" Roll Call Newspaper - "Roll Call Capitol Hill Newspaper" Vanity Fair cover - "Vanity Fair" Economist, Feb 13 - "Economist New Dangers for World Economy" (The title) Microsoft Natural Keyboard 3000 - "Ergonomic Keyboard" </code></pre> (I was a bit disappointed in this one, it's a very distinct product)<p><pre><code> Front of the book "Ambient Findability" - "Ambient Findability" Back(!) of the book "Ambient Findability" - "O'reilly Ambient Findability" Trader Joes store brand Apple Juice - "Apple Juice Trader Joes" Urban Outfitters sunglasses - "Sunglasses" Ken Cole messenger bag - "Black leather pouch" </code></pre> Overall I'm VERY impressed, although it looks like it's mostly just reading any available text, and that they're not canonicalizing their entries (So 'ambient findability' and 'oreilly ambient findability' aren't really records pointing to the book, just strings of text).<p>Still, identifying an upside down stapler from the side, and a book from the back cover is pretty darn impressive.<p>I also noticed experienced a noticeable battery drain and warm iphone from using this thing for 10 minutes or so, but definitely nothing show stopping.
jcromartie大约 15 年前
I'm pretty positive that they have a bunch of people typing these things up as they come in.<p>I took roughly the same picture of the same book twice, and it returned two distinct results:<p>"Godel Escher Bach An Eternal Golden Braid" vs. "Godel Escher Back Book"<p>Then I tried a WarCraft III CD case:<p>"Warcraft Dvd" vs. "Warcraft Video Game"<p>Some noodles:<p>"Neoguri Snack" vs. "Neoguri Spicy Seafood Udon Noodles"<p>JBL Creature Speakers:<p>"Jbl Speakers" vs. "Computer Speaker" vs. "Jbl Computer Speaker"
评论 #1168675 未加载
nitesher大约 15 年前
They use crowdsourcing:<p>We're a collaboration of scientists at UC Berkeley and UC Davis and our image labeling engine can take any photo and accurately label it. We use computer vision and an innovative crowdsourcing network to recognize images, and the architecture learns over time, so it gets smarter with each search.
评论 #1168082 未加载
rubyrescue大约 15 年前
Desk objects are fine but the people that would really use this, to me, are fashionistas trying to figure out whose shoes that girl over there at the bar is wearing...if they can make this work in 'real world' social environments, they've got something pretty interesting.
eam大约 15 年前
I couldn't resist to download this app, so I did, and tested it. Took a picture of a red logitech wireless mouse and it was able to recognized in a few seconds. It knew it was a red logitech brand mouse, though it didn't' get the model right, but nonetheless it was able to classify it pretty well. I also took a picture of my toshiba laptop, this time it only knew it was a laptop.<p>Overall it's pretty awesome! They're doing a fine job with this thus far. I applaud them.
ilamont大约 15 年前
This is very cool, if the tool works as shown for most queries.<p>One thing I noticed: The battery meter on the iPhone went from nearly full to nearly empty in the course of the demo. I was wondering whether that type of app could draw that much juice, or if there was another explanation -- lots of outtakes, failed search examples that didn't make it into the demo, etc.
aliston大约 15 年前
Has anyone used their API? I can see a lot of potentially interesting applications, but my gut feeling is that this is still a problem in the academic realm -- though it might work in some textbook cases, it would probably fail/tag incorrectly in a lot of real world cases.
liuliu大约 15 年前
Can someone point the foundation paper for their technology? I've Googled some of their scientists, and most of them are working on neural related area (some work on sparse coding, I guess that is remotely related to this).
jhuckestein大约 15 年前
Check the IQEngines link on the bottom and their demo on <a href="http://www.iqengines.com/demo.php" rel="nofollow">http://www.iqengines.com/demo.php</a> . Pretty impressive (iff it's legit).
tbgvi大约 15 年前
I see Amazon, eBay, etc.. it would be cool if they added something like Milo for local search. Might be a better use case (save money by finding a lower price for something locally)
kljensen大约 15 年前
Video w/ fingers over book is uber-cool. Need to touch up their website &#38; make video more like the original dropbox vid. (I.e. entertaining)
Tichy大约 15 年前
Also try Google Goggles, I suppose.