Shirt Without Stripes

1676 pointsby elsamukoabout 5 years ago

141 comments

DenisMabout 5 years ago

This problem is known as "attribution" - you have a "no" or "without" in the sentence, but you don't know where it belongs. One could (and one does) argue that the problem cannot be solved with statistical methods (ML), especially not in any domain where accuracy is required, such as medical recored analysis: "no evidence of cancer" and "evidence of no cancer" are very different things.Zooming out, the language field breaks into several subfields:- A large group of Chomsky followers in academia are all about logical rules but very little in the way of algorithmic applicability, or even interest in such.- A large and well-funded group of ML practitioners, with a lot of algorithmic applicability, but arguably very shallow model of the language fails in cases like attribution. Neural networks might yet show improvement, but apparently didn't in this case.- A small and poorly funded group of "comp ling", attempting to create formalisms (e.g. HPSG) that are still machine-verifiable, and even generative. My girlfriends is doing PhD in this area, in particular dealing with modeling WH questions, so I get some glimpse into it; it's a pity the field is not seeing more interest (and funding).

评论 #22928734 未加载

评论 #22927449 未加载

评论 #22927193 未加载

评论 #22928497 未加载

评论 #22927832 未加载

评论 #22931245 未加载

评论 #22933632 未加载

评论 #22929537 未加载

评论 #22927158 未加载

评论 #22927189 未加载

评论 #22926560 未加载

评论 #22927489 未加载

评论 #22930902 未加载

评论 #22934244 未加载

评论 #22927905 未加载

评论 #22932770 未加载

评论 #22932852 未加载

评论 #22935875 未加载

评论 #22926863 未加载

评论 #22928336 未加载

评论 #22927030 未加载

rgovostesabout 5 years ago

The point that the author is making, in a very understated way, is that all three companies have PR websites that breathlessly describe their advanced AI capabilities, yet they cannot understand a very simple query that young children can.

评论 #22926402 未加载

评论 #22926210 未加载

评论 #22931239 未加载

评论 #22927418 未加载

评论 #22925434 未加载

评论 #22926614 未加载

评论 #22925794 未加载

评论 #22926159 未加载

seifertericabout 5 years ago

I have noticed in the past few years google results have become noticeable worse for similar reasons. Google used to _surprise_ me with how good it was able to understand what I was really looking for even when I put in vague terms. I remember being shocked on several occasions when putting in half remembered sentences, lyrics, expressions from something I had heard years ago and it being the first! result. I almost never have this experience anymore. Instead it seems to almost always return the "dumb" result, i.e. the things I was not looking for, even trying to avoid using clever search terms. It's almost like it is only doing basic word matching or something now. Also, usually the first page is all blogspam SEO garbage now.

评论 #22926887 未加载

评论 #22927192 未加载

评论 #22926875 未加载

评论 #22925968 未加载

评论 #22925930 未加载

评论 #22929322 未加载

评论 #22927667 未加载

评论 #22928695 未加载

transrealabout 5 years ago

Searching "men without pants" versus "men with pants" gives much better results.This is a case where, while it makes sense to say the sentence, it's not a common use of language, and at the end of the day, the search engine will find what's written down, it's not a natural language processor yet (despite any marketing).Shirt stores don't advertise "Shirts without stripes - 20% off", they describe them as "Solid shirts" or "Plain shirts". Men's fashion blogs talk about picking "solid shirts" or "plain shirts" for a particular look. If I walked into a clothing store and asked for "shirts without stripes", the sales person would most likely laugh and say "er, you mean you want plain shirts?".Plain shirts/solid shorts are the most common way to refer to these, and people seem to be searching this way:<a href="https://trends.google.com/trends/explore?date=all&q=solid%20shirts,plain%20shirts,shirts%20without%20stripes" rel="nofollow">https://trends.google.com/trends/explore?date=all&q=solid%20...</a>Regarding moving towards natural language processing - the "without" part is not as important as knowing the context.My kids will ask me to get from the bakery things like "the round bread with a hole and seeds", which I know means "sesame bagel", or "the sticky bread", which means "cinnamon twists" - which I understand because I know the context. Sometimes they say "I want the red thingy", and I need to ask a bunch of questions to eventually get at what they want (sometimes it's a red sweater, sometimes it's cranberry juice).Unless Google starts asking questions back, I don't think there is any way it can give you what you want right away.

评论 #22929628 未加载

评论 #22934005 未加载

wkyleabout 5 years ago

Vaguely similar to a joke from Ninotchka that Zizek often uses about the difference between 'coffee without cream' and 'coffee without milk'. He usually uses it to reference the concept of negation in the Hegelian dialectic, but he's also mentioned the difficulty of computers understanding negation in the context of the coffee/cream example.The joke from Zizek: <a href="https://www.youtube.com/watch?v=wmJVsaxoQSw" rel="nofollow">https://www.youtube.com/watch?v=wmJVsaxoQSw</a>

albertzeyerabout 5 years ago

Why should it not be possible to solve this with statistical methods? The model just needs to be able to understand the important meaning of "no" in here, in the context of the whole sentence. I would guess that most modern NNs from the NLP area (Transformer or LSTM) would be able to correctly differentiate the meaning. The problem is, I think there is no fancy NN (yet) behind Google search, and the other web searches.To extend on that, you can think of the human brain as just another (powerful) statistical model.

评论 #22926788 未加载

评论 #22930197 未加载

评论 #22928066 未加载

评论 #22927781 未加载

评论 #22928470 未加载

caust1cabout 5 years ago

My favorite was "What do vegetarians eat" which was broken for years: <a href="https://twitter.com/Caust1c/status/855193855422943234" rel="nofollow">https://twitter.com/Caust1c/status/855193855422943234</a>

评论 #22925651 未加载

GuB-42about 5 years ago

Fun experiment on Google:- Shirt Without Stripes: shirts where the description contains both "without" and "stripes". Example: a shirt without collar, with stripes.- "Shirt Without Stripes": a mess, with and without stripes, suggesting an unusual search query. In fact, the linked article site is the first result in web search.- Stripeless shirt: sexy women in strapless shirts- "stripeless shirt": pictures of Invader Zim...- "stripeless" shirt: mostly shirts without stripes, but there are some shirts with stripes that are described as stripeless...The last one may give us a hint at the problem. If you have to mention a shirt is without stipes, you are probably comparing is to a shirt with stripes. For example imagine a forum, some guy is posting a picture of a shirt with stripes, I can expect some people to ask questions like "do they sell this shirt without stripes"? Or maybe the seller himself may have a something like "shirt without stripes available here (link)" in the description. So the search engines tie "shirt without stripes" to pictures of shirts with stripes.I remember an incident where searching for "jew" on Google led to antisemitic websites. The reason was simply that that exact word was rarely used in other contexts. Mainstream and Jewish source tend to use the words "jews" and "jewish" but not "jew". And because Google doesn't look at the dictionary meanings of words but rather what people use them for, you get issues like that.

评论 #22929211 未加载

bentonaabout 5 years ago

To me, the most interesting implication here is that this must not adversely affect Google's ad revenue. If it did, they would surely fix it. This, in turn, means that apparently we have been trained to interface with search engines such that this is not a problem.Sometimes I wonder how much my brain has changed to use search engines / how much of it is dedicated to effective googling. Makes me feel like a cyborg.

评论 #22925857 未加载

评论 #22925674 未加载

评论 #22925590 未加载

评论 #22926067 未加载

评论 #22925993 未加载

评论 #22925516 未加载

评论 #22925813 未加载

Tade0about 5 years ago

My Operating Systems professor (Tomasz Jordan Kruk, PhD) in college had an appropriate anecdote for this:"Humans usually don't intuitively understand the word 'no'. Please imagine a non-pink elephant."

评论 #22925691 未加载

评论 #22927321 未加载

captainmuonabout 5 years ago

This is something that has annoyed me since the Altavista times. I want to search for "madonna but not the singer", and find pictures of the holy icon. I can do "madonna -singer", but that fails if the page mentions the word "singer" a single time. Even if it is "This is a page about madonna statues, but not about the famous singer."It would be great if I could add negative keywords to a website, or mark text as "don't index" or "index with a negative weight". But probably, people would game this in ways I can't imagine.There is probably a clever ML solution for this, like having meaning-vectors for distinct ideas, and pushing pages that are close to one meaning away from the other meaning. Classification is easy if you have a keywords like "painting" and "catholic", but if it is "virgin" or "prayer" then it could be either meaning, so there is never a bullet-proof solution.

cscurmudgeonabout 5 years ago

A few years back (in around 2012) I attended an NLP talk.The theme of this talk was how they did a study that showed prepositions and articles do have meaning. A big deal was made out of the results.I think things like this happens when people consider engineering approximations such as bag of words to be the truth over time.

评论 #22926619 未加载

c3534labout 5 years ago

If you select "I don't like this recommendation" for a video on youtube, you will get to provide feedback on why you did so: either "I don't like this video" or "I've already watched this video." I've pressed the latter on literally thousands of videos at this point, and after well over a year of this, YouTube still hasn't figured out that I don't want to be recommended videos that I've already watched.Likewise, Google says I should log into their website for personalized search results, but after years of always clicking on Python 3 results over Python 2.7 results, it never learned to show me the correct result.Eventually I realized that personalized recommendations are more or less just a thin cover for collecting vast amounts of data with no benefit to the consumer. I believe we have the technology to do better, but we don't use it. In fact, we seem to be using it less and less.

评论 #22933663 未加载

ChuckMcMabout 5 years ago

I love this. It is such an easy to grasp example of what is "wrong" with search. Historically, searching was keyword based so documents with "shirt" and "stripes" would rank highly, even though none of those pages had the keyword "without".As humans we know immediately that the search is for documents about shirts where stripes are not present. But the term 'without' doesn't make it through to the term compositor step which is feeding terms in a binary relationship. We might make such a relationship asQ = "shirt" AND NOT "stripes"You could onebox it (the Google term for a search short circuit path that recognizes the query pattern and some some specific action, for example calculations are a onebox) and then you get a box of shirts with no stripes and an bunch of query results with.You can n-gram it, by ranking the without-stripes n-gram higher than the individual terms, but that doesn't help all that much because the English language documents don't call them "shirts without stripes", generally they are referred to as "plain shirts" or "solid shirts" (plain-shirt(s) and solid-shirt(s) respectively). But you might do okay punning without-stripes => plain or to solid.From a query perspective you get better accuracy with the query "shirts -stripes". This algorithmic query uses unary minus to indicate a term that should not be on the document but it isn't very friendly to non-engineer searchers.Finally you can build a punning database, which is often done with misspellings like "britney spears" (ok so I'm dating my tenure with that :-)) which takes construction terms like "without", "with", "except", "exactly" and creates an algorithmic query that is most like the original by simple substitution. This would map "<term> without <term>" => "<term> -<term>". The risk there is that "doctors without borders" might not return the organization on the first page (compare results from "doctors without borders" and "doctors -borders", ouch!)When people get sucked into search it is this kind of problem that they spend a lot of time and debate on :-)

ggggtezabout 5 years ago

Perhaps, but would you really say "Hi, I'm wearing a shirt without stripes"?It's a completely artificial construct. Simply the fact that this hacker-news entry is the #1 search result shows that real human people do not perform this search in significant quantity. But we can quantify that with data to backup the assumption [1][2]. When people want to buy a shirt without stripes, they do not describe the shirt by what it doesn't have.In fact, it's trivial to cherry pick a random selection of words that on the face of it sounds like something a human might search for, but it turns out never occurs in practice. Add to that the fact that the term is being searched without quotes [3], which results in the negation not actually being attached to anything.Do you go to a store to buy it along with your Pants Without Suspenders, Socks Without Animal Print, and other items defined purely by what they don't have?[1] <a href="https://trends.google.com/trends/explore?geo=US&q=%22white%20tshirt%22,%22shirt%20without%20stripes%22" rel="nofollow">https://trends.google.com/trends/explore?geo=US&q=%22white%2...</a> [2] <a href="https://trends.google.com/trends/explore?geo=US&q=%22plain%20shirt%22,shirt%20without%20stripes" rel="nofollow">https://trends.google.com/trends/explore?geo=US&q=%22plain%2...</a> [3] <a href="https://trends.google.com/trends/explore?geo=US&q=plain%20shirt,shirt%20without%20stripes" rel="nofollow">https://trends.google.com/trends/explore?geo=US&q=plain%20sh...</a>

VohuManaabout 5 years ago

Is it just me or does it feel like in the last couple years all of these companies have had the quality of their search go down? I've noticed large portions of my search will go ignored and it will just grab the most popular terms in my search rather than searching all terms.

rbettsabout 5 years ago

This is also confusing what you search for vs. what the vendor thinks you will buy. Product catalog searches often intentionally return items outside your search parameters.

shaneclevelandabout 5 years ago

I would never search for something this way. If I wanted to find a 4WD car, I wouldn't search for "cars without 2WD."Likewise, here, I would search for solid-colored shirts.And these services are limited to the content/terminology utilized by the cataloged sites/products.If I am selling a "black shirt" or a "solid black shirt," it is not google's job to catalog it as a "shirt without stripes," unless I advertise it as a "black shirt without stripes."I would use natural language to test a services' NLP ability.

评论 #22925734 未加载

评论 #22925792 未加载

评论 #22926925 未加载

评论 #22925665 未加载

woodersabout 5 years ago

We're a company coming out of the YC W20 batch working on the product attribution problem <a href="http://glisten.ai/" rel="nofollow">http://glisten.ai/</a>.There's too many products nowadays to be manually attributed (e.g. pattern=stripes), making it hard return good results even with entity resolution for queries. We train classifiers to categorize products, including what something is not, using their images and descriptions.

schmichaelabout 5 years ago

Google Photo's search is a similar source of amusement for me. While it's quite good, it also fails fairly regularly and sometimes amusingly. For me "turtle" includes understandable mistakes like fish, a snail, and a rock that does look a bit like a turtle. However "turtle" also includes this, a picture of sequined slippers reflecting light?! <a href="https://i.imgur.com/4aSlA4B.jpg" rel="nofollow">https://i.imgur.com/4aSlA4B.jpg</a>I'm guessing one of those reflections looks like a turtle? Or maybe a pattern on the floor, wall, or rug?Although there are examples where I'm unsure if the AI is dumber than my 4yo or smarter than me. This is a result for "truck": <a href="https://i.imgur.com/JcgXZAG.jpg" rel="nofollow">https://i.imgur.com/JcgXZAG.jpg</a>Even (especially?) my 4yo knows those are Brio trains, not trucks. However, trains have components called trucks! <a href="https://en.wikipedia.org/wiki/Steam_locomotive_components" rel="nofollow">https://en.wikipedia.org/wiki/Steam_locomotive_components</a> I'm unsure whether or not any of the wheel assemblies on these toy trains are considered trucks, so either the AI is extremely smart or slightly dumber than a 4yo.

评论 #22926370 未加载

antmanabout 5 years ago

A good display of the current state of search engines.

评论 #22925872 未加载

评论 #22925599 未加载

评论 #22925311 未加载

joshmnabout 5 years ago

Proof that SELECT with GROUP BY doesn't work if your tags aren't correct.Joking aside, it doesn't surprise me that this isn't being picked up — aren't most of these AI teams more R&D than actual public-facing? Maybe I'm just cynical though.

dEnigmaabout 5 years ago

This contrasts with my query of "guys in jean jumpers singing too ra loo ra loo" a few years back, which Google correctly identified as "Come on Eileen" by Dexys Midnight Runners. To this day my favourite search experience.

js2about 5 years ago

If it were butter, you'd want an unstriped shirt. If it were provolone, you'd want a non-striped shirt. But because it's neither of those, I think you just want a "shirt" or maybe, a "plain shirt". Indeed, I get much better results with either of the latter two search terms. There's no need to mention stripes at all, since no pattern is the default state, isn't it?

lifeisstillgoodabout 5 years ago

Weirdly, searching for'shirt no stripes'on Google returned this web page at top of the organic results.So at some point, searching for a shirt online will involve this conversation. Even more confusing.(Although I expect my filter bubble will play a part in that)

评论 #22925697 未加载

throw345hnabout 5 years ago

Only slightly related but a couple of years back I got an alexa as a gift. When you open the alexa app, they had the option to add list of todos as a reminder. The first thing I did is to say something like - Alexa, add a reminder to get milk and eggs and paper. The app literally added a single item like this - milkANDeggsANDpaper.After that I facepalmed myself and turned it off.

评论 #22926236 未加载

katorabout 5 years ago

Joe: "Hey is Lisa back from vacation?"Larry: "I saw a red Lamborghini in the parking lot!"Most people will assume Lisa is driving a red Lamborghini and back from Vacation, meanwhile, all the bots are searching for Lamborghini vacations and trying to figure out what's going on in the conversation.

partomniscientabout 5 years ago

"shirts without stripes" results: <a href="https://www.amazon.com/s?k=shirts+without+stripes&ref=nb_sb_noss" rel="nofollow">https://www.amazon.com/s?k=shirts+without+stripes&ref=nb_sb_...</a>"shirts -stripes" results: <a href="https://www.amazon.com/s?k=shirts+-stripes&ref=nb_sb_noss_2" rel="nofollow">https://www.amazon.com/s?k=shirts+-stripes&ref=nb_sb_noss_2</a>So basically the AI doesn't convert "without x" to "-x" even though the basic capability needed is there. This is why AI is a hard problem, especially when it meets the real world.It's 2020 and we're still quibbling about the terminology used in SQL, what did we expect?

mabboabout 5 years ago

It's not enough to say "Oh, we should add a rule that 'without' means negate the next word" because that only applies to this one situation, in this one language. Let's generalize the problem: We aren't correctly translating from English (or other spoken languages) to Computer/Logic.The state of the art in machine translation (from what I've read at least) is translating from language-A to a language-less "concept space" and then from there to language-B. Could that be done where the output language is something a search engine can use to find what you want correctly?Given that pattern, I suspect we could see much better results in cases like this.

slaymaker1907about 5 years ago

I think that this is actually really encouraging in showing that we still have a ways to go in improving search engines. A lot of people treat search engines as a solved problem, at least for non-question answering aspects.

评论 #22925715 未加载

评论 #22925487 未加载

alanbernsteinabout 5 years ago

Or you can do this: <a href="https://www.google.com/search?q=shirt+-stripes" rel="nofollow">https://www.google.com/search?q=shirt+-stripes</a>

评论 #22925338 未加载

评论 #22925480 未加载

评论 #22925313 未加载

评论 #22925295 未加载

评论 #22925527 未加载

评论 #22925284 未加载

varelazabout 5 years ago

Problem here is not about negation, but there is no product that's described as "shirt without stripes". Stripes and shirt will come together in a different sense, since Google cannot find whole phrase it has to find parts. For example check for "shirt without shoulders"

pugworthyabout 5 years ago

Interestingly, Google can handle these searches just fine..."birds without flight""cars without wheels""cats without tails""dogs without hair""intersections without lights""poems without rhyme""shirts without collars" (also "sleeves", "shoulders", "buttons", "logos", "pockets", and more)

评论 #22926923 未加载

评论 #22926929 未加载

imgabeabout 5 years ago

Humans can kind of make some assumptions based on context, but it's really just a poorly defined, vague query.What if you walked into a store and asked an associate for a shirt without stripes? What would you get?Probably some further questions for clarification. What about checked shirts? Floral prints? Plaid? Do you want no pattern at all? T-shirt? Polo shirt? Dress shirt?Granted, the AI results are particularly bad because they give you the one thing that you specifically didn't ask for, but that's also the only information you provided. Defining a query in terms of what you don't instead of what you do isn't going to go well.What if you went to google and said "Show me all the webpages that aren't about elephants"? Sure, you'd get something, but would it be anything useful?

hombre_fatalabout 5 years ago

This is a good example of the bar HNers must have these days when they bafflingly assert that Google is somehow getting worse from what they remember.Google has gotten better, it's just HNer expectations that have changed as they expect more and more magic.For example, the subtitle on the repo is "Stupid AI" when this query has never worked in these search engines, and it won't anytime soon.You'd think the technical HN crowd would be more advanced than to make the same mistakes that (they complain that) stakeholders/users/gamers make when they mistakenly think everything is much easier than it actually is. Things aren't "stupid" just because they can't yet read your mind.

评论 #22926359 未加载

rjurneyabout 5 years ago

That darn conceptual search sure is hard :) The technical approach to achieving this involves a sentence embedding that then uses vector search to match documents based on a distance metric like cosine similarity. If you encode a description of a shirt in an embedding trained on all shopping item descriptions, it should match up with the search query. The trick is in getting a sentence embedding from a short query to match a longer description in a document description - long summaries of text in embeddings tends to average too much and cloud meaning. The other problem is including the vector search feature without screwing up other searches.

kinkrtyavimoodhabout 5 years ago

On a meta note, I am a bit tired of HN submissions being used more as "Writing Prompts" rather than as links to substantive material.This thread is an excellent example. The author of the linked page didn't have the decency to actually make a substantive point, instead sharing three screenshots and posting the link here, chumming the HN waters with the kind of stuff that brings in the sharks from far and wide.Bashing on big cos: CheckVague pronouncements about AI: CheckGeneric side-swipes about 'ad revenue': CheckThis is why a coherent thesis is required to even initiate a proper discussion, because in the absence of that it invariably devolves to lowest-common-denominator shit-flinging.

ltbarcly3about 5 years ago

Here's another fun fact about how commerce search engines work (I spent a couple of years on this):Negations sidestep almost all of the algorithms that try to provide an improved result set, and fall through to pure text relevancy. So try searching on amazon for shirt, then search for: shirt -xkxkxkxk. Since xkxkxkxk doesn't match any documents, the negation should have no effect, but it does, the effect it has is to sidestep all the fancy relevancy work and hardcoded query rewrite rules, domcat rules, demand and sales/impression statistics etcetc, and give you basically awful search results. You don't even get shirts.

twodaveabout 5 years ago

I'm actually not sure I expect this much from a search engine. Typically there is going to be a useful word to describe what you want without having to hope it can understand "no" or "without" (for example, without stripes -> "solid" or even "NOT striped" in many cases).Anyone with a programming background knows there is an art to forming useful search queries--it is an acquired skill. I'd personally much rather the engine bring back predictable results given mundane rules and keywords than attempt to understand sentences using an opaque method of understanding.

评论 #22926941 未加载

pvtmertabout 5 years ago

Since there is no context is provided, I do not expect it to understand prepositions itself.Given exact query to human, they create environment thus context themselves.It may also depend on whom you are asking to. For example, myself, entering this site to find out news about software & tech. Also since 'Stripe' is a company name, I assumed link will get the list of shirt shops who do not accept Stripe as a payment method/provider. (Thus some kind of protest related thing)I literally thought about that yesterday and did not see the page thinking "That's too much for tonight".Now seeing topic is somewhat very different.

civil_engineerabout 5 years ago

Wikipedia gets it wrong too: Try “men without hats” <a href="https://en.wikipedia.org/wiki/Men_Without_Hats" rel="nofollow">https://en.wikipedia.org/wiki/Men_Without_Hats</a>

thedeviantdevabout 5 years ago

Try searching Google for 'white couples' and 'black couples'.The former returns lots of mixed race couples, mostly not white couples. However the latter returns black couples.What is going on here? Similar phenomenon perhaps?

heavenlyblueabout 5 years ago

To be fair, the only thing that Google needs to do internally is to match this query to “shirt -stripe” and then you’ll get the necessary answer. The bigger question is why they are not doing that.

quickthrower2about 5 years ago

"Plain shirt" works a charm though. What is a 'shirt without stripes' anyway? That could be a shirt with diamonds? Or a plain one? Or a Hawaiian shirt?What is the expected result, can we agree?

评论 #22932025 未加载

lgesslerabout 5 years ago

A good demonstration of the linguistic fact that far from being meaningless, prepositions (adpositions, more generally) are actually highly consequential for meaning and are highly ambiguous between different meanings. Here's a paper that'll give you a good appreciation of this from an NLP perspective if you're curious: <a href="https://www.aclweb.org/anthology/W16-1712.pdf" rel="nofollow">https://www.aclweb.org/anthology/W16-1712.pdf</a>

V-2about 5 years ago

I believe the future of AI, as showcased by this simple usecase, is not one central AI such as Google search engine recognizing the context, but rather each of us having a "smart assistant" with a personalized, trained understanding of the contexts that we mean.And it's only that smart assistant that automates coping with the deficiencies of a one-size-fits-all central solution, finding me shirts with no stripes by using a rather dumb search engine. (Or "a pizza I would like", etc.)

dk8996about 5 years ago

I'm kinda late to this conversation but there are companies and Engineers trying to solve this problem basically adding more "semantics" to visual content. Good place to start is with this blog from Pinterest.<a href="https://medium.com/pinterest-engineering/pinsage-a-new-graph-convolutional-neural-network-for-web-scale-recommender-systems-88795a107f48" rel="nofollow">https://medium.com/pinterest-engineering/pinsage-a-new-graph...</a>

dailypeekerabout 5 years ago

Why is everything a git repo when it could have been a blog post?

评论 #22926111 未加载

obarthelemyabout 5 years ago

Reminds me of the the challenge:"Don't think of a cow !"What did you just think of ? A cow, of cowrse.If you want a shirt w/o stripes, just google "plain shirt" or "dress shirt -stripes.

dtunkelangabout 5 years ago

As others have pointed out, most search engines don't support natural language search in general, let alone natural language negation in particular.There are several reasons for this, including the following:1) Natural language understanding for search has gotten a lot better, but it is still not as robust as keyword matching. The upside of delighting some users with natural language understanding doesn't yet justify the downside of making the experience worse for everyone else.2) Most users today don't use natural language search queries. That is surely a chicken-and-egg problem: perhaps users would love to use natural language search if it worked as well or better than keyword search. But that's where we are today. So, until there's a breakthrough, most search engine developers see more incremental gain from optimizing some form of keyword search than from trying to support natural language search.3) Even if the search engine understands the search query perfectly, it still has to match that interpretation against the documentation representation. In general, it's a lot easier to understand a query like "shirt with stripes" than to reliably know which of the shirts in the catalog do or don't have stripes. No one has perfectly clean, complete, or consistent data. We need not just query understanding, but item understanding too.4) Negation is especially hard. A search index tends to focus on including accurate content rather than exhaustive content. That makes it impossible to distinguish negation from not knowing. It's the classic problem of absence of evidence is not being evidence of absence. This is also a problem for keyword and boolean search -- negating a word generally won't negate synonyms or other variations of that word.5) The people maintaining search indexes and searchers co-evolve to address -- or at least work around -- many of these issues. For example, most shoppers don't search for a "dress without sleeves"; they search for a "sleeveless dress". Everyone is motivated to drive towards a shared vocabulary, and that at least addresses the common cases.None of this is to say that we shouldn't be striving to improve the way people and search engines communicate. But I'm not convinced that an example like this one sheds much light on the problem.If you're curious to learn more about query understanding, I suggest you check out <a href="https://queryunderstanding.com/introduction-c98740502103" rel="nofollow">https://queryunderstanding.com/introduction-c98740502103</a>

bryanrasmussenabout 5 years ago

I think, looking at shirt without stripes and shirt with out stripes in Google images, that without is decompounded, which then ends up giving you shirt with stripes, however the slight difference between the two searches "shirt without stripes" and "shirt with out stripes" is that the there are some exact hits mixed in also, so there are some results for "shirt without stripes" mixed in with the decompounded query.Just my theory.

holdenc137about 5 years ago

Wake me up when I can google "anything but crocodiles"

评论 #22925517 未加载

jpswadeabout 5 years ago

Google search doesn’t work that way, it’s still based on how we link to things.Nobody would describe a plain shirt as a shirt without stripes unless it’s within that context.

rubatugaabout 5 years ago

Also, what really angers me is when websites don't support the minus operator for search queries. It's a simple feature introduced decades ago!

bryanrasmussenabout 5 years ago

<a href="https://www.google.com/search?q=plain+shirt" rel="nofollow">https://www.google.com/search?q=plain+shirt</a><a href="https://www.amazon.com/s?k=plain+shirt" rel="nofollow">https://www.amazon.com/s?k=plain+shirt</a>on edit: <a href="https://www.google.com/search?q=shirt+-stripes" rel="nofollow">https://www.google.com/search?q=shirt+-stripes</a>

评论 #22925737 未加载

评论 #22925938 未加载

评论 #22925682 未加载

评论 #22926094 未加载

评论 #22925751 未加载

neycodaabout 5 years ago

How does a search engine know whether you wanted shirts that didn't have stripes or results that contained the words shirts, without, and stripes?

ddebernardyabout 5 years ago

The correct query would have been "shirt -stripes". That works fine, or at least does on Google. But yeah, sentence parsing fail.

评论 #22925786 未加载

raindropmabout 5 years ago

Apparently, this kinda works in Thai language too(and I think other language also) The search keyword is "เสื้อไม่มีแถบ" which is literally translated as 'Shirt without stripes'. It's common words to speak, unlike 'without' in English.The result, of course, show shirt with some kind of stripe, albeit not prominent like the English one.

vekkerabout 5 years ago

I worked on an ingredient parser a few years ago. This exact kind of thing made things a lot more difficult than they seemed at first.

rjurneyabout 5 years ago

The latest embeddings/networks like BERT can handle encoding this logic. They take the surrounding words in context when they're encoded.Google can do this now, for example in a prototype. The tough thing is to get it to consumer-grade quality without messing up other searches. The QA process is utterly brutal because one weird search can be a scandal.

wordabbyabout 5 years ago

On a positive note Google used to have trouble with a query like "words with q without u", now the top 5 pages at least all show the correct results, eg: <a href="https://word.tips/words-with/q/without/u/" rel="nofollow">https://word.tips/words-with/q/without/u/</a>

CPLXabout 5 years ago

Related: <a href="https://www.google.com/search?q=mountains+without+women+in+a+bikini&source=lnms&tbm=isch&sa=X&ved=2ahUKEwiygeHatPfoAhWGknIEHe8QDSkQ_AUoAXoECAwQAw&biw=1747&bih=947" rel="nofollow">https://www.google.com/search?q=mountains+without+women+in+a...</a>

aj7about 5 years ago

Since I was a teenager, if someone energetically asserts a statement is “true” or “false,” I drop the true or false and evaluate the statement. In essence, their only communication to me is, ‘I think this is important!’ Often, why they think it’s important is more pressing than whether the statement is true.

adamredwoodsabout 5 years ago

I wonder if this is a need for humans need to learn search queries. "-stripes" instead of "without stripes".Or does input need to have basic filters applied before handing to ML? "without X" or "no X" = "-X"? Can be foiled with "shirt without having stripes".

leonardopucciabout 5 years ago

I think that query analysis in terms of volume of actual people using this query will show that very little people if any actually type "shirt without stripes". Once enough people do it, feedback is accumulated that results are bad (by CTR analysis), and results will auto-correct.

aaron695about 5 years ago

Not sure if people actually search for "Shirt Without Stripes" or this was picked for academia over what is actually neededBut make a script that scrapes the top X results for these sites. Get your own AI / humans to rate it.Make it competitive for these large sites <==> give them an incentive.

softwaredougabout 5 years ago

In search we know it’s easy to cherry pick queries and criticize any search engine. A search engine is optimizing for billions of queries. Most of which are on the long tail.The real question is “shirts without stripes” really a query people enter? Or representative of a real pattern in the data?

评论 #22926250 未加载

评论 #22926390 未加载

mirimirabout 5 years ago

Searching "plain shirts" does in fact yield results for shirts without patterns. And "paisley shirts" works too.So it's not such a big deal that negation doesn't work.Also, "shirts -stripes" does seem to work in both Amazon and Google. Or at least, I see no striped shirts.

moultanoabout 5 years ago

Ya'll might be interested in this paper. <a href="https://arxiv.org/abs/1907.13528" rel="nofollow">https://arxiv.org/abs/1907.13528</a>> in particular, it shows clear insensitivity to the contextual impacts of negation.

cfvabout 5 years ago

While I'm sure this is A Hard Problem to solve by NLP I for whatever reason was under the impression that this is trivial to special-case.As in, "X without Y" sounds like a common enough use case to have it's own little parser branch in places as big as Google or Amazon

评论 #22929886 未加载

realoabout 5 years ago

Tried this on amazon.ca (instead of .com) and got quite a different, but also amazing, result...<a href="https://www.amazon.ca/s?k=shirt+without+stripes&ref=nb_sb_noss" rel="nofollow">https://www.amazon.ca/s?k=shirt+without+stripes&ref=nb_sb_no...</a>

carapaceabout 5 years ago

<pre><code> shirt -stripes </code></pre> > "Am I going crazy or is it the world around me!?"Fishbone - Drunk Skitzo <a href="https://youtu.be/SaPGH4Yd_zc?t=231" rel="nofollow">https://youtu.be/SaPGH4Yd_zc?t=231</a>(Apologies for the snarky low-content flip reply.)

arnaudsmabout 5 years ago

I wonder why this problem hasn't been resolved yet, considering we had NLP systems capable of this for a decade now. Maybe it's too hard to scale to production. Or Pagerank is still better most of the time. Or plain old monopoly and risk aversion.

评论 #22925907 未加载

need_more_bortabout 5 years ago

So I’d have to ask, is the problem the AI doesn’t intuit “without stripes”, find shirts that satisfy that condition (what kind of shirts? Dress shirts? T shirts?) and then do an image search identifing shirts and their quality of stripeyness

GnarfGnarfabout 5 years ago

Why not simply say "shirt -stripes" (negation in front of "stripes").

harimau777about 5 years ago

It seems to me that the problem isn't so much that this search performs incorrectly. Rather it is that many search engines have removed the tools that allowed a user to specify exactly what they are looking for (e.g. shirt -stripes).

cachestashabout 5 years ago

Key question here, do all three even profess to using AI/ML in the search feature?

dpcanabout 5 years ago

Yes, exclusive search is a huge problem.You have to know to search for "solid colored shirt", but when you can't think of this variation of search, or maybe there isn't one, exclusion is your only option, and it's broken.

rammy1234about 5 years ago

I see Bing is poor of the lot. It did not understand the "without" keyword

DangerousPieabout 5 years ago

Counterexample: <a href="https://www.google.com/search?client=firefox-b-d&q=Doctors+Without+Borders" rel="nofollow">https://www.google.com/search?client=firefox-b-d&q=Doctors+W...</a>

评论 #22926775 未加载

kitplummerabout 5 years ago

Why is this a Github repo? I can't get past the abuse of a git repository.

评论 #22934292 未加载

mv4about 5 years ago

Not surprising. Also, "shirt -stripe" might product better results.

评论 #22927410 未加载

MaysonLabout 5 years ago

Reminds me of the difficult time I had finding socks without elastic.

otikikabout 5 years ago

Poignant but accurate.On Amazon's side of things I would also include the obnoxious "Hey you just bought a pair of sneakers so now I will change all your recommendations to sneakers".

tiborsaasabout 5 years ago

I get the author's point, but if you think about it, a search engine is a database that serves you results you want to see. Why should a search engine be fine tuned towards things you don't want see?If it's meaningful for some reason, then it works:<a href="https://www.google.com/search?q=woman+without+makeup&tbm=isch" rel="nofollow">https://www.google.com/search?q=woman+without+makeup&tbm=isc...</a>If it's an user error (like a dumb query) it fails and it shouldn't be a surprise:<a href="https://www.google.com/search?q=sea+without+ships&tbm=isch" rel="nofollow">https://www.google.com/search?q=sea+without+ships&tbm=isch</a>

l0b0about 5 years ago

"shirt -stripes" seems to work on google.com at least, even though they and others like DDG have been getting really bad at ignoring "-foo" terms recently.

_0w8tabout 5 years ago

Searching in Russian on Yandex gives the same ridiculous results.

ddlutzabout 5 years ago

Noticed something interesting, if you search for 'shirt without sleeves' in google images, you DO get sleeveless shirts. So why doesn't this work with stripes?

评论 #22930688 未加载

peter_retiefabout 5 years ago

If you search for "plain shirt" its good If you add "plain shirt no stripes" it adds stripes Strangely "striped shirt" has some plain results.

lavpabout 5 years ago

For the google search, I get better results by typing "shirt -stripes". Still not perfect, but it's better than the seemingly redundant 'without'.

Despoisjabout 5 years ago

What about using the "-" sign to filter results instead of relying on complex language understanding?=> "shirt -stripes" works pretty well on google at least

Skunkletonabout 5 years ago

Good thing search engines generally support a more machine-centric process for communicating intent. Try searching for "shirt -stripes".We are in a funny place with UIs.

dvhabout 5 years ago

<a href="https://www.google.com/search?q=leap+years+in+1900s" rel="nofollow">https://www.google.com/search?q=leap+years+in+1900s</a>

prvcabout 5 years ago

On a related note, Google seems to favor forum replies which instruct users to perform a search in order to find the answer to the question that they had asked.

fortran77about 5 years ago

Here's a better example:"shirt without sleeves"That something that someone may actually search for. (At least the guys at my gym would!) And Amazon gets it mostly wrong.

melvinramabout 5 years ago

`shirt -stripes` gives the type of results one would expect. I guess we haven't reached that level of natural language processing yet.

willdeardenabout 5 years ago

One time I ordered from stitch fix and asked for “a shirt which is not red, white, and blue” and got a red, white, and blue shirt.

wwarnerabout 5 years ago

I thought the minus prefix instead of "without" would exclude "stripes", but it doesn't (any more).

__ryan__about 5 years ago

Similarly, ask Siri to “play all of my music, except classical music”. Siri responds “OK, classical music coming up.”

activatedgeekabout 5 years ago

I'm sorry but what's the point being made here?Search results could be better? Sure.Can we find adversarial examples? Almost always.

impostervtabout 5 years ago

This should be the next captcha model.

hajimuzabout 5 years ago

It’s like the plot in the movie Inception. How to plan an idea “don’t think elepant” to an human idea?

flamtapabout 5 years ago

In the Amazon app, a search for “shirt without stripes” now get corrected to “shirt without strips”.

cgb223about 5 years ago

When the Robot Wars come we’ll all be wearing striped shirts as camouflage to confuse their AIs

robbiemitchellabout 5 years ago

In my experience, semantic embedding is simply not very good at taking negation into account.

scepticallyabout 5 years ago

Just a short question: Why does someone choose to publish something like that on github?

评论 #22934305 未加载

skizmabout 5 years ago

Could this be an SEO opportunity to capture some simple negative phrases like this?

billiamabout 5 years ago

It's a shopping problem, not a language problem, according to these companies.

voldacarabout 5 years ago

Doesn't Peter Norvig work at google? maybe they should pick up his book

java-manabout 5 years ago

Bag of words does not work.

diegorbaqueroabout 5 years ago

This is expected, not wanted though, I would expect some semantic analysis translated into "shirt -stripes", but what you really mean is "solid color shirt". This is a tough one but surely something that can be tackled with research

评论 #22925557 未加载

评论 #22925626 未加载

tempodoxabout 5 years ago

Stripes seem to hold an irresistible attraction for impostor “AI”.

holdenc137about 5 years ago

wake me up when I can google "Anything but crocodiles"

评论 #22925708 未加载

AA-BA-94-2A-56about 5 years ago

Did nobody tell OP you can search "shirt -stripes"?

paulftwabout 5 years ago

Are they also building AI cars that drive without accidents?

DonHopkinsabout 5 years ago

Stay positive: "solid shirts" works just fine.

_pmf_about 5 years ago

But muh "autonomous driving is almost there".

longtermdabout 5 years ago

A pretty accurate descriptions of the State of AI ;D jk

downshunabout 5 years ago

It's called plain, as in plain shirt. PEBKAC

PeterCorlessabout 5 years ago

Someone needs to learn how to use the ¬ operator.

dvduvalabout 5 years ago

Most important is what the advertisements at the top show. The organic results are so yesterday. The Google Ads AI should already be teaching you that All you base belong to us.

ChrisArchitectabout 5 years ago

hm, 'without' is a tough one. You're not looking for a zebra without stripes. You're looking for a horse.

评论 #22925492 未加载

评论 #22926737 未加载

jlmcguireabout 5 years ago

without must be a tough one? It does seem that bing is the worst at figuring this out from the pictures.

评论 #22925537 未加载

kleer001about 5 years ago

"shirt -stripes" works, wtf?

softwarejoshabout 5 years ago

its obvious but the solution is shirt -stripes until we make ai interpret attributes

sparrishabout 5 years ago

Learn you some Google-foo. Don't say what it isn't. Say what it is. "shirt solid color"

评论 #22925341 未加载

评论 #22925546 未加载

评论 #22925387 未加载

评论 #22925340 未加载

danielovichdkabout 5 years ago

Result without correct match!

noughtmeabout 5 years ago

Probably not surprising that most people don’t know, but negative keyword search works on all these platforms:shirt -stripes

评论 #22925440 未加载

评论 #22925801 未加载

评论 #22925363 未加载

评论 #22925698 未加载

eggie5about 5 years ago

simple LTR w/ clickstream data would fix this easy

arkanciscanabout 5 years ago

BUT THEY HAD MILK!

kebmanabout 5 years ago

shirt -stripes Thank me later. :D

aaronsnoswellabout 5 years ago

Brilliant :)

renewiltordabout 5 years ago

This is amusing but not a problem.

inopinatusabout 5 years ago

Tortoise: But we must be careful in combining sentences. For instance you’d grant that “Politicians lie” is true, wouldn’t you?Achilles: Who could deny it?Tortoise: Good. Likewise, “Cast-iron sinks” is a valid utterance, isn’t it?Achilles: Indubitably.Tortoise: Then, putting them together, we get “Politicians lie in cast iron sinks”. Now that’s not the case, is it?---- Douglas Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid. Basic Books, 1979

WilliamEdwardabout 5 years ago

querying "shirt no stripes" yields slightly better results.

JabavuAdamsabout 5 years ago

bag of words?

merittabout 5 years ago

The point of this isn't asking how to apply boolean search operators, it's showing that the largest AI-focused companies in the world absolutely suck at NLP.

评论 #22925688 未加载

评论 #22925642 未加载

评论 #22925358 未加载

评论 #22925584 未加载

评论 #22927443 未加载

crimsonalucardabout 5 years ago

In terms of AI, the following is literally the best I have ever seen and it's not even done by a professional (meaning you can make it too):<a href="https://aidungeon.io/" rel="nofollow">https://aidungeon.io/</a>

aerovistaeabout 5 years ago

This comment would go from "unreadable" to "interesting" if you had phrased it as:"Vaguely similar to a joke from _the movie_ Ninotchka that _the Slovenian philosopher_ Zizek often uses...."Give people context. Don't assume people know what you know.

评论 #22930098 未加载

评论 #22927864 未加载

评论 #22927810 未加载

评论 #22926072 未加载

评论 #22926722 未加载

评论 #22926232 未加载

ytersabout 5 years ago

Human level NLP is the halting problem, so unsurprising that AI cannot do simple expressions.

lostmsuabout 5 years ago

You got it backwards. This is the opposite of the problem I and many other people tend to have with search engines lately. I do not want the damn thing to combine words, exclude unpopular ones, and search for synonyms without me telling it to explicitly. As little as you can semantics please.If Google wants to group words by semantics, they should have a semantical grouping operator. For example "shirts (without stripes)". What if I am looking for a song text with these exact words in random positions?If what author wants was implemented, it would make my experience with Google even worse, unless it could think for me also. But then why would it need me in the first place?

141 comments

DenisMabout 5 years ago

评论 #22928734 未加载

评论 #22927449 未加载

评论 #22927193 未加载

评论 #22928497 未加载

评论 #22927832 未加载

评论 #22931245 未加载

评论 #22933632 未加载

评论 #22929537 未加载

评论 #22927158 未加载

评论 #22927189 未加载

评论 #22926560 未加载

评论 #22927489 未加载

评论 #22930902 未加载

评论 #22934244 未加载

评论 #22927905 未加载

评论 #22932770 未加载

评论 #22932852 未加载

评论 #22935875 未加载

评论 #22926863 未加载

评论 #22928336 未加载

评论 #22927030 未加载

rgovostesabout 5 years ago

评论 #22926402 未加载

评论 #22926210 未加载

评论 #22931239 未加载

评论 #22927418 未加载

评论 #22925434 未加载

评论 #22926614 未加载

评论 #22925794 未加载

评论 #22926159 未加载

seifertericabout 5 years ago

评论 #22926887 未加载

评论 #22927192 未加载

评论 #22926875 未加载

评论 #22925968 未加载

评论 #22925930 未加载

评论 #22929322 未加载

评论 #22927667 未加载

评论 #22928695 未加载

transrealabout 5 years ago

评论 #22929628 未加载

评论 #22934005 未加载

wkyleabout 5 years ago

albertzeyerabout 5 years ago

评论 #22926788 未加载

评论 #22930197 未加载

评论 #22928066 未加载

评论 #22927781 未加载

评论 #22928470 未加载

caust1cabout 5 years ago

评论 #22925651 未加载

GuB-42about 5 years ago

评论 #22929211 未加载

bentonaabout 5 years ago

评论 #22925857 未加载

评论 #22925674 未加载

评论 #22925590 未加载

评论 #22926067 未加载

评论 #22925993 未加载

评论 #22925516 未加载

评论 #22925813 未加载

Tade0about 5 years ago

评论 #22925691 未加载

评论 #22927321 未加载

captainmuonabout 5 years ago

cscurmudgeonabout 5 years ago

评论 #22926619 未加载

c3534labout 5 years ago

评论 #22933663 未加载

ChuckMcMabout 5 years ago

ggggtezabout 5 years ago

VohuManaabout 5 years ago

rbettsabout 5 years ago

This is also confusing what you search for vs. what the vendor thinks you will buy. Product catalog searches often intentionally return items outside your search parameters.

shaneclevelandabout 5 years ago

评论 #22925734 未加载

评论 #22925792 未加载

评论 #22926925 未加载

评论 #22925665 未加载

woodersabout 5 years ago

schmichaelabout 5 years ago

评论 #22926370 未加载

antmanabout 5 years ago

A good display of the current state of search engines.

评论 #22925872 未加载

评论 #22925599 未加载

评论 #22925311 未加载

joshmnabout 5 years ago

dEnigmaabout 5 years ago

js2about 5 years ago

lifeisstillgoodabout 5 years ago

评论 #22925697 未加载

throw345hnabout 5 years ago

评论 #22926236 未加载

katorabout 5 years ago

partomniscientabout 5 years ago

mabboabout 5 years ago

slaymaker1907about 5 years ago

评论 #22925715 未加载

评论 #22925487 未加载

alanbernsteinabout 5 years ago

Or you can do this: <a href="https://www.google.com/search?q=shirt+-stripes" rel="nofollow">https://www.google.com/search?q=shirt+-stripes</a>

评论 #22925338 未加载

评论 #22925480 未加载

评论 #22925313 未加载

评论 #22925295 未加载

评论 #22925527 未加载

评论 #22925284 未加载

varelazabout 5 years ago

pugworthyabout 5 years ago

评论 #22926923 未加载

评论 #22926929 未加载

imgabeabout 5 years ago

hombre_fatalabout 5 years ago

评论 #22926359 未加载

rjurneyabout 5 years ago

kinkrtyavimoodhabout 5 years ago

ltbarcly3about 5 years ago

twodaveabout 5 years ago

评论 #22926941 未加载

pvtmertabout 5 years ago

civil_engineerabout 5 years ago

Wikipedia gets it wrong too: Try “men without hats” <a href="https://en.wikipedia.org/wiki/Men_Without_Hats" rel="nofollow">https://en.wikipedia.org/wiki/Men_Without_Hats</a>

thedeviantdevabout 5 years ago

heavenlyblueabout 5 years ago

quickthrower2about 5 years ago

"Plain shirt" works a charm though. What is a 'shirt without stripes' anyway? That could be a shirt with diamonds? Or a plain one? Or a Hawaiian shirt?What is the expected result, can we agree?

评论 #22932025 未加载

lgesslerabout 5 years ago

V-2about 5 years ago

dk8996about 5 years ago

dailypeekerabout 5 years ago

Why is everything a git repo when it could have been a blog post?

评论 #22926111 未加载

obarthelemyabout 5 years ago

Reminds me of the the challenge:"Don't think of a cow !"What did you just think of ? A cow, of cowrse.If you want a shirt w/o stripes, just google "plain shirt" or "dress shirt -stripes.

dtunkelangabout 5 years ago

bryanrasmussenabout 5 years ago

holdenc137about 5 years ago

Wake me up when I can google "anything but crocodiles"

评论 #22925517 未加载

jpswadeabout 5 years ago

Google search doesn’t work that way, it’s still based on how we link to things.Nobody would describe a plain shirt as a shirt without stripes unless it’s within that context.

rubatugaabout 5 years ago

Also, what really angers me is when websites don't support the minus operator for search queries. It's a simple feature introduced decades ago!

bryanrasmussenabout 5 years ago

评论 #22925737 未加载

评论 #22925938 未加载

评论 #22925682 未加载

评论 #22926094 未加载

评论 #22925751 未加载

neycodaabout 5 years ago

How does a search engine know whether you wanted shirts that didn't have stripes or results that contained the words shirts, without, and stripes?

ddebernardyabout 5 years ago

The correct query would have been "shirt -stripes". That works fine, or at least does on Google. But yeah, sentence parsing fail.

评论 #22925786 未加载

raindropmabout 5 years ago

vekkerabout 5 years ago

I worked on an ingredient parser a few years ago. This exact kind of thing made things a lot more difficult than they seemed at first.

rjurneyabout 5 years ago

wordabbyabout 5 years ago

CPLXabout 5 years ago

aj7about 5 years ago

adamredwoodsabout 5 years ago

leonardopucciabout 5 years ago

aaron695about 5 years ago

softwaredougabout 5 years ago

评论 #22926250 未加载

评论 #22926390 未加载

mirimirabout 5 years ago

moultanoabout 5 years ago

cfvabout 5 years ago

评论 #22929886 未加载

realoabout 5 years ago

carapaceabout 5 years ago

arnaudsmabout 5 years ago

评论 #22925907 未加载

need_more_bortabout 5 years ago

GnarfGnarfabout 5 years ago

Why not simply say "shirt -stripes" (negation in front of "stripes").

harimau777about 5 years ago

cachestashabout 5 years ago

Key question here, do all three even profess to using AI/ML in the search feature?

dpcanabout 5 years ago

rammy1234about 5 years ago

I see Bing is poor of the lot. It did not understand the "without" keyword

DangerousPieabout 5 years ago

Counterexample: <a href="https://www.google.com/search?client=firefox-b-d&q=Doctors+Without+Borders" rel="nofollow">https://www.google.com/search?client=firefox-b-d&q=Doctors+W...</a>

评论 #22926775 未加载

kitplummerabout 5 years ago

Why is this a Github repo? I can't get past the abuse of a git repository.

评论 #22934292 未加载

mv4about 5 years ago

Not surprising. Also, "shirt -stripe" might product better results.

评论 #22927410 未加载

MaysonLabout 5 years ago

Reminds me of the difficult time I had finding socks without elastic.

otikikabout 5 years ago

Poignant but accurate.On Amazon's side of things I would also include the obnoxious "Hey you just bought a pair of sneakers so now I will change all your recommendations to sneakers".

tiborsaasabout 5 years ago

l0b0about 5 years ago

"shirt -stripes" seems to work on google.com at least, even though they and others like DDG have been getting really bad at ignoring "-foo" terms recently.

_0w8tabout 5 years ago

Searching in Russian on Yandex gives the same ridiculous results.

ddlutzabout 5 years ago

Noticed something interesting, if you search for 'shirt without sleeves' in google images, you DO get sleeveless shirts. So why doesn't this work with stripes?

评论 #22930688 未加载

peter_retiefabout 5 years ago

If you search for "plain shirt" its good If you add "plain shirt no stripes" it adds stripes Strangely "striped shirt" has some plain results.

lavpabout 5 years ago

For the google search, I get better results by typing "shirt -stripes". Still not perfect, but it's better than the seemingly redundant 'without'.

Despoisjabout 5 years ago

What about using the "-" sign to filter results instead of relying on complex language understanding?=> "shirt -stripes" works pretty well on google at least

Skunkletonabout 5 years ago

Good thing search engines generally support a more machine-centric process for communicating intent. Try searching for "shirt -stripes".We are in a funny place with UIs.

dvhabout 5 years ago

<a href="https://www.google.com/search?q=leap+years+in+1900s" rel="nofollow">https://www.google.com/search?q=leap+years+in+1900s</a>

prvcabout 5 years ago

On a related note, Google seems to favor forum replies which instruct users to perform a search in order to find the answer to the question that they had asked.

fortran77about 5 years ago

Here's a better example:"shirt without sleeves"That something that someone may actually search for. (At least the guys at my gym would!) And Amazon gets it mostly wrong.

melvinramabout 5 years ago

`shirt -stripes` gives the type of results one would expect. I guess we haven't reached that level of natural language processing yet.

willdeardenabout 5 years ago

One time I ordered from stitch fix and asked for “a shirt which is not red, white, and blue” and got a red, white, and blue shirt.

wwarnerabout 5 years ago

I thought the minus prefix instead of "without" would exclude "stripes", but it doesn't (any more).

__ryan__about 5 years ago

Similarly, ask Siri to “play all of my music, except classical music”. Siri responds “OK, classical music coming up.”

activatedgeekabout 5 years ago

I'm sorry but what's the point being made here?Search results could be better? Sure.Can we find adversarial examples? Almost always.

impostervtabout 5 years ago

This should be the next captcha model.

hajimuzabout 5 years ago

It’s like the plot in the movie Inception. How to plan an idea “don’t think elepant” to an human idea?

flamtapabout 5 years ago

In the Amazon app, a search for “shirt without stripes” now get corrected to “shirt without strips”.

cgb223about 5 years ago

When the Robot Wars come we’ll all be wearing striped shirts as camouflage to confuse their AIs

robbiemitchellabout 5 years ago

In my experience, semantic embedding is simply not very good at taking negation into account.

scepticallyabout 5 years ago

Just a short question: Why does someone choose to publish something like that on github?

评论 #22934305 未加载

skizmabout 5 years ago

Could this be an SEO opportunity to capture some simple negative phrases like this?

billiamabout 5 years ago

It's a shopping problem, not a language problem, according to these companies.

voldacarabout 5 years ago

Doesn't Peter Norvig work at google? maybe they should pick up his book

java-manabout 5 years ago

Bag of words does not work.

diegorbaqueroabout 5 years ago

评论 #22925557 未加载

评论 #22925626 未加载

tempodoxabout 5 years ago

Stripes seem to hold an irresistible attraction for impostor “AI”.

holdenc137about 5 years ago

wake me up when I can google "Anything but crocodiles"

评论 #22925708 未加载

AA-BA-94-2A-56about 5 years ago

Did nobody tell OP you can search "shirt -stripes"?

paulftwabout 5 years ago

Are they also building AI cars that drive without accidents?

DonHopkinsabout 5 years ago

Stay positive: "solid shirts" works just fine.

_pmf_about 5 years ago

But muh "autonomous driving is almost there".

longtermdabout 5 years ago

A pretty accurate descriptions of the State of AI ;D jk

downshunabout 5 years ago

It's called plain, as in plain shirt. PEBKAC

PeterCorlessabout 5 years ago

Someone needs to learn how to use the ¬ operator.

dvduvalabout 5 years ago

Most important is what the advertisements at the top show. The organic results are so yesterday. The Google Ads AI should already be teaching you that All you base belong to us.

ChrisArchitectabout 5 years ago

hm, 'without' is a tough one. You're not looking for a zebra without stripes. You're looking for a horse.

评论 #22925492 未加载

评论 #22926737 未加载

jlmcguireabout 5 years ago

without must be a tough one? It does seem that bing is the worst at figuring this out from the pictures.

评论 #22925537 未加载

kleer001about 5 years ago

"shirt -stripes" works, wtf?

softwarejoshabout 5 years ago

its obvious but the solution is shirt -stripes until we make ai interpret attributes

sparrishabout 5 years ago

Learn you some Google-foo. Don't say what it isn't. Say what it is. "shirt solid color"

评论 #22925341 未加载

评论 #22925546 未加载

评论 #22925387 未加载

评论 #22925340 未加载

danielovichdkabout 5 years ago

Result without correct match!

noughtmeabout 5 years ago

Probably not surprising that most people don’t know, but negative keyword search works on all these platforms:shirt -stripes

评论 #22925440 未加载

评论 #22925801 未加载

评论 #22925363 未加载

评论 #22925698 未加载

eggie5about 5 years ago

simple LTR w/ clickstream data would fix this easy

arkanciscanabout 5 years ago

BUT THEY HAD MILK!

kebmanabout 5 years ago

shirt -stripes Thank me later. :D

aaronsnoswellabout 5 years ago

Brilliant :)

renewiltordabout 5 years ago

This is amusing but not a problem.

inopinatusabout 5 years ago

WilliamEdwardabout 5 years ago

querying "shirt no stripes" yields slightly better results.

JabavuAdamsabout 5 years ago

bag of words?

merittabout 5 years ago

The point of this isn't asking how to apply boolean search operators, it's showing that the largest AI-focused companies in the world absolutely suck at NLP.

评论 #22925688 未加载

评论 #22925642 未加载

评论 #22925358 未加载

评论 #22925584 未加载

评论 #22927443 未加载

crimsonalucardabout 5 years ago

aerovistaeabout 5 years ago

评论 #22930098 未加载

评论 #22927864 未加载

评论 #22927810 未加载

评论 #22926072 未加载

评论 #22926722 未加载

评论 #22926232 未加载

ytersabout 5 years ago

Human level NLP is the halting problem, so unsurprising that AI cannot do simple expressions.

lostmsuabout 5 years ago