Wow, is this really state of the art?<p><pre><code> Joe did not buy a car today.
He was in a buying mood.
But all cars were too expensive.
Why didn't Joe buy a car?
Answer: buying mood
</code></pre>
I think I have seen similar systems for decades now. I thought we would be further along by now.<p>I have tried for 10 or 20 minutes now, but I can't find any evidence that it has much sense of syntax:<p><pre><code> Paul gives a coin to Joe.
Who received a coin?
Answer: Paul
</code></pre>
All it seems to do is extract candidates for "who", "what", "where", etc. So it seems to figure out correctly that "Paul" is a potential answer for "Who"; a naive sketch of that kind of type-matching heuristic follows the next example.<p>No matter how I rephrase the "Who" question, I always get "Paul" as the answer. "Who? Paul!", "Who is a martian? Paul!", "Who won the summer olympics? Paul!", "Who got a coin from the other guy? Paul!"<p>Same for "what" questions:<p><pre><code> Gold can not be carried in a bag. Silver can.
What can be carried in a bag?
Answer: Gold</code></pre>
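To illustrate what I suspect is going on, here is a naive type-matching heuristic of that sort, sketched with spaCy's off-the-shelf NER. The question-word-to-entity-type mapping is my own guess at the mechanism, not the demo's actual code:<p><pre><code> # Guess at the mechanism: pick any entity whose type matches the
 # question word, ignoring the rest of the question entirely.
 from typing import Optional

 import spacy

 nlp = spacy.load("en_core_web_sm")

 # question word -> acceptable spaCy entity labels (assumed mapping)
 TYPE_MAP = {
     "who": {"PERSON"},
     "where": {"GPE", "LOC", "FAC"},
     "when": {"DATE", "TIME"},
 }

 def naive_answer(passage: str, question: str) -> Optional[str]:
     wanted = TYPE_MAP.get(question.lower().split()[0])
     if wanted is None:
         return None
     # Return the first entity of the right type; the question's
     # actual content is never consulted.
     for ent in nlp(passage).ents:
         if ent.label_ in wanted:
             return ent.text
     return None

 print(naive_answer("Paul gives a coin to Joe.", "Who received a coin?"))
 # -> "Paul", no matter how the "who" question is phrased
</code></pre>
A heuristic like this reproduces every failure above: any person in the passage "answers" any who-question, and rephrasing the question changes nothing.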
This is very brittle: it works really well on the pre-canned examples, but the vocabulary seems very tightly coupled to them. It doesn't handle something as simple as:<p>'the patient had no pain but did have nausea'<p>This doesn't yield anything helpful on semantic role labeling and didn't even parse on machine comprehension. If I vary it to ask, say, 'did the patient have pain?', the answer is 'nausea'.<p>CoreNLP provides a much more useful analysis of the phrase structure and dependencies, as in the sketch below.
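For comparison, here is roughly how to pull the dependencies for that sentence out of a locally running CoreNLP server. This is a minimal sketch assuming the server is up on port 9000 with the default models; the point is that the parse attaches the "no" to "pain" explicitly instead of losing the negation:<p><pre><code> import json

 import requests

 sentence = "the patient had no pain but did have nausea"

 # Ask the CoreNLP server (default port 9000) for a dependency parse.
 resp = requests.post(
     "http://localhost:9000/",
     params={"properties": json.dumps(
         {"annotators": "tokenize,ssplit,pos,depparse",
          "outputFormat": "json"}
     )},
     data=sentence.encode("utf-8"),
 )

 # One relation(governor, dependent) line per dependency edge.
 for dep in resp.json()["sentences"][0]["basicDependencies"]:
     print(f'{dep["dep"]}({dep["governorGloss"]}, {dep["dependentGloss"]})')
</code></pre>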
In "Adversarial Examples for Evaluating Reading Comprehension Systems" <a href="https://arxiv.org/abs/1707.07328" rel="nofollow">https://arxiv.org/abs/1707.07328</a>, it was found that adding a single distracting sentence can lower F1 score of BiDAF (which is used in demo here) from 75.5% to 34.3% on SQuAD. In comparison, human performance goes from 92.6% to 89.2%.