DeepMind uses the Daily Mail as a huge training corpus for text comprehension

49 points by ilyaeck almost 10 years ago

10 comments

paulsutter almost 10 years ago
The original DeepMind paper [1] is based on a really smart idea. Algorithmic development relies on measuring the performance of any proposed algorithm. For reading comprehension, performance is evaluated using Q&A about the corpus. It's difficult to find a large corpus with a comprehensive set of questions about the content.

DeepMind is cleverly converting the Daily Mail article summaries into questions by removing a proper noun. For example:

Question: Producer X will not press charges against Jeremy Clarkson, his lawyer says.

Answer: Oisin Tymon

They are using the Daily Mail corpus to develop their algorithm, and that's smart. They aren't relying on it as an important source of information. Maybe all you guys with the dismissive comments have a better idea?

[1] http://arxiv.org/pdf/1506.03340.pdf

EDIT: Thanks Otik, reworded the opening sentence
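
A minimal Python sketch of the summary-to-question idea described above, assuming the proper noun to remove is already known. It does not reproduce the paper's actual pipeline (how entities are detected is not shown here), and `make_cloze`, `entity`, and `placeholder` are hypothetical names used only for illustration:

```python
from typing import Tuple


def make_cloze(summary: str, entity: str, placeholder: str = "X") -> Tuple[str, str]:
    """Turn one summary sentence into a (question, answer) pair.

    Assumption: `entity` is a proper noun that someone (or some tagger)
    has already picked out of the summary.
    """
    if entity not in summary:
        raise ValueError(f"{entity!r} does not occur in the summary")
    # Replace only the first occurrence so the rest of the sentence is untouched.
    question = summary.replace(entity, placeholder, 1)
    return question, entity


summary = (
    "Producer Oisin Tymon will not press charges against "
    "Jeremy Clarkson, his lawyer says."
)
question, answer = make_cloze(summary, "Oisin Tymon")
print("Question:", question)
print("Answer:", answer)
```

The point of the trick is that the answer comes for free: every summary sentence containing a proper noun yields a labelled question without any manual annotation.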
johntaitorg almost 10 years ago
Another use for the Daily Mail: https://www.youtube.com/watch?v=xPlEIryW8zA
Animats almost 10 years ago
The Daily Mail? Not the Times?
peteretep almost 10 years ago
If you thought institutionalised racism, sexism and chauvinism were bad now, the singularity is going to *suck*.
SixSigma almost 10 years ago
This is how Judge Death starts.

The crime is life, the sentence death.
KaiserPro almost 10 years ago
I know why they did it: because it's the lowest common denominator for celeb gossip.

However, it's shit for science, overtly paedophilic, and the only Western news site that seems to have a special section devoted to curating and promoting ISIS propaganda.

Why is it bad? Because if you are looking for facts, the Daily Mail is a bad source.

If you are looking for natural language, it's a good source; however, it's full of nuanced racism, sexism, classism & basically everything else that's wrong with Britain.

It'll be good at describing house prices though.
latenightcoding almost 10 years ago
This is great. Are they modelling a neural network to detect bullshit?
nbevans almost 10 years ago
Somewhat worrying that they are feeding DeepMind a diet of Daily Mail articles!

Maybe this is why Skynet turned rogue. Reading daily trash about celebrities, and trash that induces body image dysmorphia, is enough to make anyone go mad.
jacknews almost 10 years ago
Haha, absolutely classic! What will it end up comprehending? David Beckham's love life? How brown-skinned foreigners are taking all the jobs? Etc.
Fede_V almost 10 years ago
Great, so we will have a racist and reactionary AI that thinks celebrity gossip and immigrant bashing are the most important things in the world.

For the non-AIs reading this thread, I highly recommend https://addons.mozilla.org/en-US/firefox/addon/kitten-block/, a plugin that replaces every Daily Mail link with a random picture from kittens & tea, in case you mistakenly click on a Daily Mail link.

Snark aside, the paper is really cool though :)