TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

The Algorithm That Can Crack CAPTCHA (and is changing AI)

24 点作者 urs2102超过 11 年前

4 条评论

fchollet超过 11 年前
Being familiar with Dileep&#x27;s past research: the algorithm used by Vicarious here is very likely based on the Hierarchical-Temporal Memory (HTM) technology developed by Hawkins, Dileep and the folks at Numenta [1].<p>HTM is documented in a Numenta white paper [2] and in an older form in Dileep&#x27;s PhD thesis [3]. Numenta has also open-sourced an implementation of HTM, NuPIC [4].<p>It is important to note that, while HTM is a very interesting approach to machine learning, it has never been shown to beat (or even concurrence) the state of the art on any problem.<p>I have been following Numenta&#x2F;Grok&#x27;s progress closely because they seem to me to have the most potential for pushing this tech forward. Also I like their open-source strategy compared to the 100% secret approach of Vicarious. Vicarious has an impressively bombastic PR (that you can see in action in the above article) but basically nothing to show for it (until now, that is, if this breaking of CAPTCHA turns out to be real and the be more reliable than existing state-of-the-art CAPTCHA-breaking algorithms).<p>Long before they had any working technology, Vicarious founders had been throwing around quotes such as: &quot;[our goal is] in five years to make a human-level vision system. Anything a human can recognize visually our algorithm would recognize visually.&quot; (2011) [5]<p>And today, after creating a CAPTCHA-breaking program (which has already been done in the past; I&#x27;d like to see a benchmark comparison), they&#x27;re basically claiming they&#x27;ve created a human brain.<p>They also claim they will have a strong AI by 2026 [6]. Bombastic much?<p>[1] <a href="https://groksolutions.com/landing-page.html" rel="nofollow">https:&#x2F;&#x2F;groksolutions.com&#x2F;landing-page.html</a><p>[2] <a href="http://numenta.org/resources/HTM_CorticalLearningAlgorithms.pdf" rel="nofollow">http:&#x2F;&#x2F;numenta.org&#x2F;resources&#x2F;HTM_CorticalLearningAlgorithms....</a><p>[3] <a href="http://alpha.tmit.bme.hu/speech/docs/education/02_DileepThesis.pdf" rel="nofollow">http:&#x2F;&#x2F;alpha.tmit.bme.hu&#x2F;speech&#x2F;docs&#x2F;education&#x2F;02_DileepThes...</a><p>[4] <a href="http://numenta.org/nupic.html" rel="nofollow">http:&#x2F;&#x2F;numenta.org&#x2F;nupic.html</a><p>[5] <a href="http://blogs.wsj.com/venturecapital/2011/02/10/vicarious-systems-says-its-artificial-intelligence-is-the-real-deal/" rel="nofollow">http:&#x2F;&#x2F;blogs.wsj.com&#x2F;venturecapital&#x2F;2011&#x2F;02&#x2F;10&#x2F;vicarious-sys...</a><p>[6] <a href="http://www.artificialbrains.com/vicarious" rel="nofollow">http:&#x2F;&#x2F;www.artificialbrains.com&#x2F;vicarious</a>
ColinWright超过 11 年前
I point you here:<p><a href="https://news.ycombinator.com/item?id=6633515" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6633515</a><p>Quoting:<p>The popularity of this story evidences how universally hated are CAPTCHAs. here are some of the submissions:<p><a href="https://news.ycombinator.com/item?id=6625245" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6625245</a> (forbes.com)<p><a href="https://news.ycombinator.com/item?id=6625247" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6625247</a> (kurzweilai.net)<p><a href="https://news.ycombinator.com/item?id=6625351" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6625351</a> (technologyreview.com) &lt;- Main discussion<p><a href="https://news.ycombinator.com/item?id=6626405" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6626405</a> (vimeo.com) &lt;- video of process in action<p><a href="https://news.ycombinator.com/item?id=6627848" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6627848</a> (vicariousinc.tumblr.com)<p><a href="https://news.ycombinator.com/item?id=6628086" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6628086</a> (cbc.ca)<p><a href="https://news.ycombinator.com/item?id=6628092" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6628092</a> (wired.com)<p><a href="https://news.ycombinator.com/item?id=6629173" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6629173</a> (wired.com)<p><a href="https://news.ycombinator.com/item?id=6629559" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6629559</a> (newscientist.com)<p><a href="https://news.ycombinator.com/item?id=6629656" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6629656</a> (dailydot.com)<p><a href="https://news.ycombinator.com/item?id=6629708" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6629708</a> (mashable.com)<p>========<p>There are two other comments worth reading.<p><a href="https://news.ycombinator.com/item?id=6629173" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6629173</a>:<p><pre><code> That&#x27;s a lot of fancy words to say that they overfit to their training data. That makes this sound like a very typical result in supervised machine learning (if it&#x27;s a result at all). They have used an algorithm to learn a brittle heuristic that works in the cases it was trained to work on.</code></pre>
andrewcooke超过 11 年前
anyone have any actual information on the algorithm? this is just fluff.
评论 #6741724 未加载
评论 #6741694 未加载
评论 #6743199 未加载
vezzy-fnord超过 11 年前
Is this really some master algorithm that can crack all kinds of CAPTCHAs?<p>CAPTCHA is pretty much an umbrella term for a variety of anti-spam protection and human verification procedures, many of them having nothing in common beyond the name.<p>Thus &quot;cracking CAPTCHA&quot; is meaningless unless you specify what type(s).<p>You&#x27;d need textual analysis, sentiment analysis, expression evaluation, OCR, full-scale image recognition, puzzle solving and a variety of other techniques to scratch most implementations. So it can&#x27;t be one algorithm. It&#x27;d have to be many.<p>Then in most cases it&#x27;d probably be simpler to figure out some client-side exploit in the way CAPTCHAs handle and send data. Find a weak link, much like DC949 did with reCAPTCHA.