TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

word2vec in yhat: Word vector similarity

59 点作者 dfrodriguez143超过 11 年前

6 条评论

Radim超过 11 年前
For people interested in a cleaned-up, commented and de-obfuscated word2vec, I recently ported the original C code to Python [1].<p>My HN submission of this endeavour received no love, but I think it&#x27;s worthwhile nevertheless as the Python code is not only more concise, readable and extendable, but the training&#x27;s actually faster too [2].<p>[1] <a href="https://github.com/piskvorky/gensim/blob/develop/gensim/models/word2vec.py" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;piskvorky&#x2F;gensim&#x2F;blob&#x2F;develop&#x2F;gensim&#x2F;mode...</a><p>[2] <a href="http://radimrehurek.com/2013/09/word2vec-in-python-part-two-optimizing/" rel="nofollow">http:&#x2F;&#x2F;radimrehurek.com&#x2F;2013&#x2F;09&#x2F;word2vec-in-python-part-two-...</a>
评论 #6470855 未加载
评论 #6471702 未加载
评论 #6473321 未加载
judk超过 11 年前
Word2vec seemed intuitively obvious me, but I really have a hard time believing that it works in only 1000 dimensions, generating results beyond cherry picked demo examples.<p>Are there really only 1000 independent concepts in the English language?
评论 #6470501 未加载
评论 #6471342 未加载
评论 #6470610 未加载
3JPLW超过 11 年前
Very cool. I missed the original word2vec software discussion back in August: <a href="https://news.ycombinator.com/item?id=6216044" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=6216044</a><p>And the paper itelf is a very worthwhile read: <a href="http://arxiv.org/abs/1301.3781" rel="nofollow">http:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;1301.3781</a>
dhammack超过 11 年前
The vectors learned from word2vec are pretty amazing. A few days after the tool was released I wrote a script which uses the vector representations to figure out which word in a list isn&#x27;t like the others [1]. Things like:<p>-&gt;math shopping reading science<p>I think shopping doesnt belong in this list!<p>-&gt;rain snow sleet sun<p>I think sun doesnt belong in this list!<p>etc.<p>[1] <a href="https://github.com/dhammack/Word2VecExample" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;dhammack&#x2F;Word2VecExample</a>
gojomo超过 11 年前
Eventually computers will be talking about us behind our backs in these high-dimensional vectors, only occasionally translating down to English approximations, to humor us. &quot;Goo goo, gah gah, human?&quot;
评论 #6471838 未加载
gojomo超过 11 年前
Cool web demo powered by word2vec, by Christopher Moody:<p><a href="http://thisplusthat.me/" rel="nofollow">http:&#x2F;&#x2F;thisplusthat.me&#x2F;</a>