TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Trying and Failing to Interpret Embeddings

2 点作者 tedtimbrell4 个月前

2 条评论

PaulHoule4 个月前
I remember riding the bus from Ithaca down to the office of an AI startup at Union Square the summer BERT came out and trying to train a scikit-learn classifier on my laptop to classify Word2Vec embeddings on things like: part of speech, &quot;is this a color word?&quot;, etc. and found I just couldn&#x27;t. If I had a tiny number of examples (&lt;10) it might seem to work but if I put more examples in it would break.<p>My conclusion was that &quot;Word2Vec sucks&quot;, probably a lot of people tried the same thing and either came to that conclusion or thought they did something wrong. People don&#x27;t usually publish negative results so I&#x27;ve never read about anybody doing it. It takes bravery. Great work!<p>The diagrams on this page are a disgrace in my mind<p><a href="https:&#x2F;&#x2F;nlp.stanford.edu&#x2F;projects&#x2F;glove&#x2F;" rel="nofollow">https:&#x2F;&#x2F;nlp.stanford.edu&#x2F;projects&#x2F;glove&#x2F;</a><p>what it comes down to is that they are projecting down from a N=50 space to an N=2 space. You have a lot of dimensions to play with so if you have, say 20 points, you can find some projection where those points lie wherever you want, even if it was just a random point cloud.<p>It&#x27;s really a lie because if they tried to map 100 cities to their ZIP codes it wouldn&#x27;t work at all, that&#x27;s what I found trying to make classifiers.
评论 #42660754 未加载
minimaxir4 个月前
The main reason the vector arithmetic of Word2Vec worked is due to how it was trained (directly training the network with a shallow network such that the entire knowledge for the model is contained within the embeddings). This is not the case with any modern embedding model.<p>At most with current models, you can average embeddings together.
评论 #42660565 未加载