I'm not fond of the "magic AI does everything" narrative, especially since the code is available on GitHub (https://github.com/overlap-ai/words2map) and it's not magic. That said, the code is optimized for efficient memory usage (important with the pre-built word2vec models), and since it's MIT-licensed, I might be able to develop a few pretty visualizations. :)
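For example, one common memory-saving trick when loading pre-built word2vec binaries is gensim's `limit` parameter, which caps the vocabulary at the most frequent entries. A minimal sketch; the file name and cutoff are illustrative, not taken from the repo:

```python
# Sketch: loading a pre-built word2vec model with bounded memory via gensim.
from gensim.models import KeyedVectors

# limit=500_000 keeps only the 500k most frequent vectors,
# trading vocabulary coverage for a much smaller RAM footprint.
vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin",  # illustrative path
    binary=True,
    limit=500_000,
)
print(vectors.most_similar("cyborg", topn=5))
```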
"We are now at a point in history when algorithms can learn, like people, about pretty much anything. " seems pretty disingenuously worded.<p>One infers from a quick read ~"Algorithms are now like people, and can learn about anything." But careful parsing of the commas shows that the sentence is true, but in the precise sense that "People can learn about anything. Now, algorithms can also learn about anything." - and the extent of learning/understanding is not being compared.<p>Perhaps I'm nit-picking, but this statement appears to have been constructed to support an AI pitch, and is literally true, but no 'actual AI' is involved (and no-one is actually claiming it is... unless you /want to believe/).
Question to the Y-hat folks: why cluster in 2D? Granted, clustering in 300D is hard :) Still, the 2D projection must add significant metric distortion. Why not a middle ground, say 5-10D?
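For illustration, here is a rough sketch of that middle ground with sklearn: reduce to ~10D, cluster there, and project to 2D only for display. Random data stands in for the 300D word vectors, and this pipeline is my assumption, not the words2map code:

```python
# Sketch: cluster in a mid-dimensional space, project to 2D only for plotting.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans
from sklearn.manifold import TSNE

X = np.random.randn(1000, 300)  # placeholder for 300D word vectors

# Cluster in 10D: far less metric distortion than clustering in 2D.
X_mid = PCA(n_components=10).fit_transform(X)
labels = KMeans(n_clusters=8, n_init=10).fit_predict(X_mid)

# The 2D projection is then purely cosmetic, used only for the visualization.
X_2d = TSNE(n_components=2).fit_transform(X_mid)
```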
Nitpicking:
It's NOT (human + robot) ≈ cyborg BUT average(human, robot) ≈ cyborg.

Some things that come to mind:

I'd be interested to see other vector operations in the examples, such as the projection of one word onto another. Also, so far it's only nouns.

How is ≈ defined, given that the nearest word vector is not necessarily unique?

Finally, what proportion of averaged words retain a human meaning, versus nonsense? What are the most "meaningful" words, in that sense?
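On the ≈ question: in most word2vec tooling, ≈ is simply the top-ranked nearest neighbor by cosine similarity, which indeed need not be unique in case of ties. A minimal sketch of the averaging reading using gensim's KeyedVectors; the model file is an illustrative assumption:

```python
# Sketch: "≈" read as nearest neighbor (by cosine) to the mean of two vectors.
from gensim.models import KeyedVectors

vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin", binary=True)  # illustrative path

mean = (vectors["human"] + vectors["robot"]) / 2.0

# similar_by_vector ranks the vocabulary by cosine similarity to `mean`,
# so "≈ cyborg" means "cyborg is the top-1 neighbor" — a ranking, not an identity.
print(vectors.similar_by_vector(mean, topn=5))
```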
How is this different from t-SNE?

https://lvdmaaten.github.io/tsne/

Anyone looking for an explanation of word2vec may find this helpful:

http://deeplearning4j.org/word2vec
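For context, t-SNE by itself only handles the projection step, mapping high-dimensional vectors down to 2D; any clustering or labeling has to be layered on top. A minimal sklearn sketch, with random vectors standing in for word embeddings:

```python
# Sketch: t-SNE is just dimensionality reduction, not a full pipeline.
import numpy as np
from sklearn.manifold import TSNE

X = np.random.randn(500, 300)  # placeholder for word vectors
coords = TSNE(n_components=2, perplexity=30).fit_transform(X)
print(coords.shape)  # (500, 2): one 2D point per input vector
```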
Hi, I was in the middle of creating "user personalities" using k-means clustering.

Is it OK to reference your document in our papers?
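For anyone curious about that kind of approach, a minimal k-means sketch with sklearn; the feature matrix is a made-up stand-in for real user data:

```python
# Sketch: grouping users into "personalities" with k-means.
import numpy as np
from sklearn.cluster import KMeans

users = np.random.rand(200, 20)  # 200 users, 20 hypothetical behavioral features
km = KMeans(n_clusters=5, n_init=10).fit(users)

print(km.labels_[:10])            # cluster id ("personality") per user
print(km.cluster_centers_.shape)  # (5, 20): one centroid per personality
```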
The MIT license is awesome and lets us reuse your tech. Our site is at www.shoten.xyz if you're interested in what we're doing.
human + robot ≈ cyborg

electricity + silicon ≈ solar cells

virtual reality + reality ≈ augmented reality

--

These always seem impressive in word vector models, but in reality I imagine "robot" and "cyborg" were already pretty close. The fact that adding "human" nudged the vector closer is likely not as meaningful as it would be nice to believe. The same goes for "electricity/solar cells" and "virtual reality/augmented reality."

Still, it's a really nice application of word2vec, and I'm looking forward to seeing other similarly practical implementations in the future.
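That hunch is easy to check: compare the baseline cosine similarity of "robot" and "cyborg" directly against the neighbors of the averaged vector. A hedged gensim sketch; the model path is assumed, not from the post:

```python
# Sketch: test whether "cyborg" was already close to "robot" before averaging.
from gensim.models import KeyedVectors

vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin", binary=True)  # illustrative path

# Baseline: how close are the two words without any arithmetic?
print(vectors.similarity("robot", "cyborg"))

# Does averaging in "human" actually change the top neighbor, or just confirm it?
mean = (vectors["human"] + vectors["robot"]) / 2.0
print(vectors.similar_by_vector(mean, topn=5))
```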