hm. I'd like to believe, but the arguments here seem a bit obtuse.<p>No one measures vector distance using the Hamming distance on binary representations. That's silly. We use L1 or L2, usually, and the binary encoding of the numbers is irrelevant.<p>It sounds like the LSH is maaaaybe equivalent to vector quantization, in which case this would be a form of regularization, which sometimes works well and sometimes meh.
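To make that concrete, here's a toy example of my own (not from the article) showing why Hamming distance on raw float encodings tells you almost nothing about numeric distance:

    import struct

    def hamming_bits(a, b):
        # Reinterpret each float64 as a raw 64-bit pattern and count differing bits.
        ia = struct.unpack("<Q", struct.pack("<d", a))[0]
        ib = struct.unpack("<Q", struct.pack("<d", b))[0]
        return bin(ia ^ ib).count("1")

    print(hamming_bits(1.0, 1.0000001))  # tiny L2 gap, yet many bits differ
    print(hamming_bits(1.0, -1.0))       # L2 gap of 2, yet only the sign bit differs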
I like to speculate about reasons this might or might not make sense at several levels, though I'm mostly just conjecturing. The fact that it works at all is very interesting, but it seems so hard to come up with something concrete.<p>You have a map from some high-dimensional vector space ~ k^N -> H, some space of hashes. H sort of looks one dimensional. I assume the interesting geometry of your training data actually lies on a relatively low-dimensional subvariety/subset of k^N, so maybe it's not actually that bad? It could be a really twisted and complicated curve.<p>However, you still need to somehow preserve the relative structure, right? Things that are far apart in k^N need to be far apart in H. Seems like you want the map to at least approximately be an isometry, although there are things like space-filling curves that might do this to some degree.<p>Also, maybe even though H looks low dimensional, it can actually capture quite a bit (if your data is encoded as coefficients of powers of 2, you could think of the powers of 2 as some sort of basis, so maybe it is also pretty high dimensional).
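To make the "approximate isometry" point concrete, here's a toy random-hyperplane LSH (SimHash) sketch of my own: in expectation the fraction of differing hash bits equals the angle between the original vectors divided by pi, so angular structure survives the trip to H on average:

    import numpy as np

    rng = np.random.default_rng(0)
    d, b = 128, 256                       # input dimension, number of hash bits
    planes = rng.standard_normal((b, d))  # one random hyperplane per output bit

    def simhash(x):
        # Each bit records which side of a random hyperplane x falls on.
        return (planes @ x > 0).astype(np.uint8)

    def angle(u, v):
        c = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
        return np.arccos(np.clip(c, -1.0, 1.0))

    x = rng.standard_normal(d)
    y = x + 0.3 * rng.standard_normal(d)  # a nearby point
    z = rng.standard_normal(d)            # an unrelated point

    for name, v in [("near", y), ("far", z)]:
        ham = np.sum(simhash(x) != simhash(v))
        # In expectation, ham / b == angle(x, v) / pi, so relative structure survives.
        print(name, ham / b, angle(x, v) / np.pi)

Space-filling curves, for what it's worth, only guarantee one direction (nearby on the curve implies nearby in space, not the converse), which is part of why people usually settle for the "in expectation" guarantee above.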
Contrastive and triplet loss are pretty cool for generating hashes. I'd imagine the trick they are alluding to is to rewrite the loss function to be more aware of locality instead of just trying to minimize/maximize distance.<p>Or they are just shingling different ML hash functions, which is kinda lazy.
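For concreteness, here's the kind of setup I mean (my own PyTorch sketch, not necessarily what the article is hinting at): a tanh relaxation of sign() trained with a triplet loss plus a quantization penalty, so Hamming distance on the thresholded codes tracks the learned similarity:

    import torch
    import torch.nn as nn

    class HashNet(nn.Module):
        """Maps an embedding to `bits` soft bits in (-1, 1); threshold at 0 for the code."""
        def __init__(self, dim=128, bits=64):
            super().__init__()
            self.proj = nn.Linear(dim, bits)

        def forward(self, x):
            return torch.tanh(self.proj(x))  # differentiable relaxation of sign()

    net = HashNet()
    triplet = nn.TripletMarginLoss(margin=8.0)

    anchor, positive, negative = (torch.randn(32, 128) for _ in range(3))
    a, p, n = net(anchor), net(positive), net(negative)

    # Pull codes of similar items together, push dissimilar ones at least `margin`
    # apart, and add a small quantization penalty nudging soft bits toward +/-1 so
    # the thresholded codes agree with what the loss optimized.
    loss = triplet(a, p, n) + 0.1 * (a.abs() - 1).pow(2).mean()
    loss.backward()

    with torch.no_grad():
        codes = (net(anchor) > 0).to(torch.uint8)  # final binary hash codes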
Hi, my interest got piqued. I'm developing a similarity feature where I compare embeddings of a sentence and its translation. I wanted to know if the hashing method would be faster than the PyTorch multiplication with which I get the sentence similarities. Going from strings to bytes, hashing, and comparing is very fast. But if I get the embeddings, turn them into bytes, hash them, and compare them, both methods take almost the same time.<p>I used this Python library: <a href="https://github.com/trendmicro/tlsh" rel="nofollow">https://github.com/trendmicro/tlsh</a>.
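Roughly how I set up the comparison, with random tensors standing in for the real embeddings (and I'm going from memory on the tlsh calls, hash() and diff(), so check the README):

    import time
    import torch
    import torch.nn.functional as F
    import tlsh  # Python bindings from the linked repo; hash()/diff() from memory

    emb_a = torch.randn(1000, 384)  # stand-ins for sentence / translation embeddings
    emb_b = torch.randn(1000, 384)

    # Method 1: plain tensor math, one batched cosine similarity.
    t0 = time.perf_counter()
    sims = F.cosine_similarity(emb_a, emb_b, dim=1)
    t1 = time.perf_counter()

    # Method 2: per-row bytes -> TLSH digest -> pairwise diff score.
    t2 = time.perf_counter()
    hashes_a = [tlsh.hash(row.numpy().tobytes()) for row in emb_a]
    hashes_b = [tlsh.hash(row.numpy().tobytes()) for row in emb_b]
    diffs = [tlsh.diff(ha, hb) for ha, hb in zip(hashes_a, hashes_b)]
    t3 = time.perf_counter()

    print(f"cosine: {t1 - t0:.4f}s, tlsh: {t3 - t2:.4f}s")

My guess for why the two come out about even: the tensor path is one batched op, while the hashing path pays per-row serialization and hashing costs before it ever gets to the cheap comparison.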
This idea goes back to "sparse distributed memory", developed by NASA research in the 80s. It's a content-addressable memory where content hashes are encoded & decoded by a neural network, similar items sit in proximity to each other in the embedding space, and similarity is measured via Hamming distance. <a href="https://en.wikipedia.org/wiki/Sparse_distributed_memory" rel="nofollow">https://en.wikipedia.org/wiki/Sparse_distributed_memory</a>
using fancy neural nets to learn hash functions from data is indeed pretty cool, but fitting hash functions to data isn't new. see "perfect hash functions."<p>lsh is most famously used for approximating jaccard distances, which, even if you're not looking at lengths or distances in l1 or l2, is still a vector operation.<p>lsh is best described in jeff ullman's mining massive datasets textbook (available free online), which describes how it was used for webpage deduplication in the early days at google.
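the jaccard case is easy to sketch from scratch: minhash signatures, where the fraction of agreeing positions estimates the jaccard similarity (my own bare-bones version, character shingles):

    import random

    def shingles(text, k=4):
        return {text[i:i + k] for i in range(len(text) - k + 1)}

    def minhash_signature(items, num_hashes=128, seed=42):
        # One (a, b) pair per hash function; h_i(x) = (a*x + b) mod a large prime.
        rng = random.Random(seed)
        prime = (1 << 61) - 1
        params = [(rng.randrange(1, prime), rng.randrange(prime)) for _ in range(num_hashes)]
        return [min((a * hash(s) + b) % prime for s in items) for a, b in params]

    def estimated_jaccard(sig1, sig2):
        # Fraction of positions where the min-hashes agree estimates |A∩B| / |A∪B|.
        return sum(x == y for x, y in zip(sig1, sig2)) / len(sig1)

    a = shingles("locality sensitive hashing for near-duplicate pages")
    b = shingles("locality sensitive hashing for near-duplicate webpages")
    print(estimated_jaccard(minhash_signature(a), minhash_signature(b)))
    print(len(a & b) / len(a | b))  # exact jaccard, for comparison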
Whoever wrote the article must have done a cursory search at best; I'm surprised they didn't mention semantic hashing by Salakhutdinov & Hinton (2007): <a href="https://www.cs.utoronto.ca/~rsalakhu/papers/semantic_final.pdf" rel="nofollow">https://www.cs.utoronto.ca/~rsalakhu/papers/semantic_final.p...</a><p>Edit: also, since we're talking about LSH, check out the FAISS library <a href="https://github.com/facebookresearch/faiss" rel="nofollow">https://github.com/facebookresearch/faiss</a> and the current SOTA at <a href="http://ann-benchmarks.com/" rel="nofollow">http://ann-benchmarks.com/</a>
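FAISS ships a plain LSH index if you just want to try binary codes without rolling your own; something like this (from memory, check the FAISS wiki for exact defaults):

    import numpy as np
    import faiss

    d, nbits = 128, 256
    xb = np.random.random((10000, d)).astype("float32")  # database vectors
    xq = np.random.random((5, d)).astype("float32")      # query vectors

    index = faiss.IndexLSH(d, nbits)  # random-projection binary codes
    index.add(xb)
    D, I = index.search(xq, 10)       # distances between codes and ids of the 10 nearest
    print(I[0])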
> "_If this peaked your interest_"<p>It didn't.[1]<p>[1]: <a href="https://www.merriam-webster.com/words-at-play/pique-vs-peak-vs-peek" rel="nofollow">https://www.merriam-webster.com/words-at-play/pique-vs-peak-...</a>