TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: I made a Pinterest clone using SigLIP image embeddings

98 pointsby verseover 1 year ago
Click an image to get similar images.<p>I crawled Tumblr and used SigLIP to get vector embeddings for many images.<p>When you click an image, it finds the most similar vector embeddings in the database, and returns the corresponding images.

11 comments

yorwbaover 1 year ago
Sometimes there are duplicate results, e.g. <a href="https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;0b733fc2-7093-4443-8872-016961c54000" rel="nofollow">https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;0b733fc2-7093-4443-8872...</a> has two copies of <a href="https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;f920a599-bbd7-4805-3317-4a8531fb5800" rel="nofollow">https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;f920a599-bbd7-4805-3317...</a> right next to each other. (The link UUID is the same, so I assume this is an issue with the search algorithm, not simply duplicate data that got scraped.)
评论 #39402561 未加载
lulzxover 1 year ago
Also, check <a href="https:&#x2F;&#x2F;same.energy&#x2F;" rel="nofollow">https:&#x2F;&#x2F;same.energy&#x2F;</a>
wucaworldover 1 year ago
Very cool! How did you get the collage layout? I noticed images in each column don’t have the same size. I assume images get Centre cropped?
评论 #39396794 未加载
omezeover 1 year ago
Cool! I haven’t tried SigLIP out yet but it seems to be the new hotness over CLIP… I just dont have a good project idea yet
Tiberiumover 1 year ago
Is there a repo, especially for training? I&#x27;d like to see how SigLIP performs on a dataset of only anime images.
评论 #39400226 未加载
gammalostover 1 year ago
There are some interesting images there. Why are you not including the source of the images?
GamerAliasover 1 year ago
Good stuff! Do you have any intuitive sense of whether SigLIP is particularly stronger than CLIP here? Also vector DB over Faiss index?
评论 #39401761 未加载
评论 #39401830 未加载
squamover 1 year ago
Cool project! Thanks for sharing
Yenrabbitover 1 year ago
Neat! How many images are in the dataset out of curiosity?
convolvatronover 1 year ago
how far we&#x27;ve come since <a href="https:&#x2F;&#x2F;www.karlsims.com&#x2F;genetic-images.html" rel="nofollow">https:&#x2F;&#x2F;www.karlsims.com&#x2F;genetic-images.html</a><p>quite a bit, but surprisingly not
ijhuygft776over 1 year ago
nice, we always need more clones and improvements.... hope you get traction.<p>I never click Pinterest links because the experience is too bad.
评论 #39401465 未加载