TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: I made a Pinterest clone using SigLIP image embeddings

98 点作者 verse超过 1 年前
Click an image to get similar images.<p>I crawled Tumblr and used SigLIP to get vector embeddings for many images.<p>When you click an image, it finds the most similar vector embeddings in the database, and returns the corresponding images.

11 条评论

yorwba超过 1 年前
Sometimes there are duplicate results, e.g. <a href="https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;0b733fc2-7093-4443-8872-016961c54000" rel="nofollow">https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;0b733fc2-7093-4443-8872...</a> has two copies of <a href="https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;f920a599-bbd7-4805-3317-4a8531fb5800" rel="nofollow">https:&#x2F;&#x2F;mood-amber.vercel.app&#x2F;images&#x2F;f920a599-bbd7-4805-3317...</a> right next to each other. (The link UUID is the same, so I assume this is an issue with the search algorithm, not simply duplicate data that got scraped.)
评论 #39402561 未加载
lulzx超过 1 年前
Also, check <a href="https:&#x2F;&#x2F;same.energy&#x2F;" rel="nofollow">https:&#x2F;&#x2F;same.energy&#x2F;</a>
wucaworld超过 1 年前
Very cool! How did you get the collage layout? I noticed images in each column don’t have the same size. I assume images get Centre cropped?
评论 #39396794 未加载
omeze超过 1 年前
Cool! I haven’t tried SigLIP out yet but it seems to be the new hotness over CLIP… I just dont have a good project idea yet
Tiberium超过 1 年前
Is there a repo, especially for training? I&#x27;d like to see how SigLIP performs on a dataset of only anime images.
评论 #39400226 未加载
gammalost超过 1 年前
There are some interesting images there. Why are you not including the source of the images?
GamerAlias超过 1 年前
Good stuff! Do you have any intuitive sense of whether SigLIP is particularly stronger than CLIP here? Also vector DB over Faiss index?
评论 #39401761 未加载
评论 #39401830 未加载
squam超过 1 年前
Cool project! Thanks for sharing
Yenrabbit超过 1 年前
Neat! How many images are in the dataset out of curiosity?
convolvatron超过 1 年前
how far we&#x27;ve come since <a href="https:&#x2F;&#x2F;www.karlsims.com&#x2F;genetic-images.html" rel="nofollow">https:&#x2F;&#x2F;www.karlsims.com&#x2F;genetic-images.html</a><p>quite a bit, but surprisingly not
ijhuygft776超过 1 年前
nice, we always need more clones and improvements.... hope you get traction.<p>I never click Pinterest links because the experience is too bad.
评论 #39401465 未加载