TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

Show HN: Sisi – Semantic Image Search CLI tool, locally without third party APIs

128 points by zcbenz 8 months ago
I wrote this tool to get familiar with the CLIP model. I know many people have written similar tools with CLIP before, but I'm new to machine learning and writing a classic tool helps my study.

The unusual thing about my version is that it is in pure Node.js, with the power of node-mlx, a Node.js machine learning framework.

The repo in the link is mostly about implementing indexing and the CLI; the code of the model implementation lives in a Node.js module: https://github.com/frost-beta/clip

Hope this helps other learners!
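
For other learners, here is a minimal sketch of the general idea behind embedding-based image search: every image and the text query are mapped to vectors, and images are ranked by cosine similarity to the query. This is not the code from the linked repos. The toy 3-dimensional vectors, the file names, and the `rank_images` helper are all hypothetical stand-ins for real CLIP embeddings and a real index.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)

def rank_images(query_embedding, index, top_k=3):
    """Return up to top_k (path, score) pairs, best match first."""
    scored = [
        (path, cosine_similarity(query_embedding, embedding))
        for path, embedding in index.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]

# Hypothetical toy data: a real index would hold CLIP image embeddings.
index = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}
# Pretend this is the text embedding of the query "a cat".
query = [1.0, 0.2, 0.0]

results = rank_images(query, index, top_k=2)
for path, score in results:
    print(f"{path}: {score:.3f}")
```

In a real tool the embeddings would come from the model's image and text encoders, and the index would be stored on disk rather than in a dict.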

11 comments

notsylver 8 months ago
I was planning to do this myself lol. I was going to use SQLite as the index, and use `sqlite-vec` or something similar to query for similar files directly. I think the only other thing I was planning was more filters: `"positive term" -"negative term"` to be able to negate results, `>90"search"` to find images that match by >90%, and some generic filters like `--size >1mb` to help narrow it down when you are looking for a specific image. Quantizing embeddings to make them smaller/faster also seemed interesting, but I haven't tried doing it yet.
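
The quantization idea mentioned above could look roughly like the sketch below, which assumes simple symmetric int8 quantization with one scale factor per vector. That is one common scheme, not something taken from Sisi or `sqlite-vec`; the function names and the sample embedding are hypothetical.

```python
def quantize_int8(vector):
    """Map floats to integers in [-127, 127] using one scale per vector."""
    max_abs = max((abs(x) for x in vector), default=0.0)
    if max_abs == 0.0:
        return [0] * len(vector), 1.0  # all-zero vector: any scale works
    scale = max_abs / 127.0
    return [round(x / scale) for x in vector], scale

def dequantize_int8(codes, scale):
    """Approximate the original floats from the integer codes."""
    return [code * scale for code in codes]

# Hypothetical 4-dimensional embedding; real CLIP embeddings are much longer.
embedding = [0.12, -0.5, 0.33, 0.0]
codes, scale = quantize_int8(embedding)
restored = dequantize_int8(codes, scale)

# The rounding error per component is at most half a quantization step.
max_error = max(abs(a - b) for a, b in zip(embedding, restored))
print(codes, scale, max_error)
```

Each component then fits in one byte instead of a 4-byte float, at the cost of a small rounding error in the similarity scores.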
progx 8 months ago
It uses only 1 core at 100% under Linux; can this be changed?

10 images, each ~20 KB in size, took more than 10 minutes to index. Is that normal without GPU acceleration?
spullara 8 months ago
Very cool! Here is a similar Python version.

https://github.com/spullara/photoindex

Oh, and if you want to run something locally on your iPhone, you can use my app, which I am still testing:

https://x.com/getrememberwhen
sureIy 8 months ago
This is cool. Is there also a way to show the contents of the image as indexed? i.e. image 1 has a cat and a dog.

There are a lot of tools/apps that let you "search images" but not many that let you just as easily "read images".
kjeldsendk 8 months ago
I have wanted to clean up my photo collection for ages and remove any NSFW pictures that might be hiding somewhere.

Would this be able to do that, and how likely is it that it will see a PC release?
petesergeant 8 months ago
I've been enjoying https://github.com/mazzzystar/Queryable on iPhone.
y04nn 8 months ago
How does CLIP compare to YOLO[1]? I haven't looked into image classification/object recognition for a while, but I remember that YOLO was quite good and worked on realtime video too.

[1]: https://pjreddie.com/darknet/yolo/
yburkov 8 months ago
I'm using a similar app, rclip: https://github.com/yurijmikhalevich/rclip
netdur 8 months ago
I have made a similar Android app for semantic image search. It works offline too. I'm still gathering feedback and polishing the UI, but it works. If you are brave enough, here it is: https://drive.google.com/file/d/1tE0cY6umj5h5zCY_Jvaou1M8sCfzWMOR/view?usp=drive_link
ivanjermakov 8 months ago
In Russian, "sisi" is a variation of "tits".

Is there a job/service that confirms that branding is appropriate across different languages? Seems like a non-trivial problem to solve.
Jack5500 8 months ago
Isn't CLIP superseded by multimodal LLMs?