Building an Internet Scale Meme Search Engine

785 点作者 whoisburbansky超过 2 年前

66 条评论

This title really undersells the absolute insanity of the described solution. This is a beautiful example of "if it's stupid, but it works, it's not stupid." The justification is very convincing.One thing I'm curious about: how did you build your corpus of meme images and videos?

评论 #34336889 未加载

评论 #34335977 未加载

评论 #34336743 未加载

merpkz超过 2 年前

Nice project, I wanted to build meme search engine myself one day, but figured Tesseract will fail at most of the memes because of how stylized those have become. So I tuned down my meme source to only /r/bertstrips as those contain sane looking text and it's working quite alright - project has no frontend yet, I search from cli and click links.> Initial testing with the Postgres Full Text Search indexing functionality proved unusably slow at the scale of anything over a million images, even when allocated the appropriate hardware resources.I can guarantee you that correctly setup PostgreSQL text search will be faster than ES with much, much less hardware resources needed, it's just a matter of correctly creating tsvector column and creating GIN index on it (and ofc asking right queries so it's actually used). I can help you out setting postgres schema up and debugging queries if you are interested, for testing purposes at least.

评论 #34336339 未加载

评论 #34338569 未加载

CiceroCiceronis超过 2 年前

This is really brilliant to see, and I've been surprised for quite a long time that nothing similar exists. I think it's a real shame that few people with interest in memes have interest in building solutions like this that help us engage with them.People in the 21st century know a lot about the mistakes of the past century that led to much popular culture of the time being lost (especially terminally online people who've watched lots of Youtube documentaries about lost Dr. Who episodes and so on), so it surprises me how little we try and avoid those same mistakes with today's ephemeral pop culture in the form of memes. People like yourself who want to help make the internet's huge corpus of memes tractable are part of the solution in terms of meme archival and cultural memory.(There's a good meme metadiscussion group on Discord, "The Philosopher's Meme," which you might be interested in joining. People there would be very keen to discuss what you've made.)

评论 #34347084 未加载

评论 #34337416 未加载

iamflimflam1超过 2 年前

Love the hackiness of this - however, the vision framework is available on Desktop macs as well - <a href="https://developer.apple.com/documentation/vision" rel="nofollow">https://developer.apple.com/documentation/vision</a>and specifically:<a href="https://developer.apple.com/documentation/vision/vnrecognizetextrequest" rel="nofollow">https://developer.apple.com/documentation/vision/vnrecognize...</a>

评论 #34338217 未加载

评论 #34344537 未加载

评论 #34339198 未加载

yakubin超过 2 年前

The photo under Upgrading the iPhone OCR Service Into An OCR Cluster. In the future, data centres are going to host racks of iPhones.

评论 #34343333 未加载

评论 #34336362 未加载

评论 #34336297 未加载

评论 #34336901 未加载

aabajian超过 2 年前

I had a friend in med school who wrote a very early note-taking app for the iPad. Turns out that there was no way to render PowerPoint files when the iPad first came out. He realized that the iOS/Mac OS "quick preview" function could be used to take screenshots of each PowerPoint slide. For a brief time, his was the only app that could display PowerPoints (albeit, they were just screenshots!). There's a lot of hidden utility in Apple libraries.

ksdme9超过 2 年前

Love the inventiveness.My question is about the image distribution costs. All the memes on the site seem to be coming straight off an object storage, all that bandwidth consumption has got to add up(?). Some sort of a CDN might help depending on the search patterns.

memeatlas超过 2 年前

Although not as elegant a solution as this I've also tried my hand as well at indexing and categorizing memes. I wanted to save a very specific type of meme though since there are, in my mind, 2 main categories of memes. The first category are what I call "story" memes, they are standalone and typically what you see being shared on Facebook. They usually have text and are able to tell a story on their own with no additional context and can be presented as a single post, story, etc, (think 4 panel comics). The second type are reaction memes. These are used to respond to people and usually convey a feeling towards a post or tweet. They can also be standalone so they should probably be considered a subset of the "story" memes. I've gravitated towards the reaction memes as I see more utility in them and can be used in a more universal way. My site if anyone is interested (its still a work in progress):<a href="https://www.memeatlas.com" rel="nofollow">https://www.memeatlas.com</a>

评论 #34344000 未加载

评论 #34450115 未加载

lormayna超过 2 年前

If you don't need advanced search features, you can use Sonic (<a href="https://github.com/valeriansaliou/sonic">https://github.com/valeriansaliou/sonic</a>). It's blazing fast and you can save lot of money on servers.

评论 #34353120 未加载

philsnow超过 2 年前

I sat down literally last night and started sketching out the scratch-my-own-itch solution to more or less exactly this problem, because I too have meme-aphasia where I know there exists a meme that fits perfectly in a conversation, but I have about 5 seconds to find it before the moment passes.I'm so, so glad to see that I'm not the only person in the world with the same "problem". Well done, mandatory.edit: holy crap you even index videos, nice

csande17超过 2 年前

I wonder how the performance of Vision.framework on desktop Mac hardware compares to a cluster of phones. (The author mentions that it was "fairly slow", but it sounds like they were running an iOS app in the simulator and not a macOS app.)

评论 #34337053 未加载

评论 #34336344 未加载

Fabricio20超过 2 年前

Does the Vision API call back to apple servers in any fashion? Like how google on-device voice recognition APIs will call back to Google when you are online (unless you explicitly pass flags to force it in offline mode).If so, is there any risk in getting your account suspended or ip range banned somehow because of this, for example?

评论 #34339844 未加载

tomw1808超过 2 年前

Absolutely amazing on the tech side!!!Now, after reading the article, I gave your search engine a try. I was looking for that futurama its a trap meme (pretty much pops up on any image search here <a href="https://www.google.com/search?q=futurama+its+a+trap" rel="nofollow">https://www.google.com/search?q=futurama+its+a+trap</a>)The problem is, the search engine you built is now very text-heavy, which seems to be usually very unconnected to the actual meme. So, searching for "its a trap" did not yield the results I was actually hoping for, but made total sense looking at how the search was implemented.Are you planning to implement an actual tagging of the content of some sorts? Maybe a clustering of similar objects (like iphone clusters similar peoples faces in the gallery) and then tag those clusters with keywords somehow?

评论 #34340239 未加载

mseidl超过 2 年前

Do you use mongodb to make it web scale? You turn it on and it scales right up.

评论 #34339474 未加载

Arbortheus超过 2 年前

This is great, I particularly like the part about using compute from old unwanted iPhones. Quite an inventive way to reuse/recycle otherwise obsolete hardware!

permo-w超过 2 年前

I have absolutely no experience in this area and I'm curious:is there really no open-source text recognition software that's on-par with or close to Apple's (presumably proprietary) implementation? the article mentions Tesseract. is that the current best open-source option?

评论 #34357952 未加载

MrGilbert超过 2 年前

This is remarkable. I'd love to see that combined with some kind of sentiment analysis like Microsoft offers, just to see if something useful comes out of it.Sometimes, I don’t know the exact words when looking for a meme, but once I see it, I know that’s the one.

评论 #34336073 未加载

评论 #34347267 未加载

oefrha超过 2 年前

> My preliminary speed tests were fairly slow on my Macbook. However, once I deployed the app to an actual iPhone the speed of OCR was extremely promising (possibly due to the Vision framework using the GPU). I was then able to perform extremely accurate OCR on thousands of images in no time at all, even on the budget iPhone models like the 2nd gen SE.I suppose that’s an old Intel MacBook? I’d be very surprised if the Vision framework performs better on a 2nd gen iPhone SE than even the first M1 MacBook Air.

评论 #34338048 未加载

komali2超过 2 年前

Would love to see that load balancer implementation, as I'm a scrub and this project fascinates me.

评论 #34335972 未加载

andai超过 2 年前

I have a "hackish but works for me" meme database: I use my Telegram "self chat" to send memes I like to myself, and I tag them with the kind of words I'm likely to search for when looking for them later.Works great for me.It's kind of like trying to come up with a good Google search phrase, based on how other people must have phrased something, but relying on knowledge of how you phrase things instead.

评论 #34340186 未加载

lysecret超过 2 年前

Wait what this is absolutely brilliant. Actually insane it works so well using a stack of iPhones as an ocr server. My deepest respect.

Thorentis超过 2 年前

IaaS - iPhone as a Service, coming soon to AWS.

marginalia_nu超过 2 年前

Now this is the sort of disgusting pile of jank I love to see.

JustARandomGuy超过 2 年前

Very inventive. Admittedly when I read the first few paragraphs, I was thinking “he’s got to have $40K of iPhones doing image processing” but you made a good point about being able to use iPhones with screen and other damage.What was your average price per iPhone, if you don’t mind disclosing?

ekns超过 2 年前

Last time I looked into OCR stuff I came to a similar conclusion (though I didn't implement anything back thne). It would be really nice to have "open source" models that had similar performance, without having to deal with the iphone cluster hackery.

nysv超过 2 年前

If only there was a way to filter out ifunny results, I absolutely detest that watermark.

Freak_NL超过 2 年前

> Better yet, I don’t even want to use them as phones, so even iPhones that are IMEI banned or are locked to unpopular networks are perfectly fine for my use.Fences worldwide will be overjoyed to hear of this novel application.

suave_dude超过 2 年前

I have a question what do you guys think is the best back end for a video search engine app?

dirtyid超过 2 年前

Outrageous effort! So far japanese, mandarin returning results as well.Do you have list of sources where memes are ingested from?Would be nice to have some option to explore memes by category.

looki超过 2 年前

Very cool project! I'll try to remember it the next time I'm looking for a specific image. I noticed that repeated appearances of the search term are ranked higher, which isn't necessarily productive. Also, some kind of duplicate detection would be nice. Searching for "SpongeBob" yields many copies of the same images that mentions "SpongeBob" several times.

baradhiren07超过 2 年前

<a href="https://findthatmeme.com/?search=take+my+money" rel="nofollow">https://findthatmeme.com/?search=take+my+money</a>

petesergeant超过 2 年前

I was hoping this would help me find the Database Iceberg meme that shows different levels of database insanity. It didn’t. Anyone have a link?

评论 #34336616 未加载

2Gkashmiri超过 2 年前

Are you going to open source the "app" part of it ?I would love to replicate this setup for my own project....I am thinking, load balanced, multi location redundant "iOS machines" with 3-4 phones in with power backup and internet dongle.We could use something like zerotier/tailscale to get internet access from outside your local network

ipsum2超过 2 年前

This is amazing. Out of curiosity, why not try deep learning OCR software instead of Tesseract? PaddleOCR is popular.

评论 #34336426 未加载

solarkraft超过 2 年前

That's a fun way to do OCR. Next up: Classifying memes by subjects and themes to build something like KnowYourMeme's gallery, but for every meme.Bonus: Index from a lot of sources to help track a meme's origin.This type of thing is on my long list of "can somebody else please do this already".

schappim超过 2 年前

Pretty insane. If you don’t want to use iPhones, I made macOCR a while back. It uses the same vision APIs, with a very simple CLI interface. See: <a href="https://github.com/schappim/macOCR">https://github.com/schappim/macOCR</a>

petercooper超过 2 年前

You can do it on macOS as well, it has the same API for fast high quality OCR. I used it to create an OCR system to detect secrets or credentials in screencasts: <a href="https://github.com/peterc/videocr">https://github.com/peterc/videocr</a>

kgbcia超过 2 年前

That's genius. I realized the cost advantage of text to speech on an old android versus Google cloud

sneak超过 2 年前

Don't you have to re-sign and re-deploy tour iOS app every 7 days to keep it running on the iPhones?

paulmd超过 2 年前

Is there a pgsync equivalent for Oracle? Spent some time building replication from a source-of-truth to a search engine at a previous job.Wish we could have used postgres but the tools were dictated rather than letting the requirements drive the tooling.

spuz超过 2 年前

I'm curious how well the iPhone OCR actually works. How do you deal with errors? Is the error rate low enough that you can accept the output from the iPhone OCR as is or do you also run it through a cleaning process (e.g. spell check)?

评论 #34336697 未加载

yreg超过 2 年前

This is absolutely brilliant.I believe I will actually use it a lot if you keep the site up.Minor feedback for the blog post: It deserves a better meta description (for link previews). The first paragraph doesn't advertise how good the article is going to be.

causality0超过 2 年前

I tried a few memes. The results were quite poor, and vastly inferior to just using Google. In the case of text searches I had to scroll through dozens of results before finding the original meme images.

nowahe超过 2 年前

Out of curiosity, how does your Image Similarity Search works ? Are you also using some feature of Apple's Vision framework, or running some ML model on your linode instance ?

评论 #34340194 未加载

Liquidor超过 2 年前

Brilliant! :-)Maybe a dumb question, but could you use your data to train a new OCR model so you wouldn't have to rely on iOS?I don't know much about ML/AI so maybe not feasible.

gerdesj超过 2 年前

"It looked like it was time to bite the bullet and write an OCR iOS server in Swift."Quite a large bullet required, one with plenty of chewing left in it.

joshu超过 2 年前

Heh, this finds a bunch of copies of a video I made. If you are going to cache them and repost them, you probably need to have a DMCA process.

kome超过 2 年前

you are a genius. also, the Search Engine works so well

surume超过 2 年前

Thank you for building this! It's so much fun. I looked for memes that I saw years ago and found them in seconds. Excellent work!!

Scaevolus超过 2 年前

Looks like you've solved the OCR problem, now to solve the duplication problem and use it as a ranking hint. :-)

1wsk超过 2 年前

Could you not extract the model and run it on a server? Its probably not as easy but i know it has been done with NeuralHash

nisegami超过 2 年前

Is this the person on /r/hardwareswap who's been looking for semi-functional, even IMEI banned, iPhones?

the_arun超过 2 年前

Nice hack. But doesn't Google/(any public search engine) image search do this for us already?

kevmo314超过 2 年前

Could you expose your iPhone cluster as an OCR API? Seems like it would be competitive with the GCP API.

评论 #34336578 未加载

francis-io超过 2 年前

Would be great if the images had a unique name so I could save them without having to rename them.

mateuszbuda超过 2 年前

Great project! With your DIY attitude, if you need to build your own infrastructure for web scraping, here is a tutorial for mobile proxy setup which might be helpful: <a href="https://scrapingfish.com/blog/byo-mobile-proxy-for-web-scraping" rel="nofollow">https://scrapingfish.com/blog/byo-mobile-proxy-for-web-scrap...</a>

9dev超过 2 年前

This is the single bestest thing I have read in a long while. Absolute madness. Pure bliss.

fuzzygroup超过 2 年前

This is utterly fantastic and you are to be commended for your Crazy Mad Scientist genius!

julianeon超过 2 年前

This makes me wonder what other cool things I can do with an old iPhone.

Tade0超过 2 年前

I wonder how it fares against deep-fried "E" or Opossum memes?

skizm超过 2 年前

Where did the original set of meme images, gifs, and videos come from?

coayer超过 2 年前

I love the iPhone cluster so much!

danbruder超过 2 年前

this is very clever. I wonder what other use cases could leverage this approach

ariehkovler超过 2 年前

This is mad and I love it.

geek_at超过 2 年前

really amazing! I love the solution and the project in general

transitivebs超过 2 年前

My #1 recommendation for anyone thinking about the convoluted OCR solution: use a cheap OCR API and save yourself months of time / hassle / upkeep. Google's OCR API is a good place to start, but AWS has one too and dozens of others out there.

评论 #34335985 未加载

评论 #34336359 未加载

评论 #34336354 未加载

评论 #34336102 未加载