Shape Catcher: Find Unicode characters by drawing them

115 points by molenzwiebel about 8 years ago

16 comments

cedex12 about 8 years ago
Similar tool for mathematical symbols: http://detexify.kirelabs.org/classify.html
brudgers about 8 years ago
I could not get it to identify a British Pound symbol after several attempts. The top proposed glyph was much more obscure and the following ones were increasingly obscure from there.

I suspect that the training corpus may have been a table of Unicode glyphs rather than text from the wild.
throwanem about 8 years ago
This is kind of a missing piece, in a lot of ways. With such a large character set as Unicode's, discovery can be a real pain - when you see a novel character, how do you find out what it's called, so you can find out how to type it?

Unless you're using something like Emacs which lets you point at a character and ask the editor to tell you everything it knows about what's there, this kind of identification becomes a daunting task to contemplate. Shapecatcher does an excellent job of it; as long as you can draw something roughly approximating the glyph you have in mind, it'll very effectively winnow down the search space to a very manageable list of possible matches.
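For a character you can already copy and paste, the "what is it called" step can be sketched with Python's standard `unicodedata` module. This is only an illustration of name lookup, not a description of how Shapecatcher or Emacs works; the helper name `describe` is made up for the example.

```python
import unicodedata

def describe(text):
    """Return (code point, name) pairs for each character in text."""
    result = []
    for ch in text:
        # unicodedata.name raises ValueError for unnamed characters
        # unless a default is supplied, so pass a placeholder.
        name = unicodedata.name(ch, "<unnamed>")
        result.append((f"U+{ord(ch):04X}", name))
    return result

# Pound sign, Greek small sigma, Hiragana "no" (written as escapes).
for code_point, name in describe("\u00a3\u03c3\u306e"):
    print(code_point, name)
```

This only helps once the character is in your clipboard; when all you have is a picture or a printed page, a drawing-based search is still the missing piece.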
ardacinar about 8 years ago
The search isn't really perfect. I tried drawing a (pretty good, IMO) Hiragana "no" and that result was in third place (the first was a Latin small m; の looks nothing like an m). Then I tried Greek small sigma (σ), but not perfectly (I draw my sigmas in a weird way, which looks like this: http://imgur.com/a/XYVHO). The top result I got (Malayalam fraction one quarter: ൳) kind of looks like the thing I drew, but the rest of the results don't really resemble it and there's no sigma there.
Silhouette about 8 years ago
Interesting idea. It seems to struggle a bit with some types of characters. For example, drawing a lowercase pi would return many characters with more than two legs, which showed up ahead of pi itself and other characters that do have the two. Does clicking on the good/bad feedback links in cases like this help to train the algorithm in some way?
tyingq about 8 years ago
Really well done, and handled my crappy drawings just fine.

I did see the link to your thesis on CAPTCHAs, but a specific higher-level blog post on how this works would likely be popular.

Edit: One piece of feedback... it's hard to draw dots. You have to drag the cursor with the button down, or drag your finger on mobile, to get a dot. So dots end up more like little lines. Also, an "Undo" to remove the last "cursor down / draw" event would be nice. Starting over for every line is the only current option.
hsivonen about 8 years ago
This is cool, though I was a bit disappointed to notice the part about no support for CJK characters after trying to draw one and not having it recognized. It seems to me that looking up Unihan ideographs is an area where a tool like this could be particularly useful.
m-p-3 about 8 years ago
It reminds me of the special character finder in Google Docs. Very well done.
justinclift about 8 years ago
I use this occasionally when trying to find a new glyph. There are some drawbacks though:

• Last updated in 2012: http://shapecatcher.com/news.html

• No way to draw straight lines except pixel-by-pixel (really tedious). This turned out to be a pain when trying to draw various arrow types (made of straight lines).

I'm hoping the author, Benjamin Milde, picks the project up again and keeps it updated, or makes it Open Source so that someone else can.
hashhar about 8 years ago
This is really great. Works perfectly and solves a very practical problem for me. Unicode really should do something about discoverability though.
eriknstr about 8 years ago
Pretty neat. It would be useful to be able to restrict the blocks that are searched. For example, I might know that the character I'm looking for is Japanese, so if I could tell the tool that, it could restrict itself to Katakana, Katakana Phonetic Extensions, and any other blocks that apply to Japanese specifically.
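The block restriction suggested here could be sketched as a filter over candidate matches by code point range. This is a hypothetical illustration, not Shapecatcher's code: the `JAPANESE_BLOCKS` table and the `restrict` helper are invented for the example, and Hiragana is added as one of the "other blocks". The ranges themselves are the standard Unicode block boundaries.

```python
# Hypothetical table of blocks to search; ranges are the standard
# Unicode block boundaries for these three blocks.
JAPANESE_BLOCKS = {
    "Hiragana": (0x3040, 0x309F),
    "Katakana": (0x30A0, 0x30FF),
    "Katakana Phonetic Extensions": (0x31F0, 0x31FF),
}

def in_blocks(ch, blocks):
    """True if the character's code point falls inside any listed block."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in blocks.values())

def restrict(candidates, blocks):
    """Keep only candidates from the given blocks, preserving rank order."""
    return [ch for ch in candidates if in_blocks(ch, blocks)]

# Example ranking: Latin m, Hiragana no, Greek sigma, Katakana no.
ranked = ["m", "\u306e", "\u03c3", "\u30ce"]
print(restrict(ranked, JAPANESE_BLOCKS))
```

Filtering after ranking keeps the recognizer untouched; a real implementation might instead limit the set of glyphs it compares against in the first place.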
trhaynes about 8 years ago
Love this tool! (Btw title should be one word: "Shapecatcher")
nfriedly about 8 years ago
Android Wear does something like this for emojis - I've gotten pretty good at drawing a "thumbs up" to respond to text messages and such.
tigerBL00D about 8 years ago
Pretty cool! I wonder why the recognizer is not very good at differentiating among types of faces (sad face, happy face, etc.)
hughes about 8 years ago
𝗇ìϲе homoglyph search tool you've made :)

(it found all the letters of the word "nice" quite well!)
runnr_az about 8 years ago
That's really fun!