Unicode Text Converter

908 points by sysk over 10 years ago

76 comments

Systemic33 over 10 years ago
Well, that definitely takes the 𝕡𝕣𝕚𝕫𝕖 for most noticeable Hacker News submission.

Suggestion (if you are the author): a lot of characters commonly used on the web look like other characters, so I think more advanced versions could be made. I think I read that a lot of Thai and Cyrillic characters look like Latin ones.
GregBuchholz over 10 years ago
              ⎧1                 if n = 0;
      F(n) ≡  ⎨1                 if n = 1;
              ⎩F(n-1) + F(n-2)   if n > 1.

      ⎛ ∇∙D⃑ = ρ          ⎞
      ⎜ ∇∙B⃑ = 0          ⎟
      ⎜ ∇×E⃑ = -∂B⃑/∂t     ⎟
      ⎝ ∇×H⃑ = J⃑ + ∂D⃑/∂t ⎠

           ⌠¹
      π = 2⎮ √1̅̅-̅̅x̅̅²̅̅ dx
           ⌡₋₁

      ⎡1 0 1⎤ ⎡î⎤
      ⎢0 1 0⎥ ⎢ĵ⎥
      ⎣1 0 1⎦ ⎣k̂⎦

      Γ ⊢ t:S   S<:T
      ―――――――――――――――  (T-Sub)
      Γ ⊢ t:T

                ⎛    1 ⎞ⁿ
      ℯ = lim   ⎜1+  ― ⎟
          ⁿ→∞   ⎝    n ⎠
emillon over 10 years ago
Funny how it triggered a bug in Firefox. When the tab is unfocused, its title in the handle is "𝑼𝒏…", but when it gets the focus it becomes "𝑼<D835>…" (in a square box). The next codepoint is U+1D48F whose UTF-16 BE encoding is d8 35 dc 8f.

I'd say that the truncation algorithm operates on bytes and that it can't make sense of d8 35, but I'm not too sure how to fix that since graphemes can have arbitrary length (right?). Do you have to compute the width in advance?
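On the truncation question, here is a minimal Python sketch, assuming the title is cut at a fixed budget of UTF-16 code units. The helper name `truncate_utf16` is made up for illustration and this is not Firefox's actual code. It confirms the encoding quoted above and cuts only at code point boundaries, so a surrogate pair is never split. It makes no attempt to handle grapheme clusters, so a combining mark could still be separated from its base character.

```python
def truncate_utf16(text: str, max_units: int) -> str:
    """Keep whole code points while staying within max_units UTF-16 code units."""
    kept = []
    used = 0
    for ch in text:
        # Code points above U+FFFF need a surrogate pair (two code units).
        units = 2 if ord(ch) > 0xFFFF else 1
        if used + units > max_units:
            break
        kept.append(ch)
        used += units
    return "".join(kept)


bold_italic_n = "\U0001D48F"
print(bold_italic_n.encode("utf-16-be").hex(" "))  # d8 35 dc 8f

title = "\U0001D47C\U0001D48F"  # the first two characters of the title
# A budget of 3 code units would split the second pair, so the helper backs off.
print(len(truncate_utf16(title, 3)))  # 1
```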
gus_massa over 10 years ago
This is similar to pseudolocalization (þšéûðöļöçåļîžåţîöñ), which adds random accents to English words to test the localization capabilities of a program without requiring knowledge of another language.

An online version: http://www.pseudolocalize.com/

A library: http://code.google.com/p/pseudolocalization-tool/
gojomo over 10 years ago
Hey! I was just thinking about this site, and visited it for the first time in years, after mentioning the old San Francisco ransom-font in another thread.

By randomly mixing these Unicode letter and letterlike characters, you can simulate a cut-and-paste ransom-note. For example, an acquired company could announce changes to its privacy policy:

    wE ℎåve yøuR ρrIvᴀçy ⅈn a ᴡiNdøwleSs ℞oøm, & ℙℓaℕ τø ⅆo µnSρεaKᴀble †hiℕℊs t○ ⅈt
hbbio over 10 years ago
Oh, no!

The cat should have stayed in the box. If this gains too much popularity, HN will read like MySpace back in the day.

And top HN news will be: "A browser plugin that translates Unicode back to ASCII".
robjh over 10 years ago
For others without that specific font or what have you: "Unicode Text Converter"

On my Windows box with Chrome, all I see are empty boxes.
MrBuddyCasino over 10 years ago
This surprises me. What exactly is the point of encoding what are essentially different fonts in Unicode? Isn't that the job of the presentation layer?

(The Fraktur variant is awesome btw, and is apparently in the valid Unicode range for Java...)
mxfh over 10 years ago
Since it wasn't mentioned here earlier, it's worth taking a look at shapecatcher to see what glyphs might resemble Latin letters.

Scribbling something resembling the Latin capital letter A returns, for example, any of these codepoints: A𝘈ΑАÅ𝖠∆ДΔ𝐴𝟺дᎪߡ𝛢Å4𝛥ᴬᐃⵠ𐌀𝘼𝛬Λ△𝟦Ą𝜟𝓐⌓⧍ᗋ🜂Ⲇ🗻🍙ⲇѦᗩᗅ

http://shapecatcher.com/ (https://news.ycombinator.com/item?id=5150107)

Also, the Unicode Consortium has some reports on security:

http://www.unicode.org/reports/tr36/

http://www.unicode.org/reports/tr39/

listing all kinds of spoofing methods you haven't even thought of.
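To make the lookalike problem concrete, a rough Python sketch of flagging words that mix scripts. Assumption, stated loudly: the first word of each character's Unicode name is used as a stand-in for its script. That is a crude proxy and not the confusable-detection mechanism the Unicode reports define. The function names are hypothetical.

```python
import unicodedata


def scripts_in(word: str) -> set[str]:
    """Return a rough set of script names for the letters in word.

    Assumption: the first word of the Unicode character name approximates
    the script (e.g. LATIN, CYRILLIC, GREEK). This is only a heuristic.
    """
    found = set()
    for ch in word:
        if ch.isalpha():
            found.add(unicodedata.name(ch, "UNKNOWN").split()[0])
    return found


def looks_mixed(word: str) -> bool:
    """True when the letters of word appear to come from more than one script."""
    return len(scripts_in(word)) > 1


print(scripts_in("paypal"))        # {'LATIN'}
print(looks_mixed("p\u0430ypal"))  # True: U+0430 is CYRILLIC SMALL LETTER A
```

A heuristic like this will also flag legitimate mixed-script text, so it is better suited to raising a warning than to rejecting input.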
horse_continuum over 10 years ago
One of my friends, moving to China for a semester to teach, was thinking of using a proper Chinese name to make it easier for students to address him. He had a good idea, even, which he shared on Facebook.

I proposed that we should name him after the lack of Unicode support in our browsers, and we ended up calling him "Box Boxbox" for a couple of months.
TorKlingberg over 10 years ago
Does anyone know why there are separate Unicode code points for letters in bold, bold italic and Fraktur? Normally this sort of thing should be handled by different fonts / font variants. Is it for compatibility with some legacy encoding?
jfmercer over 10 years ago
I couldn't help but notice that this converter was copyrighted by Eli the Bearded. Google "Eli the Bearded", but not from work. You'll get some very interesting results.

https://encrypted.google.com/#q=Eli%20the%20Bearded
qeorge over 10 years ago
I was once bilked into buying some scraped content as original work by this method. It passed Copyscape, and my test of Googling a random sentence in quotes didn't bring anything up. I let it go because I had already accepted the work, and the lesson was worth more than the article anyway.

Don't be fooled as I was! Had I manually transcribed a sentence into Google instead of copying and pasting the Unicode chars, I would have found hundreds of copies of the same article.
sthlm over 10 years ago
In JavaScript, many Unicode characters are allowed [0], so háćḱéŕŃéẃś is a valid variable name [1].

Note: The number of іllэБіъlэVаѓіаъlэИамэѕ [2] used in your production code is inversely proportional to the number of friends you'll make in the maintenance team.

[0] https://mathiasbynens.be/notes/javascript-identifiers

[1] https://mothereff.in/js-variables#h%C3%A1%C4%87%E1%B8%B1%C3%A9%C5%95%C5%83%C3%A9%E1%BA%83%C5%9B

[2] http://www.panix.com/~eli/unicode/convert.cgi?text=illegibleVariableNames
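Python is similarly permissive, for what it's worth. A small sketch follows, in Python rather than JavaScript, with names chosen only for illustration. `str.isidentifier()` reports whether a string is a valid identifier, and Python also NFKC-normalizes identifiers while parsing, so a name typed in Fraktur ends up bound under its plain ASCII spelling.

```python
# Accented Latin letters are accepted in identifiers.
print("háćḱéŕŃéẃś".isidentifier())  # True

# Identifiers are NFKC-normalized by the parser, so the Fraktur spelling
# below binds the plain name "foo".
namespace = {}
exec("𝖋𝖔𝖔 = 1", namespace)
print(namespace["foo"])  # 1
```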
edgarallenbro over 10 years ago
This is great, but why is the Australian translation called 'upside down pseudoalphabet'?
cgranier over 10 years ago
What I need is something that takes all the extended characters (think Spanish or Swedish) and turns them into alternative safe versions.

For instance, á into a, ñ into n, å into a, etc.

Had my hopes up when I saw the title.

Does anyone have any ideas or links to working scripts that I can turn into something useful? I need to "sanitize" a database of foreign documentaries before uploading to YouTube (their metadata input system chokes on extended chars). Thanks!
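One common approach, sketched here in Python with only the standard library: decompose each character with NFD, drop the combining marks, and recompose. The function name `strip_accents` is just for illustration. Letters that have no decomposition (ø, ß, æ and similar) pass through unchanged, so they would need an explicit mapping on top of this.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Remove combining marks after canonical decomposition."""
    decomposed = unicodedata.normalize("NFD", text)
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return unicodedata.normalize("NFC", without_marks)


print(strip_accents("áñå"))                    # ana
print(strip_accents("Dokumentär på español"))  # Dokumentar pa espanol
```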
pud over 10 years ago
I made an iPhone app that does kind of the same thing, but converts letters to their upside-down Unicode equivalent. It's fun for sending upside-down texts.

Free and ad-free, just a fun project:

https://itunes.apple.com/us/app/texting-upside-down-free/id435354073?mt=8
kcorbitt over 10 years ago
Just a PSA for discoverability: since the replacement characters use different code points than their more standard equivalents, the default HN search (https://hn.algolia.com) at least doesn't find this submission when searching for "unicode."
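A sketch of how an indexer could fold such titles before matching, assuming NFKC compatibility normalization followed by case folding. This says nothing about how Algolia actually works, and `search_key` is a made-up name. The mathematical alphanumeric styles carry compatibility mappings back to plain letters. Styles without such a mapping, like the negative squared letters, pass through unchanged.

```python
import unicodedata


def search_key(text: str) -> str:
    """Fold compatibility variants and case so styled text matches plain queries."""
    return unicodedata.normalize("NFKC", text).casefold()


title = "𝑼𝒏𝒊𝒄𝒐𝒅𝒆 𝑻𝒆𝒙𝒕 𝑪𝒐𝒏𝒗𝒆𝒓𝒕𝒆𝒓"
print(search_key(title))               # unicode text converter
print("unicode" in search_key(title))  # True
```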
lazyjones over 10 years ago
Great, now we'll have to rely on IDEs with clickable drop-down lists of variables and function names because simple text input just got a lot harder for languages where Unicode is allowed for symbols!

http://play.golang.org/p/2zYfCx_J-O
Immortalin over 10 years ago
On iOS 8.1 Safari all I see is a bunch of squares ;(
petecooper over 10 years ago
My iOS/Safari shows squares in the page itself, but a row of boxed aliens in the `Bookmarks and History` list:

http://imgur.com/l98p9oN

(image is safe for work, though other stuff on imgur.com is likely not)
tezza over 10 years ago
🆃🅷🅴🆁🅴 🅶🅾🅴🆂 🆁🅴🅰🅳🅰🅱🅸🅻🅸🆃🆈, 🆂🅴🅰🆁🅲🅷🅰🅱🅸🅻🅸🆃🆈
grimgrin over 10 years ago
My friend made a similar tool that you may enjoy:

http://antglove.com/erger
rossy over 10 years ago
I wish this worked on Windows/Chrome, or I knew why it didn't work so I could star the issue on their bug tracker.
gojomo over 10 years ago
Interesting; the title displayed OK minutes ago, on the main page, in Firefox/OSX. But now it's showing as unsupported-glyph boxes inside the page... but still looks OK in the titlebar of the item (comments) page.

Did some automated or administrative process mutate the characters? Or is this just Firefox drifting, in choice of font?
hesselink over 10 years ago
Strangely, for me on Firefox 33.1 on OS X, the title shows up fine on the main page. But when I click through to the comment, I get boxes only, and from then on, the main page also doesn't work anymore until I restart Firefox. I suspect an extension, but I'm not sure.
spindritf over 10 years ago
Also, strike-through. That's the one I find genuinely useful, because I like the suggestive way of saying s̶o̶m̶e̶t̶h̶i̶n̶g̶ and then visibly correcting it to something else.

http://adamvarga.com/strike/
guardian5x over 10 years ago
I only saw boxes in the title with Chrome 38. Tried out IE10 and it works just fine.
geekam over 10 years ago
This fails to show up on my iPhone 5S Safari and I thought it supported Unicode.
ck2 over 10 years ago
Note that XP cannot show

    Negative Circled
    Squared
    Negative Squared
    Double-struck
    Bold
    Bold italic
    Bold script
    Fraktur

At least not with the fonts I have.
huuu over 10 years ago
𝕯𝖔𝖊𝖘 𝖆𝖓𝖞𝖔𝖓𝖊 𝖐𝖓𝖔𝖜 𝖜𝖍𝖞 𝖙𝖍𝖊 𝖑𝖎𝖓𝖊 𝖍𝖊𝖎𝖌𝖍𝖙 𝖔𝖋 𝖙𝖍𝖊𝖘𝖊 𝖈𝖍𝖆𝖗𝖆𝖈𝖙𝖊𝖗𝖘 𝖎𝖘 𝖘𝖔 𝖍𝖎𝖌𝖍?
sanxiyn over 10 years ago
https://twitter.com/benbjohnson/status/533848879423578112
sovok over 10 years ago
Very cool. Although the upside-down text doesn't work with ümlauts and numbers. A reverse function would also be nice.

I wrote a similar tool that does this (http://lunicode.com). It's on GitHub if you want to use the code: https://github.com/combatwombat/Lunicode.js
cturner over 10 years ago
Different problem, but someone who knows about Unicode will probably know this -

When I paste from Microsoft documents into PuTTY, characters will often be transformed to weird versions. Example: an em dash is a different character from '-'. It comes through as a weird tilde character instead of a dash. Mmm. Frustrating.

Is there a robust program you can run on PuTTY to catch such text and flatten it to ASCII?
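Not a robust program, but a minimal Python sketch of the flattening step, assuming a small hand-picked table of the characters word processors tend to substitute. The table and the name `flatten_to_ascii` are illustrative, and real documents would need more entries.

```python
# Assumed mapping: a few typographic characters and their ASCII stand-ins.
ASCII_FALLBACKS = str.maketrans({
    "\u2013": "-",    # en dash
    "\u2014": "-",    # em dash
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2026": "...",  # horizontal ellipsis
    "\u00a0": " ",    # no-break space
})


def flatten_to_ascii(text: str) -> str:
    """Replace the mapped typographic characters and leave everything else alone."""
    return text.translate(ASCII_FALLBACKS)


print(flatten_to_ascii("a\u2014b \u201cquoted\u201d\u2026"))  # a-b "quoted"...
```

Characters outside the table are left untouched, so anything unexpected stays visible instead of being silently dropped.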
netheril96 over 10 years ago
𝕿𝖍𝖎𝖘 𝖋𝖊𝖊𝖑𝖘 𝖑𝖎𝖐𝖊 𝖙𝖊𝖗𝖗𝖎𝖇𝖑𝖊 𝖍𝖆𝖈𝖐 𝖇𝖚𝖙 𝕴 𝖑𝖎𝖐𝖊 𝖎𝖙. 𝕹𝖔𝖜 𝕴 𝖈𝖆𝖓 𝖚𝖘𝖊 𝖆𝖑𝖑 𝖐𝖎𝖓𝖉𝖘 𝖔𝖋 𝖋𝖆𝖓𝖈𝖞 𝖋𝖔𝖗𝖒𝖆𝖙𝖙𝖎𝖓𝖌 𝖔𝖓 𝖙𝖍𝖔𝖘𝖊 𝖘𝖎𝖙𝖊𝖘 𝖙𝖍𝖆𝖙 𝖉𝖔𝖊𝖘𝖓'𝖙 𝖘𝖚𝖕𝖕𝖔𝖗𝖙 𝖋𝖔𝖗𝖒𝖆𝖙𝖙𝖎𝖓𝖌.
anjbe over 10 years ago
I’ve never been a fan of this sort of thing. The Unicode characters in these font blocks are not letters for making words; at least the double‐struck, fraktur, bold, italic, and bold italics are semantically for use in mathematical equations.<p>This can have some strange effects if you try to use them like letters. Example: What’s the lowercase transform of 𝑼? 𝑼! Not 𝒖.
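The lowercase example is easy to check. A short Python sketch using only the standard library: the mathematical letters have no case mappings of their own, so `lower()` leaves them alone, while compatibility normalization (NFKC) first maps them back to plain letters that do.

```python
import unicodedata

bold_italic_u = "\U0001D47C"  # 𝑼, MATHEMATICAL BOLD ITALIC CAPITAL U

# No lowercase mapping is defined, so the character comes back unchanged.
print(bold_italic_u.lower() == bold_italic_u)  # True

# After NFKC the character is a plain "U", which lowercases as usual.
print(unicodedata.normalize("NFKC", bold_italic_u).lower())  # u
```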
petercooper over 10 years ago
If you like this sort of thing, you might like this piece I wrote some time back about writing a Ruby script using whitespace for all identifiers: http://www.rubyinside.com/the-split-is-not-enough-whitespace-shenigans-for-rubyists-5980.html
edent over 10 years ago
This is the w̶o̶r̶s̶t̶ b̲e̲s̲t̲ use of Unicode!
hliyan over 10 years ago
Impressive! Hopefully, this won't end with HN sanitizing everything except Latin + Latin Extended from submissions.
NoMoreNicksLeft over 10 years ago
I don't really speak/read Russian, but I have a passable understanding of Cyrillic, and those always look dumb. It doesn't look like "the" to me, it looks like "guh-buh-yeh" or something.

Same thing with the Borat DVD cover.
calineczka over 10 years ago
Finally a way to express myself on Facebook properly ;) I wonder if bold text would lead to better conversion from ads using this trick. And I wonder when Facebook is going to ban this, because obviously it works :)
dsjoerg over 10 years ago
ᴅᴏᴇꜱ ᴀɴyᴏɴᴇ ᴋɴᴏᴡ ɪꜰ ᴩᴏᴩᴜʟᴀʀ ꜱᴇᴀʀᴄʜ ᴇɴɢɪɴᴇꜱ ᴅᴇ-ᴜɴɪᴄᴏᴅᴇ ᴛᴇxᴛ ᴡʜᴇɴ ɪɴᴅᴇxɪɴɢ?
grayclhn over 10 years ago
I look forward to a Hacker News front page that looks like a ransom note.
arikrak over 10 years ago
See https://news.ycombinator.com/item?id=7383672 though they changed my title to normal text.
codemonkeymike over 10 years ago
Continued use of this would be a good way of making me not use HN.
DanBC over 10 years ago
Chrome on iOS is giving me the character unavailable boxes. Normally I'd just change the font but I can't do that here.

This doesn't feel like the future.
rplnt over 10 years ago
Does not really work for characters like úôä. Not sure whether there is nothing similar in those "styles" or they were just ignored.
parasj over 10 years ago
𝓖𝓻𝓮𝓪𝓽 𝓯𝓸𝓻 𝓹𝓪𝓼𝓼𝔀𝓸𝓻𝓭𝓼
seba_dos1 over 10 years ago
𝕴 𝖋𝖊𝖊𝖑 𝖆 𝖓𝖊𝖜 𝖛𝖎𝖗𝖆𝖑 𝖙𝖍𝖎𝖓𝖌 𝖔𝖓 𝖘𝖔𝖈𝖎𝖆𝖑 𝖒𝖊𝖉𝖎𝖆 𝖈𝖔𝖒𝖎𝖓𝖌.
seqizz over 10 years ago
I feel nice.. http://i.imgur.com/lbvRWwm.png
darkstalker over 10 years ago
I've used this page for a long time. Writing stuff in fullwidth Unicode for sure makes it look funnier.
getdavidhiggins over 10 years ago
https://www.unicod.es/
jrometty over 10 years ago
It should be mentioned that this returns a blank title on the Android app.
cm2012 over 10 years ago
On my Android all the Unicode characters (including the title) are blank.
tempodox over 10 years ago
It works :)

𝑼𝒏𝒊𝒄𝒐𝒅𝒆 𝑻𝒆𝒙𝒕 𝑪𝒐𝒏𝒗𝒆𝒓𝒕𝒆𝒓

comes in a fancy bold italic font in my HN list. I love this hack.
Flott over 10 years ago
This is not good news if it bypasses the spam filters! Does it?
sjwright over 10 years ago
The question I have is, what's the easiest way to strip this 🅹🆄🅽🅺 out of Unicode strings submitted by web users? With a nod to Cunningham's Law, surely the right answer is a regular expression?
gpvos over 10 years ago
I do feel that Unicode is slowly jumping the shark.
aruggirello over 10 years ago
!ꙅᴙɘTliꟻ mAqꙅ ꟻo Tɘꙅ wɘᴎ ɘloHw A ꙅbɘɘᴎ ꙅiHT ,oᴎ HO
edem over 10 years ago
Can you do z̝̗a͈̣̳͓l͏g̱̭͖̜̙o̢̦̫̯ as well?
JulianMorrison over 10 years ago
𝕸𝖊𝖎𝖓 𝕷𝖚𝖋𝖙𝖐𝖎𝖘𝖘𝖊𝖓𝖋𝖆𝖍𝖗𝖟𝖊𝖚𝖌 𝖎𝖘𝖙 𝖛𝖔𝖑𝖑𝖊𝖗 𝕬𝖆𝖑𝖊.
getdavidhiggins over 10 years ago
thiѕ iѕ gréät, ƅüt ìt's cl߀sèԁ sòùrcè!!!

üníto߀ɭs ìѕ ϻùcհ ƃettër!!

https://www.unicod.es/
noobermin over 10 years ago
Quite a way to make the point.
shaurz over 10 years ago
What is the point of having different codepoints for FONTS in Unicode? What a load of nonsense.
ryanjmo over 10 years ago
เ ђคשє Շ๏ Շгץ Շђเร ๏ยՇ.
vjvj over 10 years ago
🆃🅷🅴 🆂🅴🅲🆁🅴🆃 🅸🆂 🅾🆄🆃.
sakri over 10 years ago
fun for passwords
tmmm over 10 years ago
How does it work?
tibbon over 10 years ago
It appears to work on Facebook and Twitter.

inception
fiatjaf over 10 years ago
.ǝɔıu ɹǝdns sɐʍ sıɥʇ
yAnonymous over 10 years ago
𝓘 𝔀𝓸𝓷𝓭𝓮𝓻 𝓱𝓸𝔀 𝔀𝓮𝓵𝓵 𝓝𝓢𝓐 𝓓𝓟𝓘 𝓼𝔂𝓼𝓽𝓮𝓶𝓼 𝓱𝓪𝓷𝓭𝓵𝓮 𝓽𝓱𝓲𝓼.
jackmaney over 10 years ago
I'd like to buy a vowel, please. Let's go with "e".
kalops over 10 years ago
teh cancer that is HN. predicting next post someone shows off rageflipping text
PSeitz over 10 years ago
𝓚𝓐𝓦𝓐𝓘
Kiro over 10 years ago
Twitch chat will love this.
Houshalter over 10 years ago
☐☐☐☐☐☐ ☐☐☐ ☐☐☐☐☐☐ ☐☐☐☐ ☐☐☐☐☐☐☐☐