DuckDuckGo \u202E

348 pointsby zeepzeepover 3 years ago

27 comments

lucideerover 3 years ago

Everyone here is asking if this is an "intentional easter-egg" or an "accidental bug"But what about accidentally working-as-intended?Sure it's a little trickier to read, but it's certainly not a "bug" that will cause any damage / danger / instability / etc.

评论 #30353833 未加载

评论 #30353440 未加载

评论 #30358346 未加载

评论 #30357879 未加载

评论 #30358811 未加载

评论 #30355216 未加载

评论 #30357015 未加载

评论 #30392765 未加载

echelonover 3 years ago

You still have to be mindful of \u202e in anything new that you're writing, but browsers do a much better job of not having it bleed across elements like they did back in the 2000s.Back in the era of forums that didn't support unicode correctly (2005ish?), it was trollish fun to post messages containing \u202E and watch the UI and all subsequent messages and elements get messed up. (One stray \u202E would flip the entire page contents following it.) I never took it to a level of abuse since it was easy to remove and then ban offenders, but it was fun in a one-off thread, and it always had great reactions.I patched my own software to handle it, but I don't recall anyone really abusing it in a widespread manner. (Contrast this with the era of prolific and widely abused AOL/AIM exploits that would kill your IM client with malformed messages.)IIRC, a bunch of messaging clients also didn't (or still don't) handle \u202e termination and it sometimes bled into new messages and even the text input box. That was pretty horrible and unfixable without restarting.Obligatory XKCD: <a href="https://xkcd.com/1137/" rel="nofollow">https://xkcd.com/1137/</a>Some shenanigans in the wild:<a href="https://www.reddit.com/r/Unicode/comments/hc1rxi/i_put_a_right_to_left_override_character_in_my/" rel="nofollow">https://www.reddit.com/r/Unicode/comments/hc1rxi/i_put_a_rig...</a><a href="https://twitter.com/mkolsek/status/1237123571341803522" rel="nofollow">https://twitter.com/mkolsek/status/1237123571341803522</a>(These are way tamer than the effects used to be.)(Also, HN filters it out. I tried to have some fun. :P)

评论 #30356582 未加载

benbristowover 3 years ago

Reversed: U+202E RIGHT-TO-LEFT OVERRIDE, decimal: 8238, HTML: No visual representation, UTF-8: 0xE2 0x80 0xAE, block: General Punctuation

jfk13over 3 years ago

Similarly, if I try <a href="https://www.google.com/search?q=u202e" rel="nofollow">https://www.google.com/search?q=u202e</a>, the second result I currently get (YMMV) is from <a href="https://unicode-table.com/" rel="nofollow">https://unicode-table.com/</a>, and almost the entire snippet shows up backwards in the search results.

评论 #30355196 未加载

评论 #30354125 未加载

评论 #30372659 未加载

Jerrrryover 3 years ago

Stacking combining diacritics[1] is also fun, to make extremely tall text.Also fun is enumerating all the characters in the Private Character section[2] to see what UI symbols are able to be inserted into unintended places.[1] <a href="https://www.unicode.org/charts/PDF/U0300.pdf" rel="nofollow">https://www.unicode.org/charts/PDF/U0300.pdf</a>[2] <a href="http://www.unicode.org/faq/private_use.html" rel="nofollow">http://www.unicode.org/faq/private_use.html</a> <a href="https://www.unicode.org/charts/PDF/UE000.pdf" rel="nofollow">https://www.unicode.org/charts/PDF/UE000.pdf</a>

评论 #30359076 未加载

评论 #30356653 未加载

hnlmorgover 3 years ago

If there was ever a clear signal that working with Unicode is incredibly hard, it would be the fact that no one on HN can decide if this is accidental or intentional.

评论 #30354049 未加载

评论 #30353353 未加载

评论 #30352859 未加载

评论 #30355923 未加载

评论 #30353711 未加载

评论 #30352802 未加载

评论 #30352868 未加载

评论 #30353423 未加载

stubishover 3 years ago

Our programming languages might need a unicode aware string concatenation operator, similar to locale aware capitalization. Joining LTR text to RTL text seems like it should result in combined LTR + RTL text, not letting the LTR marker override and change meaning.

评论 #30371851 未加载

bncyover 3 years ago

Umm, there's a little info button to the right that says that this 'quick' answer was proposed by DuckDuckHack community author.

gunapologist99over 3 years ago

Are there any lists of unicode characters (like the OWASP one) that should be blacklisted from most apps (not just for XSS, but even for desktop apps)?Are there any good security guides/best practices for unicode sanitation?

评论 #30353168 未加载

评论 #30353691 未加载

评论 #30352920 未加载

评论 #30355504 未加载

评论 #30354182 未加载

评论 #30353550 未加载

bkmeneguelloover 3 years ago

<a href="https://xkcd.com/1137/" rel="nofollow">https://xkcd.com/1137/</a>

评论 #30355813 未加载

splchover 3 years ago

Oh that's cute! Translation for anyone curious / lazy:Punctuation General :block ,0xAE 0x80 0xE2 :8-UTF ,representation visual No :HTML ,8238 :decimal ,OVERRIDE LEFT-TO-RIGHT 202E+ULove the demos :)

Sebb767over 3 years ago

I'm not sure whether this is a bug or a feature^Weaster egg

评论 #30352728 未加载

评论 #30357000 未加载

评论 #30352667 未加载

avnigoover 3 years ago

The funny thing is that search queries preceded by a backslash on DuckDuckGo are supposed to take you to the first search result, but that functionality seems to be buggy anyway:<a href="https://www.reddit.com/r/duckduckgo/comments/sp9e5r/backslash_does_not_actually_go_to_first_result/" rel="nofollow">https://www.reddit.com/r/duckduckgo/comments/sp9e5r/backslas...</a>

thecosmicfrogover 3 years ago

Reminds me of searching for the terms "do a barrel roll", "recursion" or "askew" on Google. I'm sure there's plenty of others.

TadeusTaDover 3 years ago

Instantly reminded me of a relevant xkcd: <a href="https://xkcd.com/1137/" rel="nofollow">https://xkcd.com/1137/</a>

评论 #30352839 未加载

d13over 3 years ago

Can anyone explain what this is all about? I’m looking at the link and threads and have absolutely no idea what’s supposed to be significant here

评论 #30357626 未加载

评论 #30357643 未加载

tobz1000over 3 years ago

Easter egg or bug?

评论 #30352606 未加载

评论 #30352648 未加载

评论 #30352721 未加载

评论 #30352924 未加载

chris_wotover 3 years ago

"This Instant Answer was made by the DuckDuckHack Community.Developer: Cosimo StrepponeDeveloper: mintsoft"

ryukopostingover 3 years ago

And somehow, the "external link" icon is outside the scope of Unicode.

ameliusover 3 years ago

> This is often abused by hackers to disguise file extensions: when using it in the file name my-text.'U+202E'cod.exe, the file name is actually displayed as my-text.exe.docSo every programmer has to know about and support U+202E, but not filesystem programmers?

评论 #30353703 未加载

评论 #30353616 未加载

评论 #30353614 未加载

评论 #30353641 未加载

soheilover 3 years ago

What's next, searching for the word death causes you to die?

评论 #30355041 未加载

f3rnandoover 3 years ago

Also known as "Top Gun"

soheilover 3 years ago

Where does DDG get its search result? Do they scrape Google? If so how do they not bet banned both technically and legally?

评论 #30353582 未加载

评论 #30353259 未加载

gamblerover 3 years ago

Extremely bad design. This kind of complexity should have been moved to some kind of post-processing spec rather than core Unicode. It's already causing issues and will cause more. The more universal something is, the more effort should be applied to keeping it simple.

评论 #30353595 未加载

评论 #30353682 未加载

评论 #30357900 未加载

评论 #30353802 未加载

评论 #30353679 未加载

dheeraover 3 years ago

&#8238;damnit hn

kroltanover 3 years ago

It's intentional, if you inspect the `innerText` you'll see it's reversed there too:<pre><code> zero_click_wrapper.innerText.codePointAt(0) </code></pre> Evaluates to 32. And if you think 32 = 0x20 could mean the next one would be 0x2E, then no, codePointAt(1) is 0x55.

评论 #30353074 未加载

heartbeatsover 3 years ago

Why can't I just disable RTL on my system?I do not speak a word of Arabic. There is no circumstance in which my life will be materially improved by correct RTL text rendering. I might want proper display of individual characters so I can copy-paste them, but I have no use for RTL text.On the other hand, RTL causes a lot of unpleasant problems like this. Why can't I simply coerce all foreign languages into LTR?

27 comments

lucideerover 3 years ago

评论 #30353833 未加载

评论 #30353440 未加载

评论 #30358346 未加载

评论 #30357879 未加载

评论 #30358811 未加载

评论 #30355216 未加载

评论 #30357015 未加载

评论 #30392765 未加载

echelonover 3 years ago

评论 #30356582 未加载

benbristowover 3 years ago

Reversed: U+202E RIGHT-TO-LEFT OVERRIDE, decimal: 8238, HTML: No visual representation, UTF-8: 0xE2 0x80 0xAE, block: General Punctuation

jfk13over 3 years ago

评论 #30355196 未加载

评论 #30354125 未加载

评论 #30372659 未加载

Jerrrryover 3 years ago

评论 #30359076 未加载

评论 #30356653 未加载

hnlmorgover 3 years ago

If there was ever a clear signal that working with Unicode is incredibly hard, it would be the fact that no one on HN can decide if this is accidental or intentional.

评论 #30354049 未加载

评论 #30353353 未加载

评论 #30352859 未加载

评论 #30355923 未加载

评论 #30353711 未加载

评论 #30352802 未加载

评论 #30352868 未加载

评论 #30353423 未加载

stubishover 3 years ago

评论 #30371851 未加载

bncyover 3 years ago

Umm, there's a little info button to the right that says that this 'quick' answer was proposed by DuckDuckHack community author.

gunapologist99over 3 years ago

评论 #30353168 未加载

评论 #30353691 未加载

评论 #30352920 未加载

评论 #30355504 未加载

评论 #30354182 未加载

评论 #30353550 未加载

bkmeneguelloover 3 years ago

<a href="https://xkcd.com/1137/" rel="nofollow">https://xkcd.com/1137/</a>

评论 #30355813 未加载

splchover 3 years ago

Sebb767over 3 years ago

I'm not sure whether this is a bug or a feature^Weaster egg

评论 #30352728 未加载

评论 #30357000 未加载

评论 #30352667 未加载

avnigoover 3 years ago

thecosmicfrogover 3 years ago

Reminds me of searching for the terms "do a barrel roll", "recursion" or "askew" on Google. I'm sure there's plenty of others.

TadeusTaDover 3 years ago

Instantly reminded me of a relevant xkcd: <a href="https://xkcd.com/1137/" rel="nofollow">https://xkcd.com/1137/</a>

评论 #30352839 未加载

d13over 3 years ago

Can anyone explain what this is all about? I’m looking at the link and threads and have absolutely no idea what’s supposed to be significant here

评论 #30357626 未加载

评论 #30357643 未加载

tobz1000over 3 years ago

Easter egg or bug?

评论 #30352606 未加载

评论 #30352648 未加载

评论 #30352721 未加载

评论 #30352924 未加载

chris_wotover 3 years ago

"This Instant Answer was made by the DuckDuckHack Community.Developer: Cosimo StrepponeDeveloper: mintsoft"

ryukopostingover 3 years ago

And somehow, the "external link" icon is outside the scope of Unicode.

ameliusover 3 years ago

评论 #30353703 未加载

评论 #30353616 未加载

评论 #30353614 未加载

评论 #30353641 未加载

soheilover 3 years ago

What's next, searching for the word death causes you to die?

评论 #30355041 未加载

f3rnandoover 3 years ago

Also known as "Top Gun"

soheilover 3 years ago

Where does DDG get its search result? Do they scrape Google? If so how do they not bet banned both technically and legally?