TechEcho

7 comments

lelf over 11 years ago

It's broken.

Λ̊1 → ⊻∪ά → Λ̊⋌𝄞 → 뤔뷾 → 駴點

Edit: anyway, even with a correct (a+b)%n it's a plain bad idea. Unicode is not the English alphabet. Everything outside the Basic Multilingual Plane is broken automatically. And even in the BMP there's going to be a bag of glitches, starting with hanging combining characters and ending with ‘oops, someone normalised our string and it's now different’ (for the site, not for the user or Unicode).
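To make the (a+b)%n point concrete, here is a minimal Python sketch. It is not the site's actual algorithm: the `rotate` helper, the `BMP_SIZE` constant, and the choice to rotate over the whole BMP are assumptions made for illustration. It shows that a modular rotation only undoes itself when the shift is half the range, and that two canonically equivalent spellings of the same character rotate to different results.

```python
import unicodedata

BMP_SIZE = 0x10000  # assumption: rotate only within the Basic Multilingual Plane


def rotate(text: str, shift: int, size: int = BMP_SIZE) -> str:
    """Rotate every code point below `size` by `shift`, wrapping with modulo."""
    out = []
    for ch in text:
        cp = ord(ch)
        # Code points at or above `size` are passed through unchanged.
        out.append(chr((cp + shift) % size) if cp < size else ch)
    return "".join(out)


half = BMP_SIZE // 2
word = "hello"

# Shifting by half the range twice returns the original text.
print(rotate(rotate(word, half), half) == word)   # True
# Shifting by 8000 twice does not, because 2 * 8000 is not a multiple of 65536.
print(rotate(rotate(word, 8000), 8000) == word)   # False

# Two canonically equivalent spellings of "é" rotate to different results.
composed = "\u00e9"
decomposed = unicodedata.normalize("NFD", composed)   # "e" + U+0301
print(len(rotate(composed, half)), len(rotate(decomposed, half)))   # 1 2
```

Even this naive version can land on surrogate code points (0xD800 to 0xDFFF), which cannot be encoded on their own, so a real implementation would have to skip or remap them.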

mischanix over 11 years ago

Not reciprocal for CJK input, e.g. "한글" takes 5 iterations to reach stability. I believe this has to do with the UTF-16 encoding of code points > 0x10000.
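One way to check whether surrogate pairs are involved is to count the UTF-16 code units each character needs. The sketch below uses a hypothetical helper, `utf16_units`, built on Python's standard `str.encode`; it is only a diagnostic, not part of rot8000.

```python
def utf16_units(ch: str) -> int:
    """Return how many 16-bit code units `ch` occupies in UTF-16."""
    # "utf-16-le" emits no byte order mark, so every code unit is two bytes.
    return len(ch.encode("utf-16-le")) // 2


for ch in "한글𝄞":
    print(f"U+{ord(ch):04X} needs {utf16_units(ch)} UTF-16 code unit(s)")
```

Both Hangul syllables (U+D55C and U+AE00) report one code unit each, while 𝄞 (U+1D11E) reports two, so surrogate pairs alone may not explain the behaviour seen with "한글".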

aculver over 11 years ago

Inputting "こんにちは。元気ですか？" caused an application error:

    [ArgumentException: Error serializing value 'ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫ�' of type 'System.String.']

After realizing it was "？" that was breaking everything, I ended up with this round trip:

"こんにちは。元気ですか。" → "ᄳᅳᅋᅁᅏტ㈣䳷ᅇᄹᄫტ" → "こんにちは。ጃ⷗ですか。"

It's broken. I suspect Unicode requires more careful manipulation than OP anticipated. :-)

peterwaller over 11 years ago

Copy-pasting the contents of rot8000.com/info in and hitting cypher twice ends up scrambling the contents quite a bit:

    It also bypasses 32 control characters, technically making it rot7968, sometimes with an additional offset.

->

    It also bypasses ⋍2 control characters, technically making it rot⋏⋬68, sometimes with an additional offset.

rottytooth over 11 years ago

I put in a fix for CJK and the result is that nearly everything that's not CJK now rotates into it and back out; CJK is a huge section of the Basic Multilingual Plane. Unfortunately, the fix invalidates rotations done with rot8000 before the fix.

njharman over 11 years ago

I just realized that 13 was probably chosen for rot13 because that's half the number of letters in the English alphabet. I miss "obvious" stuff like that all the time.
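The "half the alphabet" observation is what makes rot13 its own inverse: applying a shift of 13 twice moves each letter by 26, which is a full lap. A small Python sketch, with `rot` as an illustrative helper name:

```python
import string


def rot(text: str, shift: int) -> str:
    """Rotate ASCII letters by `shift` positions; leave everything else alone."""
    k = shift % 26
    lower, upper = string.ascii_lowercase, string.ascii_uppercase
    table = str.maketrans(
        lower + upper,
        lower[k:] + lower[:k] + upper[k:] + upper[:k],
    )
    return text.translate(table)


message = "Hello, World!"
print(rot(message, 13))                      # Uryyb, Jbeyq!
print(rot(rot(message, 13), 13) == message)  # True: 13 + 13 = 26
print(rot(rot(message, 5), 5) == message)    # False: 5 + 5 = 10
```

Any other shift needs a separate decoding step with the complementary shift, which is why the same "encode is decode" trick only carries over to Unicode if the shift is exactly half the size of the rotated range.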

jloughry over 11 years ago

Why not call it Rot8192 or Rot0x7777?

Rot8000 – Rot13 for the Unicode generation
