Once again, programmers forget alphabets other than the Latin one exist. And by Latin, I mean the one containing only A through Z, with no “fancies”.<p>Then there’s the issue of character encodings, which the article does a decent job explaining! Nitpick though: they claims it’s a UTF-8/ASCII thing, but it’s actually a UTF-8/Windows-1252 issue.<p>In related news, even GNU coreutils fails to support UTF-8 properly despite a claim of support for multibyte character sets: <a href="https://catgirl.ai/log/cut-c-harmful/" rel="nofollow">https://catgirl.ai/log/cut-c-harmful/</a>