The readme has a link to a technical paper on arxiv that was uploaded last year[0], has that perhaps been discussed before?<p>(not meant as a complaint that this might have been submitted before already, I'm just curious about what might have already been said about it)<p>[0] <a href="https://arxiv.org/abs/2109.10433" rel="nofollow">https://arxiv.org/abs/2109.10433</a>
I dunno much about Unicode, but I imagine it is a regular language? (Aka: acception / rejection can be determined by regular expressions / finite state machines)<p>If so, regular languages are one of those 'surprising things that can be parallelized'. It doesn't seem possible at first thought though.
I'm at the point where I can no longer see any reason to use UTF-16. UTF-8 is used everywhere today and constantly converting between the two is not only inefficient, but also introduce a risk of bugs/corruption.