The paper has recently been called into question for overestimating their performance relative to BERT: <a href="https://news.ycombinator.com/item?id=36758433">https://news.ycombinator.com/item?id=36758433</a>. Might be good for the blog's author to take this into account in their explainer. The author's perspective sounds a bit too positive (and borderline salesmanlike).
In addition to the evaluation issues, it looks like several of their test sets have significant overlap with the test sets [1]. Especially for a compression-based technique, having exact duplicates is going to help a lot.<p>[1] <a href="https://github.com/bazingagin/npc_gzip/issues/13">https://github.com/bazingagin/npc_gzip/issues/13</a>