科技回声

1 comment

yarg about 1 year ago

(What the article calls) Canonisation is an interesting issue: it seems very hard to get right, and to do so efficiently.

I'm trying to wrap my head around the issue in a regex parser that I've knocked up.

I'm currently ending up (expectedly) with multiple nodes representing equivalent languages; I want to strip these out when I use code generation to convert the constructed network into a switch-based static automaton.

The most robust way to see whether two languages are the same is to xor them and test for interminability, but this means comparing each pair of nodes throughout the network, and I'd rather avoid the n^2 scaling if there's another option.

That option is to generate a canonical expression for the language that the machine represents, which is somewhat difficult but far more efficient when it comes to detecting collisions.

https://en.wikipedia.org/wiki/Canonization

https://en.wikipedia.org/wiki/Canonicalization
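For anyone following along, here is a minimal sketch of the pairwise xor check described above. It is not the commenter's code. It assumes the network is already deterministic, stored as a nested dict of transitions, and that a missing transition means "dead state". The names `step`, `same_language`, `delta` and `accepting` are all made up for illustration. It walks the product of two start nodes and reports a difference as soon as exactly one side accepts, which is the same as finding that the xor machine accepts something.

```python
from collections import deque


def step(delta, state, symbol):
    # A missing transition leads to an implicit dead state, modelled as None.
    if state is None:
        return None
    return delta.get(state, {}).get(symbol)


def same_language(delta, accepting, p, q, alphabet):
    """Return True if nodes p and q accept the same language.

    Explores the reachable pairs of the product machine. The xor machine
    accepts at a pair exactly when one component accepts and the other
    does not, so reaching such a pair proves the languages differ.
    """
    seen = {(p, q)}
    queue = deque([(p, q)])
    while queue:
        a, b = queue.popleft()
        if (a in accepting) != (b in accepting):
            return False
        for symbol in alphabet:
            nxt = (step(delta, a, symbol), step(delta, b, symbol))
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return True


# Hypothetical network: two differently shaped machines for a*b.
delta = {
    "s0": {"a": "s0", "b": "s1"},
    "s1": {},
    "t0": {"a": "t1", "b": "t2"},
    "t1": {"a": "t0", "b": "t2"},
    "t2": {},
}
accepting = {"s1", "t2"}

print(same_language(delta, accepting, "s0", "t0", "ab"))
print(same_language(delta, accepting, "s0", "s1", "ab"))
```

Running this for every pair of nodes is where the n^2 factor comes from. Partition refinement, as used in standard DFA minimisation, is one well-known way to group all equivalent nodes in a single pass instead, though whether it fits this particular code generator is a guess.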


Hashing Modulo Theories

1 comment
