I used this for a project re: similarity between two strings.<p>The Jaccard similarity between sets of uni- and bi-grams was a surprising effective metric.<p>DOG -> {d, o, g, do, og}<p>GOD -> {g, o, d, go, od}<p>intersection = {d, g, o}<p>union = {d, g, o, do, go, od, og}<p>J = 3 / 7 = ~43%