> While the randomized tests are great, it's still entirely possible for them to miss bugs. The state space for the arguments of a binary transfer function is 3**64 * 3**64, and if only a small part of that contains wrong behaviour it would be really unlikely for us to find it with random tests by chance.<p>Wouldn't exhaustive testing on 3**8 * 3**8 inputs (that is, on 1-byte-wide integers) be good enough? That's about 43 million combinations, which is feasible to loop over even in Python.
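To make the idea concrete, here is a rough sketch of what such an exhaustive check could look like. Everything in it is an assumption for illustration: the `(ones, unknowns)` bitmask representation, the helper names, and `abstract_and` as a stand-in transfer function are all hypothetical and not taken from the article's code.

```python
from itertools import product


def all_knownbits(width):
    """Yield every abstract value as (ones, unknowns); there are 3**width of them."""
    for trits in product((0, 1, None), repeat=width):
        ones = unknowns = 0
        for i, t in enumerate(trits):
            if t is None:          # bit i is unknown
                unknowns |= 1 << i
            elif t == 1:           # bit i is known to be 1
                ones |= 1 << i
        yield ones, unknowns


def concretize(ones, unknowns, width):
    """Yield every concrete integer that the abstract value describes."""
    for c in range(1 << width):
        if c & ~unknowns == ones:
            yield c


def abstract_and(a, b):
    """Hypothetical transfer function for bitwise AND (illustration only)."""
    a_ones, a_unk = a
    b_ones, b_unk = b
    ones = a_ones & b_ones
    # A result bit can be 1 only if it can be 1 on both sides.
    maybe_one = (a_ones | a_unk) & (b_ones | b_unk)
    return ones, maybe_one & ~ones


def check_sound(transfer, concrete_op, width):
    """Check soundness for every pair of abstract inputs; return pairs checked."""
    mask = (1 << width) - 1
    values = [(v, list(concretize(*v, width))) for v in all_knownbits(width)]
    checked = 0
    for a, a_concrete in values:
        for b, b_concrete in values:
            r_ones, r_unk = transfer(a, b)
            for x in a_concrete:
                for y in b_concrete:
                    res = concrete_op(x, y) & mask
                    # Every known bit of the abstract result must match.
                    assert res & ~r_unk == r_ones, (a, b, x, y)
            checked += 1
    return checked


if __name__ == "__main__":
    # Width 4 keeps the run short; raise it to 8 for the 1-byte case.
    print(check_sound(abstract_and, lambda x, y: x & y, 4))
```

One caveat with this particular sketch: it also loops over every concrete pair inside each abstract pair, which adds up to 4**width * 4**width concrete checks in total. At width 8 that is roughly 4.3 billion, so in pure Python you would probably want a smaller width or a cheaper inner check.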