Wilfred, please correct your statement that the benchmarks game requires "all the test programs for the same language to be identical".<p>It isn't true. It wasn't true 4 years ago.<p>For sure, my preference was to show PyPy programs that also <i>worked</i> with CPython -- that made clear that optimizing for PyPy could make performance worse with CPython and <i>vice versa</i>.