The code change on GitHub: <a href="https://github.com/openbsd/src/commit/56bc0efbd80da2ff595e44601f8d41473418bf4a" rel="nofollow">https://github.com/openbsd/src/commit/56bc0efbd80da2ff595e44...</a>
Not identical but related: do compilers tune with the OS to get code and data into L1 cache? I believe so, I want to believe you can compile to avoid mem fetch, let alone disc. E.g. if I know my AES needs a 20k table, does the compiler know how to make this apparent to the CPU farm so its never out of L1?<p>I continue to believe a 2cpu 1mb L1 cache is faster than 2 cpu, 2HT, 512k L1 cache. 4x fake CPUs and less L1 doesn't beat true cpu and lots of cache on die. But, I might be wrong!