The article really only touches on the absolute basics. It would have left me hurting for more specifics. However, I've read Ulrich Drepper's epic paper on modern memory architectures. (<a href="http://lwn.net/Articles/259710/" rel="nofollow">http://lwn.net/Articles/259710/</a>)<p>You should do yourself a favor and read it.
Is this different from caches? If so, the advice also applies to properly using the cache. I've read that cache misses are the biggest source of slow-downs on modern processors. (Sorry, no citation)
Nice introduction to data prefetching. I don't fully agree with this statement, though:<p><i>Second, programmers have very little control over instruction placement in memory.</i><p>Depending on your platform and type of application you may have full control over instruction placement and it might be extremely important to know how to sequence instructions to get maximum performance.