科技回声

8 条评论

pcwalton将近 10 年前

I talked with Chris Lattner a few years ago about register allocation and he had an interesting perspective. In his view (from what I remember; my recollection could be somewhat incorrect) most of the classical academic research on it is solving the wrong problem. Most formulations of register allocation are graph coloring algorithms designed to answer the question "is this function colorable using K registers without spilling?" The algorithms for handling the case when you do spill usually don't have as much thought put into them. But this is emphasizing the wrong aspect of the problem in practice; for almost any interesting function on the commonly used architectures (x86/ARM), the answer to the theoretical question is trivially "no" (because there are relatively few registers), and the most important practical problem is how to spill effectively (including deciding whether to spill versus split versus rematerialize, etc.) As I understand it, that's the idea behind LLVM's (relatively) new greedy allocator [1]: the graph-coloring part of the problem is simple, and the focus is on spilling and splitting, problems that the classical academic literature has tended to put less emphasis on.[1]: <a href="http://blog.llvm.org/2011/09/greedy-register-allocation-in-llvm-30.html" rel="nofollow">http://blog.llvm.org/2011/09/greedy-register-allocation-in-l...</a>

评论 #9754055 未加载

评论 #9755843 未加载

评论 #9755071 未加载

DannyBee将近 10 年前

"If the code has more than that many variables, OCaml compiler has to park the extra variables in memory and this parking is called spilling."This is just wrong. It would be accurate to say "If the code has more than that number of variables live at the same time".If the computations are not live at the same time, they can share a register.

gct将近 10 年前

The notion that there's only 13/16 registers (assuming x86) hasn't been true for a long time. There's hundreds in the latest cores from Intel. It's true there's only 13/16 names for them, but with register renaming there's way more places to actually put data than that.

评论 #9754549 未加载

tempodox将近 10 年前

Those are interesting observations. When doing something like `let rec loop ...`, I got into the habit of using parameters for only those entities that might change from one iteration to the next. The rest (as being constant / invariant across iterations) dwells in the closure env, if only for the sake of making the source code shorter. The article shows nicely how this obvious mental shortcut has its (equally obvious) costs.It's also a nice practical demonstration of how reading disassemblies is still an integral part of understanding a language, even this relatively abstract ML descendant.

cosmicexplorer将近 10 年前

Why wouldn't the OCaml compiler optimize away the variable usage in the first example? It seems an easy optimization to make if they're only used once.

评论 #9753964 未加载

lispm将近 10 年前

There is <> missing in line 3 of the first code snippet.

a-dub将近 10 年前

Does OCaml generate code that makes use of SIMD?

froh42将近 10 年前

Use a better algorithm instead of worrying about processor registers.And if you start considering the computer architecture think about caches, memory latency and bursting first. Use a cache-conscious data structure.Optimizing at the register level is ridiculous, this is the least place where you get significant returns for the effort you spend.Oh, and make it correct first, make it fast second. Use a profiler.

评论 #9754024 未加载

8 条评论

pcwalton将近 10 年前

评论 #9754055 未加载

评论 #9755843 未加载

评论 #9755071 未加载

DannyBee将近 10 年前

gct将近 10 年前

评论 #9754549 未加载

tempodox将近 10 年前

cosmicexplorer将近 10 年前

Why wouldn't the OCaml compiler optimize away the variable usage in the first example? It seems an easy optimization to make if they're only used once.

评论 #9753964 未加载

lispm将近 10 年前

There is <> missing in line 3 of the first code snippet.

a-dub将近 10 年前

Does OCaml generate code that makes use of SIMD?

froh42将近 10 年前

评论 #9754024 未加载

CPU registers and OCaml

8 条评论

CPU registers and OCaml

8 条评论