AoT, JIT, bytecode, none of this matters if you can't control your data layout and access patterns. All this talk of performance and still not a mention of cache misses anywhere.<p>There seems to be a desire for this magic inliner and compiler that fixes all your performance problems when it just doesn't work like that.<p>Until you understand your data and have complete control over it none of this other stuff matters.
If "template JIT" means what I think it does, this is basically what you get from Cython[1] or, later, Nuitka[2], only they're "template AOT"s.<p>Sadly, it buys you sorely little. CPython's bytecodes are "big", so very little time is spent between them relative to that spent inside them. The motivation to do anything smarter here is lacking.<p>[1] <a href="http://cython.org/" rel="nofollow">http://cython.org/</a><p>[2] <a href="http://nuitka.net/" rel="nofollow">http://nuitka.net/</a>