Why CPUs aren't getting any faster (2010)

78 pointsby michael_nielsenabout 11 years ago

16 comments

delrothabout 11 years ago

CPUs are getting faster. Sandy Bridge is a 15-20% IPC improvement on Nehalem for some heavily integer and memory access based workloads. On the same workloads, Haswell is another 15-20% IPC improvement on Sandy Bridge.I work on the Dolphin Emulator (<a href="https://dolphin-emu.org/" rel="nofollow">https://dolphin-emu.org/</a>) which is a very CPU intensive program (emulates a 730MHz PowerPC core, plus a GPU, plus a DSP, all of that in realtime). We try and track CPU improvements to provide our users with proper recommandations on what hardware to go for. Here are the results of a CPU benchmark based on our software: <a href="https://docs.google.com/spreadsheet/ccc?key=0AunYlOAfGABxdFQ0UzJyTFAxbzZhYWtGcGwySlRFa1E#gid=0" rel="nofollow">https://docs.google.com/spreadsheet/ccc?key=0AunYlOAfGABxdFQ...</a>

评论 #7662830 未加载

trustfundbabyabout 11 years ago

The article doesn't go into it in depth, but I think the answer is that chip makers hit a wall somewhere over 3Ghz range where it became difficult to ramp up cpu frequency without spending ridiculous sums on cooling the processor so it could operate properly (you'll notice that even now, the fastest intel chips come in at the 3.1-3.2 Ghz range ... theres a reason for that)I was big into building computers during the cpu race between AMD/Intel back in the late 90's early 2000's and the Intel Pentium 4 processor line is notable for pushing the envelope from the the high 2Ghz range all the way up to the 3.4Ghz and 3.6ghz (I still have a 3.4Ghz chip sitting in my home office ... those were the days!)Wikipedia does a great job of chronicling what happened with the Pentium 4 line here <a href="http://en.wikipedia.org/wiki/Pentium_4" rel="nofollow">http://en.wikipedia.org/wiki/Pentium_4</a> with hints to what I've just alluded to above"Overclocking early stepping Northwood cores yielded a startling phenomenon. While core voltage approaching 1.7 V and above would often allow substantial additional gains in overclocking headroom, the processor would slowly (over several months or even weeks) become more unstable over time with a degradation in maximum stable clock speed before dying and becoming totally unusable"It was after their failures with the brute force attempt at higher cpu cycles that Intel finally went a different way with initially the Pentium M line (code named Dothan and Banias) <a href="http://en.wikipedia.org/wiki/Pentium_M" rel="nofollow">http://en.wikipedia.org/wiki/Pentium_M</a> and eventually the Core duo/Core series that they've since built on.

评论 #7662358 未加载

评论 #7662160 未加载

评论 #7662588 未加载

higherpurposeabout 11 years ago

Interesting that the article mentions SNB, because since then the gains in performance have been much smaller. SNB was the last "significant" gain in performance for Intel CPU's I'd say (+35 percent over previous generation). All of the new generations since then have gotten like 10 percent increase in IPC, at best, and Broadwell will probably get a max gain of +5 percent.To "hide" this, Intel has refocused its marketing on power consumption, where gains seem easier to achieve (for now), as well as other pure marketing tricks such as calling what used to be "Turbo Boost speed", the "normal speed". I've noticed for example recently a Bay Trail laptop being marketed at "2 Ghz", even though Bay Trail's base speed is much lower than that.

评论 #7662071 未加载

评论 #7662146 未加载

Spittieabout 11 years ago

I personally have my own idea for that: Sure, we have hit many walls, but I don't think that's the main reason for the slow down in CPU development. I think it's mostly because the R&D moved from making the CPU faster for making the CPU consume less, to follow the Laptop/Mobile market (as everyone loves/hates to say, every year is the year of the death of the PC).Also we're at the point where most of the very-requesting software isn't bottlenecked by the CPU, or where you can just throw more cores at the problem and solve it. And also software is starting to leverage GPU acceleration, which gives an huge boost when usable. And GPUs are getting a lot faster every new generation.

评论 #7662641 未加载

logicalleeabout 11 years ago

This is why CPUs aren't getting any faster:<a href="https://www.google.com/search?q=c+%2F+5+ghz" rel="nofollow">https://www.google.com/search?q=c+%2F+5+ghz</a>

评论 #7662100 未加载

评论 #7662804 未加载

zokierabout 11 years ago

The power wall theory is bit odd though. Why are modern Intel desktop CPUs limited to so low power budgets? Ivy bridges were just 77W (TDP), and now Haswells are apparently 65-84W. Desktop platforms should be able to handle far more power, at least in the 100-150 watt range. Meanwhile desktop GPUs are hitting 200-300 watt TDPs regularly, with far more limited cooling systems.Why isn't Intel able (or willing) to push the power envelope higher in desktops?

评论 #7662900 未加载

bluedinoabout 11 years ago

Additions to the instruction set can help out where raw GHz don't get things done.Another big improvement has been moving certain functions to hardware - Intel's Quicksync is a great example of this.

评论 #7661951 未加载

neonaabout 11 years ago

I hope we see an increase in real software parallelism, since that's the only real way out of this for the foreseeable future. Tacking on more cores is still an option we have, we're just having trouble using them right now in many contexts.In the longer term, we'll hopefully see advancements that let us fundamentally change how logic processors are constructed, such as possibly photonic logic chips. Only a major shift will let us break through the current single-thread performance wall.

评论 #7661769 未加载

th3iedkidabout 11 years ago

Weren't there walls back in the 90s?I would rather bet on a new tech leap than to go by federated designs at this stage.

评论 #7661850 未加载

zwegnerabout 11 years ago

The article doesn't really seem to answer the question the title says it does.Of course there's the well-known reasons, nonlinearity of power vs frequency scaling, diminishing returns in hardware design, etc. But there are others that we don't hear so much about.Hardware design is still in pretty nascent stage, technology-wise. The languages used (say SystemC or Verilog) offer very little high-level abstraction, and the simulation tools suck. Sections of the CPU are still typically designed in isolation in an ad-hoc way, using barely any measurements, and rarely on anything more than a few small kernels. Excel is about the most statistically advanced tool used in this. Of course, CPUs are hugely intertwined and complicated beasts, and the optimal values of parameters such as register file sizes, number of reservation stations, cache latency, decode width, whatever, are all interconnected. As long as design teams only focus on their one little portion of the chip, without any overarching goal of global optimization, we're leaving a ton of performance on the table.And for that matter, so is software/compiler design. The software people have just been treating hardware as a fixed target they have no control over, trusting that it will keep improving. That makes us lazy, and our software becomes more and more slow, by design (The Great Moore's Law Compensator if you will, also known as <a href="https://en.wikipedia.org/wiki/Wirth%27s_law" rel="nofollow">https://en.wikipedia.org/wiki/Wirth%27s_law</a>).The same problems we see in hardware design, huge numbers of deeply intertwined parameters, also applies to software/compiler design. We're still writing in C++ for performance code, for chrissakes. And even beyond that, the parameters in software and hardware are deeply intertwined with each other. To optimize hardware parameters, you need to make lots of measurements of representative software workloads. But where do those come from, and how are they compiled? Compiler writers have the liberty to change the way code is compiled to optimize performance on a specific chip (even if this isn't done so much in practice). To get an actually representative measurement of hardware, these compiler changes need to be taken into account. Ideally, you'd be able to tune parameters at all layers of the stack, and design software and hardware together as one entity. That is, make a hardware change, then do lots of compiler changes to optimize for that particular hardware instantiation. This needs to be automated, easy to extend, and super-duper fast, to try all of the zillions of possibilities we're not touching at the moment. There's even "crazy" possibilities like moving functionality across the hardware/software barrier. Of course it's a difficult problem, but we've made almost zero progress on it.Backwards compatibility is another reason. New instructions get added regularly, but those are only for cases where big gains are achieved in important workloads. For the most part, CPU designers want improvements that work without a recompile, because that's what most businesses/consumers want. One can envision a software ecosystem that this wouldn't be such a problem for, but instead we have people still running IE6/WinXP/etc. Software can move at a glacial pace, and hardware needs to accommodate it. But this of course also enables this awfully slow pace of software progress.

评论 #7662519 未加载

评论 #7663016 未加载

评论 #7663410 未加载

ufmaceabout 11 years ago

I'm curious if anyone here has any perspective on how close we are to absolute physical limits in CPU design. Last I heard, we're getting pretty close to dealing with quantum issues due to how small the transistor and connection size is getting, the frequency of light we need to do the etching, etc. I wonder if anybody knows how close we are to hitting hard limits in various categories. Surely, we'll hit some eventually, and I wonder what happens then.

评论 #7663075 未加载

评论 #7663108 未加载

snarfyabout 11 years ago

Grace Harper explains it best:<a href="http://www.youtube.com/watch?v=JEpsKnWZrJ8" rel="nofollow">http://www.youtube.com/watch?v=JEpsKnWZrJ8</a>

akuma73about 11 years ago

The end of Dennard scaling is the root-cause. This is causing power density to stop scaling with smaller transistors and will ultimately be the end of Moore's law.<a href="http://research.microsoft.com/en-us/events/fs2013/doug-burger_beastfrombelow.pdf" rel="nofollow">http://research.microsoft.com/en-us/events/fs2013/doug-burge...</a>

评论 #7665582 未加载

exeliusabout 11 years ago

IMO CPUs aren't getting any faster because we don't really need them to be much faster.Now before the flames begin, let me caveat that as "We don't really need them to be much faster at single-threaded workloads." The article round-aboutly mentions this in the context of specialized CPUs, specifically GPUs: GPUs are basically hyper-concentrated thread runners. They're not very fast at running any single thread, but they have efficient shared memory and can run thousands of individual threads at once.For larger workloads, we've gotten a lot more efficient through cloud computing. An individual CPU in the cloud is really not any faster than it was 5 years ago; but the advances made in energy efficiency (aka heat) and miniaturization mean you can fit a lot more of them in a smaller space.While the technical hurdles to going faster are very real, I think we've built a technical infrastructure that's just not as reliant on the performance of any single piece of the system as it used to be. Therefore there is less demand for faster CPUs, when for many of the traditional "hard" computational workloads, more CPUs works almost as well and is a lot easier to scale than faster CPUs.

评论 #7662201 未加载

评论 #7662207 未加载

philosophusabout 11 years ago

I realize this article is from 2010, but it could have mentioned AMD, which does have a 5 GHz chip available now. It requires liquid cooling however.

评论 #7662263 未加载

jokoonabout 11 years ago

or "why it's more and more relevant to code with performance in mind, and consider minimalist designs"

评论 #7662329 未加载