TechEcho

A Deep Dive into AMD’s Rome Epyc Architecture

135 points by lamchob almost 6 years ago

6 comments

mjw1007 almost 6 years ago
Up until around 2012, realworldtech.com and anandtech.com used to publish rather more detailed descriptions of the microarchitecture inside each core.

Is anyone publishing things like that these days? I mean pages like these:

https://www.realworldtech.com/haswell-cpu/4/
https://www.anandtech.com/show/6355/intels-haswell-architecture/8

(I noticed that Agner Fog's chapter on Ryzen is conspicuously missing a "Literature" section.)
mmrezaie almost 6 years ago
There must be simulators for this kind of architecture, to find the best combination of sizes and components while keeping the design practical. Does anyone know of something like that? I mean a tool to min-max these choices and estimate whether the design can be built with the resources available.
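
As a toy illustration of the kind of min-maxing asked about above, the sketch below brute-forces a tiny design space under an area budget. Everything in it is hypothetical: the area figures, the budget, and the `score` model are made-up placeholders, not AMD data, and real tools rely on cycle-level simulation rather than a closed-form formula.

```python
import math
from itertools import product

# All constants are made-up placeholders for illustration only.
AREA_BUDGET_MM2 = 80.0
CORE_AREA_MM2 = 4.0         # hypothetical area per core
L3_AREA_MM2_PER_MB = 1.0    # hypothetical area per MB of L3 cache


def area(cores, l3_mb):
    """Total die area of a candidate configuration under the toy model."""
    return cores * CORE_AREA_MM2 + l3_mb * L3_AREA_MM2_PER_MB


def score(cores, l3_mb):
    """Toy throughput: linear in cores, diminishing returns from cache."""
    return cores * (1.0 + 0.1 * math.log2(l3_mb))


def best_config(core_options, l3_options, budget=AREA_BUDGET_MM2):
    """Return the feasible (cores, l3_mb) pair with the highest score, or None."""
    feasible = [
        (c, l3)
        for c, l3 in product(core_options, l3_options)
        if area(c, l3) <= budget
    ]
    return max(feasible, key=lambda cfg: score(*cfg), default=None)


print(best_config([4, 8, 16], [8, 16, 32]))  # (16, 16) with these placeholder numbers
```

Exhaustive search only works because this space has nine points; with more parameters one would sample or prune instead, and swap the formula for measurements from a simulator.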
MayeulC almost 6 years ago
> “We like features that improve both power and performance,” Clark elaborated. “Being on the right path more often is important because the worst use of power is executing instructions that you are just going to throw away. We are not throwing work away after we figure out dynamically that we were wrong to do it. This definitely burns more power on the front end, but it pays dividends on the back end.”

All the documentation I've seen is quite light on the branch prediction improvements. Going by the slides, they improved its accuracy by 1/3; I'd be curious to know how. Side note: if your superscalar core is big enough (yeah, those registers use power), couldn't you just get rid of branch prediction at no performance cost (doing something else while waiting for the data)?

My only grudge against Zen (as a consumer) is that the AM4 socket is intended for both APUs and CPUs. While this is a good thing, it leaves me with a couple of utterly useless video outputs on my motherboard. I would have liked AMD to include some display driver circuitry on every chip. Maybe in the I/O die, if they use such a thing in all of their designs going forward? I mean, I would be quite content with using software rendering when I need to drive a screen, or even sparing a bit of memory bandwidth and CPU cycles to drive an extra display from my desktop's graphics card.
shaklee3 almost 6 years ago
This didn't really seem like a deep dive compared to the AnandTech article. I was hoping for some memory bandwidth benchmarks, since this should be the first chip that has 8 channels without caveats (looking at you, POWER9). It's also not clear whether it's 16 channels with 2S, but I suspect not.

Edit: the picture from AMD in this review makes me think it can hit 16 memory channels with the two-socket version. Does anyone know if this is true?
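
In the absence of benchmarks, the theoretical ceiling is easy to work out. The sketch below assumes DDR4-3200 and a 64-bit (8-byte) data bus per channel; both are assumptions not stated in this thread. The 16-channel figure is only what a two-socket system would reach if each socket really does keep its own 8 channels, which is the open question above. Measured bandwidth will be lower than either number.

```python
def peak_bandwidth_gb_s(channels, mega_transfers_per_s, bus_bytes=8):
    """Theoretical peak memory bandwidth in GB/s (decimal gigabytes).

    channels             -- number of memory channels
    mega_transfers_per_s -- transfer rate in MT/s (3200 for DDR4-3200)
    bus_bytes            -- bytes moved per transfer per channel (assumed 8)
    """
    return channels * mega_transfers_per_s * bus_bytes / 1000


one_socket = peak_bandwidth_gb_s(8, 3200)    # 8 channels
two_socket = peak_bandwidth_gb_s(16, 3200)   # 16 channels, if 2S doubles them

print(f"1S: {one_socket:.1f} GB/s, 2S: {two_socket:.1f} GB/s")
# 1S: 204.8 GB/s, 2S: 409.6 GB/s
```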
thinkersilver almost 6 years ago
The poster is holding a line of bash to the same standard as code, illustrating that readability should be the goal and showing a way to bring bash commands up to the standard of readability expected in something like a PR. Readability is really there to show _intent_.

I would say, though, that if you are bringing this up to the code standards of today, then it should really be wrapped in some kind of unit test (https://github.com/sstephenson/bats) for it to pass the PR. That would make the code a bit more maintainable, and the test can be integrated as a stage in your CI/CD pipeline.

If we do that, the intent is clarified by the input and the expected output of the test. Then the code would at least be maintainable, and the readability problem becomes less of an issue when it comes to technical debt.

I've done this plenty of times with my teams and it's certainly helped.
ramshanker almost 6 years ago
My gut feeling is that Intel also lays out and develops the I/O block and the cores separately. It's just that they all end up on a single piece of silicon.