TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Zen, CUDA, and Tensor Cores, Part I: The Silicon

175 点作者 throwaway712719 个月前

5 条评论

fulafel9 个月前
The answer to the leading question "What’s the difference between a Zen core, a CUDA core, and a Tensor core?" is not covered in Part 1, so you may want to wait if this interests you more than chip layouts.
评论 #41471947 未加载
paulmd9 个月前
you can calculate the area of the tensor and raytracing units by measuring+comparing die sizes between the nearest 20-series and 16-series chips. Contrary to the assumptions a lot of people made from the cartoon diagrams, it&#x27;s actually relatively small, together they make up approximately 18% of the cluster area and it&#x27;s below 10% of the chip as a whole. The area is roughly 2&#x2F;3rds tensor unit area and 1&#x2F;3 raytracing unit area, so RT is around 3% of total chip area and tensor is around 6%.<p><a href="https:&#x2F;&#x2F;old.reddit.com&#x2F;r&#x2F;hardware&#x2F;comments&#x2F;baajes&#x2F;rtx_adds_195mm2_per_tpc_tensors_125_rt_07&#x2F;" rel="nofollow">https:&#x2F;&#x2F;old.reddit.com&#x2F;r&#x2F;hardware&#x2F;comments&#x2F;baajes&#x2F;rtx_adds_1...</a><p>This could have changed somewhat in newer releases, but probably not too drastically, since NVIDIA has never really increased raw ray performance since the 20-series launch. And while there have been a few raytracing features around the edges, raster and cache have been bumped significantly too (notably, ampere got dual-issue fp32 pipelines... which didn&#x27;t really work out for NVIDIA that well either!) so honestly there&#x27;s a reasonable chance it&#x27;s slightly less in subsequent architectures.
kvemkon9 个月前
&gt; Each of the tiles on the CPU side is actually a Zen 4 core, complete with its dedicated L2 cache.<p>Perhaps, it could be more interesting to compare without L2 cache.
评论 #41475567 未加载
评论 #41473541 未加载
diabllicseagull9 个月前
It was a good read. I wonder what hot takes he&#x27;ll have in the second part if any.
downvotetruth9 个月前
I refused to buy the so determined defective chips even if they represented better value because if the intent was truly to try and max yield then there should be for Ryzen for example good 7 core versions with only 1 core that was found to be defective. Since no 7 core zens exist, then at least some of the CPUs with 6 core CCDs have intentionally had 1 of the cores destroyed for reasons unknown, which could be to meet volume targets. If this is because for Ryzen the cores can only be disabled in pairs, then it boggles my mind that it would not be economic given the $ diff of tens to hundreds of dollars between the 6 and 8 core versions that is does not make sense to add the circuits to allow each core to be individually fused off and allow further product differentiation, especially considering how much effort and # of SKUs have been put forth with the frequency binning in AM4 (5700x, 5800, 5800x, 5800xt, etc.), rather than bigger market segmentation jumps.
评论 #41471418 未加载
评论 #41471491 未加载
评论 #41471434 未加载