TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

n times faster than C, Arm edition

5 pointsby iscmt7 months ago

2 comments

snvzz7 months ago
&gt;In general, try to use the compiler to your advantage, but make sure to check the resulting assembly. In this case eqsub emitted a perfect sequence of cmeqs and subs, but you won’t always be so lucky. Occasionally it just won’t emit what you want, even if you use intrinsics, and in those cases writing in raw assembly is an option.<p>I strongly disagree with such a conclusion. If you&#x27;re gonna twist and bend your source code so that the compiler generates the &quot;desired assembly&quot; for your target architecture, you should be instead writing assembly.
ndesaulniers7 months ago
&gt; It turns out that the AArch64 execution state (or is it the A64 instruction set? or the Armv8-A architecture? I hate this<p><a href="https:&#x2F;&#x2F;nickdesaulniers.github.io&#x2F;blog&#x2F;2023&#x2F;03&#x2F;10&#x2F;disambiguating-arm&#x2F;" rel="nofollow">https:&#x2F;&#x2F;nickdesaulniers.github.io&#x2F;blog&#x2F;2023&#x2F;03&#x2F;10&#x2F;disambigua...</a>