TechEcho

6 comments

wolf550eover 7 years ago

A recent blog post by Vlad Krasnov, author of a bunch of the crypto assembly code in openssl and in golang, about frequency scaling when using AVX-512 making it not worth it: <a href="https://blog.cloudflare.com/on-the-dangers-of-intels-frequency-scaling/" rel="nofollow">https://blog.cloudflare.com/on-the-dangers-of-intels-frequen...</a>He doesn't like the title of the OP and provided links:> Very misleading title. Could just as well name it "accelerate sha256 up to 134x". You need to compare apples to apples. If AVX2 was used in the same way AVX512 is used, the speedup would be 2X at most. Reminds me of two of my papers <a href="https://eprint.iacr.org/2012/371.pdf" rel="nofollow">https://eprint.iacr.org/2012/371.pdf</a> <a href="https://eprint.iacr.org/2012/067.pdf" rel="nofollow">https://eprint.iacr.org/2012/067.pdf</a>(from <a href="https://twitter.com/thecomp1ler/status/940724783804645376" rel="nofollow">https://twitter.com/thecomp1ler/status/940724783804645376</a>)EDIT: Thanks 'delhanty !

评论 #15918268 未加载

评论 #15920036 未加载

eloffover 7 years ago

This is assembly, not pure Go, but it doesn't use CGO which I probably what they mean.Intel Cannon Lake processors will support the SHA instruction extensions (currently available only on Goldmont). It will be interesting to see how that compares with this approach of running 16 SHA computations in parallel. You would be able to get rid of the scheduling overhead of having to first queue up 16 SHA calculations from other threads.

评论 #15918658 未加载

评论 #15918090 未加载

foobarbazetcover 7 years ago

One thing to note is that the benchmark is running on a Skylake Platinum chip which has two AVX512 FMAs.You need a Gold 6000 series and above to see any benefit from AVX512. In most other cases the CPU throttles down some insane amount and there’s no to little benefit.

评论 #15919334 未加载

评论 #15920125 未加载

评论 #15918267 未加载

ComputerGuruover 7 years ago

I blogged about the SHA instruction support in the x86_64 ISA a few months back, it’ll be nice to see it actually happen: <a href="https://neosmart.net/blog/2017/will-amds-ryzen-finally-bring-sha-extensions-to-intels-cpus/" rel="nofollow">https://neosmart.net/blog/2017/will-amds-ryzen-finally-bring...</a>

dragonfaxover 7 years ago

Isn't this the kind of thing that was missing from the "go on different platforms" benchmark a little while back. The intel platform has crazy optimization for encryption algorithms on Inteil, while ARM was severely lacking.

评论 #15918951 未加载

mikebenfieldover 7 years ago

Possibly I'm confused, but in what sense is this "in Pure Go"?

评论 #15917828 未加载

评论 #15917685 未加载

评论 #15917679 未加载

6 comments

wolf550eover 7 years ago

评论 #15918268 未加载

评论 #15920036 未加载

eloffover 7 years ago

评论 #15918658 未加载

评论 #15918090 未加载

foobarbazetcover 7 years ago

评论 #15919334 未加载

评论 #15920125 未加载

评论 #15918267 未加载

ComputerGuruover 7 years ago

dragonfaxover 7 years ago

评论 #15918951 未加载

mikebenfieldover 7 years ago

Possibly I'm confused, but in what sense is this "in Pure Go"?

评论 #15917828 未加载

评论 #15917685 未加载

评论 #15917679 未加载

Show HN: Accelerate SHA256 Computations in Go Using AVX512 instructions

6 comments

Show HN: Accelerate SHA256 Computations in Go Using AVX512 instructions

6 comments