TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Karatsuba Matrix Multiplication and Its Efficient Hardware Implementations

139 pointsby emacs283 months ago

5 comments

freetonik3 months ago
Random anecdote: 15 years ago I was starting an online community for computer science students, and needed to come up with a name. I made a survey for the community members to vote on options, with some boring ones like &quot;CS Hub&quot; and stuff; one of the options was &quot;Karatsuba&quot;: I had just learned about the Karatsuba multiplication algorithm, and somehow got this idea into my head that his last name sounds cool and unique, and my online community (later grown into a ed-tech startup) should be named after this Russian mathematician.<p>Anatoly Karatsuba himself was already dead (he died in 2008). I emailed his daughter Yekaterina (who is also a mathematician, btw) asking for a permission to use their last name. She agreed, but asked to be extra careful about potential implicit affiliations, i.e. to be clear that the content has nothing to do with her father&#x27;s research.<p>She also expressed an opinion that in the field of mathematics and computation, at least in Russia, actual researchers are rarely involved in writing textbooks, and the textbooks used in universities often contain conflicting or even wrong information about the authorship of research.<p>In the end a different name was chosen for the project.
dvasdekis3 months ago
Would this work have the potential to speed up encoding&#x2F;decoding of the PAR2 format[0]? This format is widely used to protect datasets against bitrot and small losses, but is held back because of the significant compute overhead when dealing with large datasets.<p>[0] <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Parchive" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Parchive</a>
评论 #43412347 未加载
oofbey3 months ago
They&#x27;re proposing &quot;new hardware architectures&quot; to take advantage of this idea. Anybody with a background in GPU floating point math comment on how realistic this is?
评论 #43406893 未加载
评论 #43407002 未加载
评论 #43406855 未加载
godsmokescrack3 months ago
The basic idea of reducing 4 multiplications to 3 multiplications<p>(ax + b)(cx + d) = acx^2 + [(a + b)(c + d) - ac - bd]x + bd<p>holds pretty generally; there isn&#x27;t any new math or algorithm here that I can see. Their own complexity analysis (eqns. 7 and 8) shows this performs about the same as using Karatsuba multiplication on the entries of the matrices (instead of on the matrices themselves).
评论 #43417314 未加载
ash-ali3 months ago
the govy uses specialized hardware that isn&#x27;t sold on the market right? would something like this be useful in developing said hardware&gt;?
评论 #43407422 未加载
评论 #43449323 未加载