TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Symmetric Power Transformers

18 pointsby _hark9 months ago

4 comments

brrrrrm9 months ago
So, noticing that linearized models have tiny KV caches <i>ahem</i> i mean state spaces, this approach tries to increase their size along the embedding dimension. Increasing this enormously by applying a different softmax (which is compatible with the expanding tensor product) yields a very symmetric mathematical structure that can be exploited to recover some efficiency.<p>Is that right?
评论 #41280381 未加载
userbinator9 months ago
Glanced at the title and clicked, expecting this to be EE related.
d110af5ccf9 months ago
Formatted like a formal academic publication. No way (that I can tell) to grab a pdf. Comes across as a blog masquerading as academic literature to me. Am I wrong? Did I miss something and there&#x27;s an offline version available?<p>Pages served up over http are ephemeral. An absolutely essential part of formal academic literature is the archival aspect - self contained, immutable, and referenceable in an unambiguous manner.<p>There&#x27;s also an immediate practical aspect for me. I will likely never get around to reading this because I will forget it exists because my &quot;reading list&quot; consists of a pile of pdf files.
kazinator9 months ago
I almost clicked on this, thinking it would be an electrical engineering topic; good thing I read the domain name.