TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Vision Transformers Need Registers

94 点作者 felineflock15 天前

4 条评论

refulgentis15 天前
[2023]<p>Previously:<p>2 years ago: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=37794996">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=37794996</a>.<p>1 year ago: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40329675">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40329675</a>
Ameo15 天前
Extremely cool!<p>It&#x27;s interesting and honestly encouraging that this kind of thing can be discovered and understood using just &quot;simple linear methods&quot; and high-level analysis of patterns in layer activations.
Scene_Cast215 天前
So basically multiple CLS tokens.<p>Fwiw, I tried multiple global tokens in my chess neural net and didn&#x27;t see any uplift compared to my baseline of just having one.
评论 #43824224 未加载
bigdict15 天前
Has this been used widely since?
评论 #43824013 未加载
评论 #43824085 未加载
评论 #43825100 未加载
评论 #43824294 未加载