TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Binary embeddings maintain surprisingly high performance

1 点作者 oatsandsugar6 个月前

1 comment

oatsandsugar6 个月前
They trained a model to create embeddings that were a 1024 dimension vector, with each vector being a floating point with 32 bits.<p>This gave them a baseline performance of 100% with an embedding size of 4,096 bytes.<p>They then experimented with lopping off the second half of the embedding, leaving 512 dimensions, at 2048 bytes.<p>They also experimented with just flattening each dimension to 1 bit, 0 or 1 (0 for negative, 1 for positive), reducing the size of the embedding to a minuscule 128 bytes.<p>Counterintuitively, the &quot;binary&quot; simplification was not only way smaller, but ended up being slightly more performant (96.46% c.f. 95.22%).<p>This result is wild to me.