An Empirical Study of Mamba-Based Language Models
43 points, by panabee, 11 months ago
1 comment
jiggawatts
11 months ago
What’s the largest Mamba model that has been trained so far?

Seems like it scales better than transformers, but this would only be really obvious at parameter counts far in excess of the experiments in this paper.