Tencent's 'Hunyuan-T1': The First Mamba-Powered Ultra-Large Model
19 points
by
bananaflag
about 2 months ago
2 comments
ranguna
about 2 months ago
Not sure what they mean by "ultra-large". Hopefully it doesn't mean it's bigger than 1T parameters. If it does, the results look pretty bad, because R1 beats this model on a lot of benchmarks and R1 has fewer than 700B parameters.
adultSwim
about 2 months ago
It's neat to see someone scale up an alternative architecture.