TechEcho
Tencent's 'Hunyuan-T1'–The First Mamba-Powered Ultra-Large Model
19 points by bananaflag, 2 months ago, 2 comments
ranguna
2 months ago
Not sure what they mean by "ultra-large". Hopefully it doesn't mean it's bigger than 1T parameters. If it does, the results look pretty bad, because R1 beats this model on a lot of benchmarks and R1 has fewer than 700B parameters.
adultSwim
2 months ago
It's neat to see someone scale up an alternative architecture.