TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Tencent's 'Hunyuan-T1'–The First Mamba-Powered Ultra-Large Model

19 pointsby bananaflag2 months ago

2 comments

ranguna2 months ago
Not sure what they mean by ultra large. Hopefully it doesn't mean it's bigger than 1T parameters, if so, the results look pretty bad because R1 beats this model on a lot of benchmark and R1 is less than 700B parameters.
评论 #43449752 未加载
adultSwim2 months ago
It's neat to see someone scale up an alternative architecture.