TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

DeepSeek-V2: A Strong, Economical, and Efficient Moe Language Model

14 pointsby jasondaviesabout 1 year ago

2 comments

unravellerabout 1 year ago
It's claiming to be llama3-70B tier in strength, 3x cheaper, 3-5x faster than it due to only having 21B out of 400B+ activated at any one time. With L3-70B normally costing <$1/Million.
bearjawsabout 1 year ago
It&#x27;s performance at 21B parameters is very impressive.<p>I also like using something between 13 and 70B parameters, since it will run on a 32GB MacBook Pro easily.
评论 #40282145 未加载