LLaMA 3 70B Llamafiles

51 points by birriel about 1 year ago

3 comments

aappleby about 1 year ago
What's the cheapest hardware setup that can run a 70B model at tolerably interactive rates? (say 10 characters a second)
skavi about 1 year ago
What’s the relationship between llamafile and llama.cpp these days? IIRC, it was originally llama.cpp compiled with Cosmopolitan Libc. I’ve read more recently of optimizations in llamafile with comparisons against llama.cpp. Are those pushed upstream? Have the projects diverged?
d-z-m about 1 year ago
8B llamafile when? :^)