TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Jlama (Java) outperforms llama.cpp in F32 Llama 7B Model

7 pointsby tjakealmost 2 years ago

3 comments

syllogisticalmost 2 years ago
Huh, yeah it repros. Java is faster 159s vs 203s for the 256 tokens on my intel i9 12 gen
评论 #37122594 未加载
version_fivealmost 2 years ago
Where does the performance difference come from? And in what kind of processor & gpu? I didn't even know llama.cpp had a 32 bit option. For now I'm pretty suspicious it's a fair comparison.
评论 #37121843 未加载
tjakealmost 2 years ago
GH: <a href="https:&#x2F;&#x2F;github.com&#x2F;tjake&#x2F;Jlama">https:&#x2F;&#x2F;github.com&#x2F;tjake&#x2F;Jlama</a>