TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Back to Profile
Submissions by Gcam
1
From GPT-4 to Mistral 7B, there is a 300x range in the cost of LLM inference
2 points
by
Gcam
over 1 year ago
no comments
2
Show HN: LLM Benchmarks Leaderboard with 60 model and API host combinations
3 points
by
Gcam
over 1 year ago
1 comment
3
Mistral API reduces time to first token by 10x (only place for Mistral Medium)
4 points
by
Gcam
over 1 year ago
no comments
4
240 Tokens/s achieved by Groq's custom chips on Lama 2 Chat (70B)
5 points
by
Gcam
over 1 year ago
no comments
5
New GPT-4 Turbo (0125 Preview) slightly faster per initial benchmarks
2 points
by
Gcam
over 1 year ago
no comments
← Previous
Next →