Home 24h Top Newest Best Ask Show Jobs

Back to Profile

Submissions by Gcam

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

Home

Home Newest Best Ask Show Jobs

Resources

HackerNews API Original HackerNews Next.js

© 2025 TechEcho. All rights reserved.

1

From GPT-4 to Mistral 7B, there is a 300x range in the cost of LLM inference

2 pointsby Gcamover 1 year ago

2

Show HN: LLM Benchmarks Leaderboard with 60 model and API host combinations

3 pointsby Gcamover 1 year ago

3

Mistral API reduces time to first token by 10x (only place for Mistral Medium)

4 pointsby Gcamover 1 year ago

4

240 Tokens/s achieved by Groq's custom chips on Lama 2 Chat (70B)

5 pointsby Gcamover 1 year ago

5

New GPT-4 Turbo (0125 Preview) slightly faster per initial benchmarks

2 pointsby Gcamover 1 year ago

← Previous