TE
TechEcho
AccueilTop 24hRécentsMeilleursQuestionsPrésentationsEmplois
GitHubTwitter
Accueil

TechEcho

Une plateforme d'actualités technologiques construite avec Next.js, fournissant des nouvelles et discussions technologiques mondiales.

GitHubTwitter

Accueil

AccueilRécentsMeilleursQuestionsPrésentationsEmplois

Ressources

HackerNews APIHackerNews OriginalNext.js

© 2025 TechEcho. Tous droits réservés.

Ask HN: Hardware for 1k RPS?

5 pointspar gskyil y a 2 jours
I ran an uncensored model on a CPU server. as expected its dead slow (min or two per query).<p>What kinda hardware (GPU) do i need to serve 1k RPS?<p>I could not find APIs for uncensored models that kinda forced me to run locally

2 comments

eddythompson80il y a 2 jours
Depends on your model size and how many of it can fit in memory. Multiply the size by 1k and divide by the memory capacity of the hardware for a rough ballpark.
barnabeeil y a 2 jours
<a href="https:&#x2F;&#x2F;venice.ai" rel="nofollow">https:&#x2F;&#x2F;venice.ai</a> claim to offer uncensored models (I’ve not tested that claim)
评论 #44142661 未加载