A high-RAM server is (at least) an order of magnitude cheaper than a GPU compute server. Why aren't we seeing RAM-heavy servers running inference? Is it just that the RAM bandwidth isn't high enough, or is there another bottleneck that makes them unsuitable despite the cost savings?
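
For context, here's the rough back-of-envelope I'm working from: autoregressive decoding is memory-bound, since every model weight has to be streamed through the memory subsystem once per generated token, so single-stream throughput is capped near bandwidth divided by model size. The model size and bandwidth figures below are illustrative assumptions on my part, not benchmarks.

```python
# Back-of-envelope decode throughput for a memory-bound model:
# tokens/sec is capped at roughly bandwidth / model_size_in_bytes,
# because each token requires reading every weight once.
# All figures below are assumed for illustration, not measured.

GIB = 1024**3

def max_tokens_per_sec(model_bytes: float, bandwidth_bytes_per_sec: float) -> float:
    """Upper bound on single-stream decode throughput when memory-bound."""
    return bandwidth_bytes_per_sec / model_bytes

# Assumed: ~70B-parameter model quantized to 8 bits -> ~70 GiB of weights.
model_bytes = 70 * GIB

# Assumed peak bandwidths: a 12-channel DDR5 server vs. HBM3 on a GPU.
configs = {
    "DDR5 server, 12 channels (~460 GB/s, assumed)": 460 * GIB,
    "GPU with HBM3 (~3350 GB/s, assumed)": 3350 * GIB,
}

for name, bandwidth in configs.items():
    print(f"{name}: ~{max_tokens_per_sec(model_bytes, bandwidth):.1f} tokens/sec")
```

By that math the DDR5 box lands around single-digit tokens/sec on a 70B model while HBM is roughly an order of magnitude faster, which is what makes me suspect bandwidth is the whole story, but I'd like to know if there's something else.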