I was playing around with some different GPUs yesterday and put all of the results here: https://www.tensordock.com/benchmarks<p>I tried a vLLM workload and a ResNet training workload. The H100 consistently outperforms the A100 by about 45% to 80%, but it isn’t that much faster…<p>Which workloads would see the biggest speedup? I’m really not seeing anything like 3x+ on vLLM or simple training workloads.
Oops, just noticed the link isn’t clickable, here you go!
<a href="https://tensordock.com/benchmarks" rel="nofollow">https://tensordock.com/benchmarks</a>