9 points by asparagui almost 5 years ago

1 comment

cinntaile almost 5 years ago

Under Figure 1 it says "Comparisons are normalized by overall training time regardless of system size, which ranges from 8 to 4096 chips. Taller bars are better."

Does this really make sense? The new TPU should have lots of chips and therefore finish training faster, which would make comparing like this kind of pointless? Am I misunderstanding something here?

Google breaks AI performance records in MLPerf using TPUv4

1 comment
