nice writeup, but i feel that for most people, the software side of training models should be more interesting and accessible.<p>for one, "full" gpu utilization, one or many, remains an open topic in training workflows. spending efforts towards that, while renting from cloud, is a more accessible and fruitful to me than to finetune for marginal improvements.<p>this course was a nice source of inspiration - <a href="https://efficientml.ai/" rel="nofollow">https://efficientml.ai/</a> - and i highly recommend looking into this to see what to do next with whatever hardware you have to work with.