It's incredible how far we've come with quantization and parameter-efficient fine-tuning a la QLora. It's legitimately possible to fine-tune multi-billion parameter models on Google Colab or a consumer GPU. I can see how something like this service can be useful to simplify things for those that don't want to deal directly with hardware, but if you do have your own hardware it's amazing how much more you can do with it than before.