I'll be excited to see an sd.cpp in the near future.

Gerganov's projects have been a boon for inference accessibility. I really admire their spirit: fast inference "with no extra dependencies".

I would be interested in how CLIP performs after quantization. This should be relatively simple to test, with ImageNet zero-shot top-1 accuracy as one example metric.
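
To make the metric concrete, here is a rough sketch of zero-shot top-1 accuracy in plain Python. Everything in it is an assumption for illustration: the helper names (`normalize`, `fake_quantize`, `zero_shot_top1`) are hypothetical, the embeddings are toy vectors rather than real CLIP outputs, and `fake_quantize` rounds the embeddings themselves as a stand-in. A real test would quantize the model weights, run both encoders on ImageNet, and compare the two accuracies.

```python
import math


def normalize(v):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n else list(v)


def fake_quantize(v, bits=8):
    """Symmetric round-trip quantization of one vector (illustrative stand-in,
    not how any particular library quantizes model weights)."""
    qmax = 2 ** (bits - 1) - 1
    peak = max(abs(x) for x in v)
    if peak == 0:
        return list(v)
    scale = peak / qmax
    return [round(x / scale) * scale for x in v]


def zero_shot_top1(image_embs, text_embs, labels):
    """Fraction of images whose most similar class prompt is the true label."""
    text_embs = [normalize(t) for t in text_embs]
    correct = 0
    for emb, label in zip(image_embs, labels):
        emb = normalize(emb)
        scores = [sum(a * b for a, b in zip(emb, t)) for t in text_embs]
        pred = max(range(len(scores)), key=scores.__getitem__)
        correct += int(pred == label)
    return correct / len(labels)


# Toy data: one text embedding per class, one image embedding per sample.
text_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
image_embs = [[0.9, 0.1, 0.0], [0.2, 0.8, 0.1], [0.1, 0.2, 0.7], [0.6, 0.5, 0.0]]
labels = [0, 1, 2, 1]

baseline = zero_shot_top1(image_embs, text_embs, labels)
quantized = zero_shot_top1(
    [fake_quantize(e, bits=4) for e in image_embs],
    [fake_quantize(t, bits=4) for t in text_embs],
    labels,
)
print("full precision top-1:", baseline)
print("4-bit round-trip top-1:", quantized)
```

The interesting number is the gap between the two accuracies, measured on the same images and prompts for each quantization level.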