Every time I try out one of these (relatively) smaller diffusion-based text-to-image networks, it only makes me want DALL-E 2 access that much more. I tried "two corgis riding a dragon" and... well, I will have nightmares about what it generated.