3 pointsby lawrencechenover 2 years ago

1 comment

nmfisherover 2 years ago

I'm intrigued to see if anyone can squeeze out similar quality with a smaller dataset (Microsoft's implementation was trained on 60,000 hours apparently).<p>Not that that's impossible to get your hands on nowadays, but it still takes quite a long time to train on decent (though admittedly not extremely high-end) hardware.

VALL-E unoffical implementation (text to speech synthesis)

1 comment

VALL-E unoffical implementation (text to speech synthesis)

1 comment