51 points by selimonder almost 3 years ago

3 comments

Twiddling around trying to get this to work. Looks exciting :)

pbronez almost 3 years ago

Very cool. Need to dig around and figure out what the training dataset is. Could be a great way to get some sample fodder.

notorious-dto almost 3 years ago

There seems to be a missing Python module called "image_synthesis". Does anyone know more about this?

Diffsound: Discrete Diffusion Model for Text-to-Sound Generation