Diffsound: Discrete Diffusion Model for Text-to-Sound Generation
51 points
by selimonder
almost 3 years ago
3 comments
notorious-dto
almost 3 years ago
Twiddling around trying to get this to work. Looks exciting :)
pbronez
almost 3 years ago
Very cool. I need to dig around and figure out what the training dataset is. Could be a great way to get some sample fodder.
notorious-dto
almost 3 years ago
There seems to be a missing Python module called "image_synthesis". Does anyone know more about this?