Diffsound: Discrete Diffusion Model for Text-to-Sound Generation

51 points by selimonder, almost 3 years ago

3 comments

notorious-dto, almost 3 years ago
Twiddling around trying to get this to work. Looks exciting :)
pbronez, almost 3 years ago
Very cool. Need to dig around and figure out what the training dataset is. Could be a great way to get some sample fodder.
notorious-dto, almost 3 years ago
There seems to be a missing Python module called "image_synthesis". Does anyone know more about this?