TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

AudioGen: Textually Guided Audio Generation

146 pointsby pierreover 2 years ago

9 comments

solardevover 2 years ago
The last thing you&#x27;ll hear before the AI eats you: <a href="https:&#x2F;&#x2F;felixkreuk.github.io&#x2F;text2audio_arxiv_samples&#x2F;large_32factor_1streams_2048codesPerBook&#x2F;continuous_laughter_and_chuckling.mp3" rel="nofollow">https:&#x2F;&#x2F;felixkreuk.github.io&#x2F;text2audio_arxiv_samples&#x2F;large_...</a>
iamthemonsterover 2 years ago
It would be very interesting indeed to have an ebook reader paired with bluetooth earphones, and it simultaneously feeds the words into this to make an ambient soundtrack, perhaps also choosing music appropriate to the word-choice on the page.
nudpiedoover 2 years ago
That could be another missing piece to videogame generational art, sfx sounds and soon soundtracks.
评论 #33042841 未加载
kevmo314over 2 years ago
The speech samples are really funny. Very Sims-esque.
评论 #33046156 未加载
karmasimidaover 2 years ago
It will be more useful if it can narrate text along with those background effects.
评论 #33041456 未加载
youssefabdelmover 2 years ago
-__- I wish researchers would train a stereo 44.1kHz version...why always 16kHz? I know I know 16kHz saves more compute but come ooooon you&#x27;re Meta
fragmedeover 2 years ago
Text2audio is impressive, but I wanna see dance2audio. Just need a million dollars in funding to pay for cameras and dancers.
fuzzythinkerover 2 years ago
[code] redirects to the same page
评论 #33040882 未加载
uwagarover 2 years ago
s&#x2F;textually&#x2F;sexually<p>i giggled :)