MusicLM: Generating music from text

291 points by georgehill, over 2 years ago

23 comments

georgehill, over 2 years ago
Demo: https://google-research.github.io/seanet/musiclm/examples/
kleiba, over 2 years ago
Note that there's a second thread on the front page right now that links to the examples:
https://news.ycombinator.com/item?id=34541693
Perhaps they could be merged?
TedDoesntTalk, over 2 years ago
I’m writing my second video game (old school arcade style side-shooter) and hope I can use this to generate background chip tune music!
dyno12345, over 2 years ago
I want to do the reverse: pass in some music and have it describe what I'm hearing.
spyder, over 2 years ago
Awesome, this is probably the best one to date, even compared to the recent Riffusion, Jukebox and the older MIDI-generating MuseNet.
The conditioning on humming and whistling examples is especially cool, but too bad they use very common melodies for that, so it's an easier job for the model and harder for us to judge how well it would work on less common melodies.
williamcotton, over 2 years ago
As a musician I don’t see these tools as competing with what I do but enabling me to do so much more. It takes a lot of time and money to create a professional recording for my songs and drum machines and synths just don’t work for Americana. These tools offer the possibility of backing tracks that sound like Willie Nelson’s band from the 70s but at a fraction of the time and effort.
I can’t wait until they get to the point where they’re more composable or auto-accompany given an acoustic guitar and vocal input.
dariosalvi78, over 2 years ago
AI music is the next thing coming, wait for copyright lawsuits to fall like bombs.
I find it fascinating that MidJourney can make a 3D model of my face from a low quality image, rotate it in space, apply it on someone else's body, add coherent shadows and backgrounds, with a very credible result, and yet an AI cannot generate a decent song, which is 1-dimensional and probably has much less internal modelling to care about.
One reason I can think of is that eyes are "integrators" and ears are "differentiators". That is, the human ear is very sensitive to small differences, whereas vision cares more about the ensemble. I don't know, but I think that AI music will come one day. It may not be as great as human music, but it will suffice for, say, adding background music to your startup's cheap marketing ad.
woolion, over 2 years ago
I don't really understand why this approach is pushed for music. You can overpaint an image, but you can't do that with a song. Cutting an image to reintroduce coherence is easy too. For a song you need MIDI, or another symbolic representation. That was the approach of pop2piano (unfortunately it is limited to covers, not generating from scratch). And even if a song generated this way is OK, listening to half an hour full of AI mistakes is really tiring. With a symbolic representation you could at least fix the mistakes if there is one good output.
TaupeRanger, over 2 years ago
Yet another cherry-picked attempt at music gen with a "demo" page that only contains the outputs that happen to not sound like incoherent noise. There's a reason ChatGPT and Midjourney are so popular, but no music-gen tools have even come close: you can actually create stuff with them that is useful and/or enjoyable. Good music gen is much harder, and the reasons for this (still unclear) are pretty important to the future of AI, imo.
xp84, over 2 years ago
They said they're not going to allow people to use it based on fear of plagiarism accusations / music industry lawsuits. Ugh, typical.
I want someone to train it on public domain music. Kind of like the YouTube Audio Library but I assume that's not exactly the right license for this. But with sufficient effort someone could make a lot of recordings of public domain music for this purpose and build something that the RIAA thugs couldn't actually touch.
pkchv, over 2 years ago
I love the current rate of progress in generative music ML research. The Riffusion model ( https://www.riffusion.com/about ) was trending here just a month ago. Pity Google never shares the actual models, would love to hear artists go wild with it.
btw, some of you might find this interesting, a diffusion-based / generative concept album: https://wielu.bandcamp.com/album/vectorstep-ep
gschoeni, over 2 years ago
Any plans on releasing the actual audio data and not just a csv with links to YouTube IDs?
Also maybe a silly question, but what are the legal ramifications of downloading these YouTube videos and training on them yourself? Google must have some rights, but what about people outside of Google?
duckington, over 2 years ago
I wonder if this technology will eventually revolutionize music the same way synthesizers did. Or at least lead to music and effects/filters that are simply not possible with current DAWs and plugins.
Custom generation of samples from text alone seems revolutionary.
threevox, over 2 years ago
Google: kings of releasing papers but never shipping anything
andjelam990, over 2 years ago
Interesting concept! I wonder how the hurdles around copyright will be solved.
eddsh1994, over 2 years ago
Where can I get the MusicCaps dataset? I can't seem to find it.
guyisra, over 2 years ago
Yet another ClosedAI research project with no intention of releasing...
What a joke.
akie, over 2 years ago
I hate how good this is.
FeepingCreature, over 2 years ago
Weights when though. :-(
Gertie01, over 2 years ago
Can't wait to generate infinite music.
frinnylee, over 2 years ago
Does anyone know how to reach out to the team?
winrid, over 2 years ago
It would be cool to have a music service that is trained on your music playlist and just generates music you might like.
Also, is this technically a Vocaloid? :)
bulbosaur123, over 2 years ago
I'm absolutely flabbergasted by the rapid progression of music-related machine learning models. I've always seen music as the "final frontier". With images you can have tiny errors and noise that isn't that noticeable to the human eye, but with music everything has to be impeccable and if a note is slightly off, you instantly hear it. Machine learning models that are able to create outstanding music will, imo, mark the end-game of creative AI.
Anyone who wants to share their experiments with MusicLM, feel free to join the community fan Discord and subreddit:
https://reddit.com/r/MusicLM/
https://discord.gg/pjVcsyfCJR
BTW, I really hope this gets integrated into software like Ableton, similarly to how image generators are slowly getting integrated into Adobe Creative Suite.