TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Generating music with expressive timing and dynamics

126 点作者 iansimon将近 8 年前

6 条评论

contingo将近 8 年前
It&#x27;s refreshing to hear generated piano music that isn&#x27;t either strictly metrical or entirely freeform, but with patches where you do get a somewhat natural sense of rubato and sensitive dynamic shaping. It&#x27;s sort of convincingly improvisatory. The constantly shifting harmonic idiom is disorienting in a not very pleasant way – the worst kind of Chopin + Ligeti mashup – especially when you raise the temperature. It would be interesting to use period&#x2F;style-specific training sets.<p>To my ears the 5:00 clip does have a larger structure, there are clearly extended passages of building up to and ebbing away from large climaxes, where you get a real sense of sustained intensification, but of course if you follow the detail everything is built up from lots of fleeting and unrelated ideas.
评论 #14668644 未加载
henearkr将近 8 年前
It seems that this model does not have any notion of &quot;cadence&quot; (the punctuation in musical grammar, given by harmony and tonality). The &quot;expressivity&quot; must be correlated to the harmony grammar, else it does not make sense. Unfortunately the samples in the article do not sound very good to me, and I am pretty sure that it is because of that.
kastnerkyle将近 8 年前
This is stunning! Great stuff.<p>Since the input and prediction is a single sequence, did you experiment with beamsearch&#x2F;stochastic beamsearch decoding (maybe with additional diversity criteria)?<p>I found that even simple models (markov chains) got a big diversity boost with a stochastic beamsearch - it might avoid the problems with low temperature repetition that could happen in a standard beamsearch. However, my music models are much, much, (much) worse than this, so my relative improvement might be related to that.<p>Similarly, I am finding really nice results in text (RNN-VAE) with scheduled sampling, it might be worth experimenting with.<p>I am amazed at how good this next-step sampled output is. The above ideas might just hurt the result, I am having a hard time imagining how it could be better.<p>What soundfont&#x2F;midi rendering package is used for this? The piano sound is really rich.<p>Looking forward to hearing what creative things users will do with this model.
评论 #14671694 未加载
评论 #14669056 未加载
DomreiRoam将近 8 年前
Could it mean that you could generate music for games that would follow the action and help build up tension?
评论 #14670000 未加载
the_cat_kittles将近 8 年前
that first example is jaw dropping. its just like what good musicians do when they are noodling. damn. well done! probably the best results i&#x27;ve ever heard for this type of effort.
评论 #14668954 未加载
hakcermani将近 8 年前
This can generate elevator music that will never repeat. I am up for that ! (Just getting into ML with Udacity, Coursera courses. This is just fascinating)
评论 #14669010 未加载
评论 #14670143 未加载