Generating MIDI melody from lyrics using LSTM-GANs

120 points by groar about 5 years ago

5 comments

gwern about 5 years ago
> Q5 In the case of Markov, etc., I understand that the generation may be not very conformant to the style learnt, unless using a high order Markov but with the risk of recopying entire sequences from the corpus and thus plagiat. But, in the case of a RNN-based architecture [9], what is the rationale?
>
> A5: As mentioned before, RNN does the similar work as LSTM in our work. But without including a discriminator, it only learns transmission probability between adjacent notes, but does not promise that generated sequences look like real ones.

Come on, guys, that's just not true. You do *not* need an adversarial loss to get good quality melodies. Look at Sturm's char-RNN on ABC notation, or OpenAI's MuseNet, or a bunch of Project Magenta work, or my own GPT-2 ABC music (MIDI in progress): https://www.gwern.net/GPT-2-music Or for that matter, any generative model trained with a non-adversarial loss (anything using GPT-2 for example).

In fact, generally, everyone *avoids* GANs for sequence generation because they work so badly compared to regular likelihood training... (Just at a skim, their 'baseline' is pretty suspicious. I'd expect an ablation for the GAN, not comparing their 400-unit LSTM to... a 100-unit LSTM https://www.aclweb.org/anthology/N19-4015.pdf ? Really?)
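For anyone who wants to see the Markov trade-off from Q5 concretely, here is a minimal sketch in plain Python. It is not from the paper or from any of the projects linked above: the toy corpus, the function names, and the "longest copied run" measure are all made up for illustration. A low-order chain only sees adjacent notes, while a high-order chain on a small corpus tends to replay the corpus verbatim.

```python
import random
from collections import defaultdict

def train_markov(notes, order):
    """Map each context of `order` notes to the list of notes that followed it."""
    table = defaultdict(list)
    for i in range(len(notes) - order):
        context = tuple(notes[i:i + order])
        table[context].append(notes[i + order])
    return table

def generate(table, seed, length, rng):
    """Extend `seed` by sampling a successor of the latest context each step."""
    out = list(seed)
    order = len(seed)
    while len(out) < length:
        successors = table.get(tuple(out[-order:]))
        if not successors:  # context never continued in the corpus
            break
        out.append(rng.choice(successors))
    return out

def longest_copied_run(generated, corpus):
    """Length of the longest contiguous run of `generated` found verbatim in `corpus`."""
    best = 0
    for i in range(len(generated)):
        for j in range(len(corpus)):
            k = 0
            while (i + k < len(generated) and j + k < len(corpus)
                   and generated[i + k] == corpus[j + k]):
                k += 1
            best = max(best, k)
    return best

# Hypothetical toy corpus of pitch names, invented for this example.
corpus = "C D E C C D E C E F G E F G G A G F E C".split()
rng = random.Random(0)
for order in (1, 5):
    table = train_markov(corpus, order)
    melody = generate(table, corpus[:order], 16, rng)
    print(order, longest_copied_run(melody, corpus), " ".join(melody))
```

In this toy corpus every 5-note context is unique, so the order-5 chain has exactly one continuation at each step and simply recopies its training data, which is the plagiarism risk the reviewer describes. It says nothing either way about whether a discriminator is needed for RNNs.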
thomasfl about 5 years ago
Oh no, they're ruining popular music as we know it. Anybody can just push some buttons and generate the next hit song. All you need is some artificial lyrics, an artificial melody, and artificial vocals (Yamaha Vocaloid), on top of a beat bought on the net.
dimmuborgir about 5 years ago
Judging by the four provided melodies: 1) the notes have very little rhythmic variation. 2) the melodies don't seem to have any concept of metre or metric accent.
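Both observations can be roughly quantified. The sketch below is one hypothetical way to do it, not something the paper reports: it assumes the melody has already been reduced to lists of note durations and onsets measured in beats, and the two measures (duration entropy, share of onsets on strong beats in 4/4) are crude proxies chosen for illustration.

```python
import math
from collections import Counter

def duration_entropy(durations):
    """Shannon entropy in bits of the note-length distribution.

    0.0 means every note has the same duration (no rhythmic variation).
    """
    counts = Counter(durations)
    total = len(durations)
    return sum((c / total) * math.log2(total / c) for c in counts.values())

def strong_beat_ratio(onsets, beats_per_bar=4):
    """Fraction of onsets (in beats) landing on beat 1 or the mid-bar beat.

    Assumes a simple metre where those two positions carry the accent.
    """
    strong = {0, beats_per_bar / 2}
    hits = sum(1 for t in onsets if (t % beats_per_bar) in strong)
    return hits / len(onsets)

# Hypothetical melodies, durations and onsets in beats.
flat_durations = [1.0] * 8
varied_durations = [0.5, 0.5, 1.0, 2.0, 0.5, 0.5, 1.0, 2.0]
print(duration_entropy(flat_durations), duration_entropy(varied_durations))
print(strong_beat_ratio([0, 1, 2, 3, 4, 5, 6, 7]))
```

A melody of identical note lengths scores zero entropy, and a strong-beat ratio near what uniform placement would give suggests the onsets ignore the bar structure. Neither number captures accent through pitch or dynamics.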
NPMaxwell about 5 years ago
Music, like weaving, is a predecessor in using algorithms to do work. For hundreds of years, music theorists have been codifying the algorithms, or creating new algorithms, that create good music (for particular definitions of good). Like other code instantiated in a network, this code is more tailored to specific prior states and is less available to analysis of its details than prior efforts.
tabtab about 5 years ago
I always wanted a "demo engine" whereby one feeds in the melody and chord names, and then one or more styles. The AI would then use pattern matching to make a fuller score in the chosen style(s). The output could be MIDI and/or an audio file (such as .WAV). Bonus points for vocals if given lyrics. I could make Elvis diet parodies: "Ain't nothing but a round dog..." Band-in-a-Box software sort of does this, but lacks realism in my opinion.