You may have seen my recent post about [Chatistics: a Python tool to parse your Messenger/Hangouts/WhatsApp/Telegram chat logs into DataFrames](<a href="https://news.ycombinator.com/item?id=22069699" rel="nofollow">https://news.ycombinator.com/item?id=22069699</a>).<p>This notebook uses the exported chat logs to train a simple GPT/GPT2 conversational model! It uses Google Colab, a notebook platform that allows you to train complex models online for free.<p>The approach is super simple: it takes all your chat logs, turns them into this format:<p>> <speaker1> Hi<p>> <speaker2> Hey - how are you?<p>> <speaker1> Great, thanks!<p>> ...<p>...then simply trains a GPT model on this corpus. In practice, I found that the default parameters (including using GPT and not GPT2) give the best resources for this setup.<p>This notebook will be part of our workshop "Meet your Artificial Self" happening this Saturday at AMLD 2020 in Lausanne, Switzerland: <a href="https://appliedmldays.org/workshops/meet-your-artificial-self-generate-text-that-sounds-like-you" rel="nofollow">https://appliedmldays.org/workshops/meet-your-artificial-sel...</a><p>Feedback is welcome! :D
I got a bit tricked by the title here on HN. Maybe we can replace `talk` with `write`? Thought this was something that could learn how I speak and could generate sound from that, but seems to just be able written language, which is not nearly as interesting (for me).
I'm disappointed that this is about typed text rather than actual talking - I had hoped that training something that talked like me might assist technology vendors in actually creating voice recognition technology that works for me.<p>And yes my problems with voice recognition are probably due to my Scottish accent.... ;-)
I've been playing with training different sizes[0] of gpt on my own chat data precisely for this reason.<p>Coincidentally, today I was even planning to publish my last post and notebook for training gpt2-1.5b and then chatting to oneself with the model. I left it for tomorrow though.. Maybe a mistake.<p>There is quite a lot you can do and talking to my trained model which is responding to me as me can be real weird at times. It's definitely the most engaged Ive been with gpt while talking to myself.<p>Having said that you seem to train here on very little. Still - cool demo.<p>[0] <a href="https://svilentodorov.xyz/blog/gpt-345M-finetune/" rel="nofollow">https://svilentodorov.xyz/blog/gpt-345M-finetune/</a>
This is cool - might be worth training a simple discriminator model to identify <i>your</i> utterances, and then you can use the plug-and-play language model (PPLM - <a href="https://github.com/huggingface/transformers/blob/master/examples/pplm/run_pplm.py" rel="nofollow">https://github.com/huggingface/transformers/blob/master/exam...</a>) to generate utterances modeling a specific speaker without special tokens. Could also take less time to fine-tune.
I totally missed that Lyrebird was acquired : <a href="https://news.ycombinator.com/item?id=21006405" rel="nofollow">https://news.ycombinator.com/item?id=21006405</a>
My curiosity is tempered by the fact that I've seen this episode of Black Mirror before... :)<p><a href="https://en.wikipedia.org/wiki/Be_Right_Back" rel="nofollow">https://en.wikipedia.org/wiki/Be_Right_Back</a>
A computer trained to talk like me would spend a lot of time swearing and whining about how it can't take it anymore, which I admit would be pretty funny.
This is part of a workshop series[0]. Does anyone know if the talks/shops will be recorded?<p>[0]<a href="https://appliedmldays.org/workshops" rel="nofollow">https://appliedmldays.org/workshops</a>
I’ve never used PyTorch before... is this running within my local machine, or is there some API in here that’s also sending data to Google to also train their models? Asking a privacy point-of-view..
throwaway, duh.<p>When I was a teenager I wrote a very graphic and very disturbing work of fiction that was archived on a popular erotica text website.. I have had anxiety for many years now that eventually someone will glue the authorship of that story to my identity.. If people in my real life discover my fantasies from years back because of my writing signature, I do not want to guess where that will leave me.. I am not looking forward to the future!!
Oh, oobee doo<p>I wanna be like you<p>I wanna walk like you, talk like you, too<p>You'll see it's true someone like me<p>Can learn to be like someone like you