
Using a Markov chain to generate readable nonsense with 20 lines of Python

230 points by benhoyt over 1 year ago

25 comments

dahart over 1 year ago
See also Mark V Shaney: https://en.wikipedia.org/wiki/Mark_V._Shaney

This was a class assignment in college; we had a lot of fun with it, and one of my classmates, the brilliant Alyosha Efros, decided to apply the exact same technique to images instead of text. It turned into a paper that revitalized texture synthesis in the Siggraph community. The most interesting part about it (in my opinion) is that he ran it on images of text, and it produces images of readable text nonsense! With the right window size, perhaps there's a nonzero possibility of being able to produce the same text either way. This always struck me as very meta and makes me wonder if there are ways we could go in reverse with image processing (or other types of data); if there's a latent underlying representation for the information contained in image content that is much smaller and more efficient to work with. Neural networks might or might not be providing evidence.

https://people.eecs.berkeley.edu/~efros/research/EfrosLeung.html
abetusk over 1 year ago
As many know and point out, this idea is now very old (40+ years?). The big problem is that it creates "word salad" [0]. In the generative art and procgen community, the term "10,000 bowls of oatmeal" problem has been used [1].

Taking a step back, this is a perennial problem, even with AI old and new. I've heard that the early (good) chess bots, after Deep Blue, had a problem of only being locally context sensitive and being easily fooled by longer range attack planning. We even see it, to a certain extent, in LLMs where they forget or contradict themselves with what was just said. I have no doubt that LLMs will just keep getting better but, as a snapshot of right now, you can see shades of this "word salad" problem, just a few steps removed to become a "logic salad" problem.

[0] https://en.wikipedia.org/wiki/Word_salad

[1] https://galaxykate0.tumblr.com/post/139774965871/so-you-want-to-build-a-generator
westurner over 1 year ago
>> Making the prefix shorter tends to produce less coherent prose; making it longer tends to reproduce the input text verbatim. For English text, using two words to select a third is a good compromise; it seems to recreate the flavor of the input while adding its own whimsical touch.

Next word prediction; vector databases:

Vector database: https://en.wikipedia.org/wiki/Vector_database

https://news.ycombinator.com/item?id=37812219
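For concreteness, here is a minimal Python sketch of the two-word-prefix scheme quoted above (not the article's actual code; the input file name is illustrative):

    import random
    from collections import defaultdict

    def build_chain(text):
        # Map each (word1, word2) prefix to the list of words observed after it.
        words = text.split()
        chain = defaultdict(list)
        for w1, w2, w3 in zip(words, words[1:], words[2:]):
            chain[(w1, w2)].append(w3)
        return chain

    def generate(chain, length=60):
        prefix = random.choice(list(chain))
        out = list(prefix)
        for _ in range(length):
            followers = chain.get(prefix)
            if not followers:            # dead end: this prefix never continues
                break
            nxt = random.choice(followers)
            out.append(nxt)
            prefix = (prefix[1], nxt)
        return " ".join(out)

    text = open("input.txt").read()      # any plain-text corpus
    print(generate(build_chain(text)))

Because duplicates are kept in the follower lists, more frequent continuations are sampled more often, which is what keeps the output in the flavor of the input.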
ZetaH over 1 year ago
I was doing a similar experiment recently to generate random names that sound like names from a specific language.

I was breaking the list of names apart into 3- and 2-letter parts, marking which fragments are from the start, middle and end.

To generate the words I started from a random one from the start fragments, then continued with a random one from the middle fragments that starts with the letter that the previous one ended with, and similarly ended it with one from the end fragments. Some examples:

Spanish names:

Armusa Vantara Modria

German names:

Ven Marwar, Ger Naroff, Vort Kraldent, Görn Henter, Urg Wicher, Wan Ehranus, Eck Hayazin, Wert Biewin, Rein Relberid

Catalan:

Pallava Ecorus Sangana Ginavari Telamita Exorxió

Hungarian cities:

Jószög Alszeny Hernafő Garnáza Ragytúr Hidácska Mezécs

(edit: formatting)
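A rough Python sketch of that fragment approach (my own reading of the description: fixed 3-letter fragments only, the overlap handling is a guess, and the sample name list is made up):

    import random

    def fragments(name, size=3):
        # Split a name into chunks of `size` letters; the last chunk may be shorter.
        return [name[i:i + size] for i in range(0, len(name), size)]

    def build_parts(names):
        # Collect start, middle and end fragments separately.
        starts, middles, ends = [], [], []
        for name in names:
            parts = fragments(name.lower())
            if len(parts) < 2:
                continue
            starts.append(parts[0])
            middles.extend(parts[1:-1])
            ends.append(parts[-1])
        return starts, middles, ends

    def pick(frags, last):
        # Prefer a fragment that begins with the letter the previous fragment ended with.
        matching = [f for f in frags if f.startswith(last)]
        return random.choice(matching or frags)

    def make_name(starts, middles, ends):
        out = random.choice(starts)
        if middles:
            out += pick(middles, out[-1])
        out += pick(ends, out[-1])
        return out.capitalize()

    # Made-up input; any list of real names from one language would do.
    sample = ["armando", "mariana", "rodrigo", "valeria", "santiago", "catalina"]
    parts = build_parts(sample)
    print(" ".join(make_name(*parts) for _ in range(5)))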
alejoar over 1 year ago
We had a bot that would randomly say things in our IRC channel 15 years ago that worked like this. You could also mention it to prompt it to reply.

Every message was added to its knowledge base and it would say random but hilarious stuff made up from all the nonsense we used to talk about.

Good times.
ltr_ over 1 year ago
When I read about Markov chains and hallucination in LLMs, the first thing that comes to my mind is Reggie Watts' performances [0]. It also reminds me that I used to do the same thing when I was bored in the classroom, but in my paper notebooks: just nonsense that seems plausible. It was like entering a meditative state; it felt good. When I discovered Watts I thought to myself, "What?! I could have done that for a living?"

[0] https://www.youtube.com/watch?v=UXyHf_SpUUI
bradrn over 1 year ago
For a more thorough exploration of Markov text generation, I like this article: https://zompist.com/markov.html
forinti over 1 year ago
I did this once with Perl and a book called Eu by the Brazilian poet Augusto dos Anjos. It spat out verses and I joined a few to create this little thing:

Andam monstros sombrios pela escuridão dos remorsos

Pairando acima dos transeuntes

Maldito seja o gênero humano

Prostituído talvez em desintegrações maravilhosas

Source code: https://alquerubim.blogspot.com/2018/01/gerador-de-augusto-dos-anjos.html
Pompidou over 1 year ago
I implemented one in R this year and fed it religious content. Then I ran the output through text-to-speech and posted it on YouTube: https://youtube.com/playlist?list=PLEB5tND7siCOe7ySYjVOBwR2v9W2efu-r&si=jEkwL39jYBt2ldeZ
hermitcrab over 1 year ago
I created a little program in C++/Qt for using Markov chains to re-write text. But it works on character sequences, rather than words. There are examples and binaries for Windows and Mac:

https://successfulsoftware.net/2019/04/02/bloviate/
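For comparison, a character-sequence chain like the one described here differs from a word-level one only in what counts as a token. A minimal sketch (not Bloviate's actual code; the input file name is illustrative):

    import random
    from collections import defaultdict

    def char_chain(text, order=4):
        # Map each `order`-character window to the characters seen right after it.
        chain = defaultdict(list)
        for i in range(len(text) - order):
            chain[text[i:i + order]].append(text[i + order])
        return chain

    def generate(chain, order=4, length=400):
        state = random.choice(list(chain))
        out = state
        for _ in range(length):
            followers = chain.get(state)
            if not followers:
                break
            out += random.choice(followers)
            state = out[-order:]
        return out

    text = open("input.txt").read()   # any plain-text source
    print(generate(char_chain(text)))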
not2b over 1 year ago
This is exactly the approach I used back in the late 80s; I published a Markov chain text generator on comp.sources.games that digested Usenet posts, built tables and used the previous two words to generate the next word. It was mildly amusing at the time. Someone resurrected it and put it on GitHub, which I'm fine with.
mebassett over 1 year ago
This is old tech, but it had me thinking. Markov chains pick the next token from a random set, giving approximately all possible tokens (from the training set) an equal probability. What if it weighted the probability, say using the inverse vector distance or cosine similarity of the neighbors as a proxy for probability, where the vector embedding came from word2vec? How close would the performance be to a transformer model or even something like LSTM RNNs? (I suppose I'm cheating a bit using word2vec here. I might as well just say I'm using the attention mechanism...)
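One hypothetical way to sketch that weighting idea, assuming a pre-trained word2vec model loaded with gensim (the model path, and the choice to weight by similarity to the last context word, are my assumptions, not the commenter's):

    import random
    from collections import defaultdict

    from gensim.models import KeyedVectors

    # Hypothetical: any pre-trained word2vec file; the path is illustrative.
    kv = KeyedVectors.load_word2vec_format("vectors.bin", binary=True)

    def build_chain(words):
        # Same (two-word prefix -> observed followers) table as a plain Markov chain.
        chain = defaultdict(list)
        for w1, w2, w3 in zip(words, words[1:], words[2:]):
            chain[(w1, w2)].append(w3)
        return chain

    def weighted_next(chain, prefix):
        # Sample the next word, weighting each observed follower by its cosine
        # similarity to the last context word instead of sampling uniformly.
        candidates = chain[prefix]
        last = prefix[-1]
        weights = [
            max(float(kv.similarity(last, c)), 1e-3) if last in kv and c in kv else 1e-3
            for c in candidates
        ]
        return random.choices(candidates, weights=weights, k=1)[0]

Plugged into the usual generation loop, this keeps the chain's structure but biases it toward semantically close continuations; how far that narrows the gap to an LSTM or a transformer is exactly the open question the comment raises.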
Frummy over 1 year ago
As a group project we did Markov chains with some custom algorithmic adjustments, scraping Reddit via the API and using an academic torrent (that data required a lot of cleaning, was orders of magnitude bigger, and was run separately), to simulate posts and comments. In the same project we also implemented a variable-length Markov chain with a tree-based custom context length, although a team member did a matrix-based implementation of custom context length as well through some matrix magic.
heikkilevanto over 1 year ago
I remember as a schoolboy I heard about Markov chains and played a bit with them. Instead of building big n-gram tables, I found a simpler way: start with one n-gram, then scan the source text until you find the next occurrence of it, and take the next letter after it. Output it, append it to your n-gram, dropping its first letter, and repeat until you have a sufficient amount of nonsense text.
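A small sketch of that table-free approach as I read it (where the scan resumes from, and the wrap-around, are guesses; the input file name is illustrative):

    import random

    def scan_generate(source, n=4, length=300):
        # Start from a random n-gram taken from the source text.
        start = random.randrange(len(source) - n)
        gram = source[start:start + n]
        out = gram
        pos = start + n
        for _ in range(length):
            # Scan forward (wrapping around) for the next occurrence of the current n-gram.
            i = source.find(gram, pos)
            if i == -1:
                i = source.find(gram)
            if i == -1 or i + n >= len(source):
                break
            nxt = source[i + n]          # the letter right after the match
            out += nxt
            gram = gram[1:] + nxt        # drop the first letter, append the new one
            pos = i + 1
        return out

    text = open("input.txt").read()      # any plain-text source
    print(scan_generate(text))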
max_ over 1 year ago
I wonder how this would behave if trained on 1 trillion words like the LLMs.

Also, is training a Markov chain cheaper than training neural nets? It would be a great way to cut AI costs if they could be made as effective as neural nets.

There is also an interesting post by Orange Duck. [0]

[0] https://theorangeduck.com/page/17-line-markov-chain
AndruLuvisi over 1 year ago
The first edition of Programming Perl had a script that did this called travesty. You can download it from https://resources.oreilly.com/examples/9780937175644

When I was learning Perl in the '90s, I remember having a lot of fun running this on different manual pages.
weare138 over 1 year ago
https://en.wikipedia.org/wiki/MegaHAL
samsquire over 1 year ago
I had an idea.

What if you trained a Markov chain and got weights for every pair, or sequence of pairs, of a certain length?

Could you also do rotations with this information in vector space? With multiple degrees of freedom?

What if you could do inverse kinematics with Markov chains?

Like robot planning for a goal?
qarl over 1 year ago
https://archive.org/details/ComputerRecreationsMarkovChainer
mdberkey over 1 year ago
One of my first personal projects was making two Discord bots that would have an infinite "conversation" using Markov chains. Feeding them text from Reddit produced some hilarious results.
kwhitefoot over 1 year ago
> using Lewis Carroll's Alice in Wonderland as input. The book is more or less nonsense to begin with,

Has he ever read Alice?
swayvil over 1 year ago
Plausible nonsense of sufficient complexity and consistency as to seem real. It's trivial to generate.

It casts every cherished truth into doubt.
lowbloodsugar over 1 year ago
I'll never forget that a mate implemented a Markov chainer in LambdaMOO and named it "Cute Ickle Baby". He left it in the Living Room, listening to what people said and randomly saying things. Sometimes it said the funniest shit and had us all pissing ourselves laughing.
bugbuddy over 1 year ago
This works pretty well for languages that use a space as a word separator. Now do this for languages that don't, and that have no clear and easy rules for word separation.
raister over 1 year ago
This was posted 50 days ago; how does it reach the front page again? https://news.ycombinator.com/from?site=benhoyt.com