Hi everyone, I hope you like my latest side project!<p>I'm an astronomy student who likes programming in his free time. This time I wanted to write something that handles larger amounts of data. And as I recently came across the Stack Exchange data dump including all questions and answers I had the idea of using them to create Markov Chains for (nearly) every Stack Exchange site.
The website displays the resulting content, which is often surprisingly coherent and entertaing, and allows upvotes/downvotes so that the best questions get to the front page.
And as a bonus, I created a quiz where one can guess which site a random question is based on.<p>If you are interested in my other projects, check out <a href="https://lw1.at" rel="nofollow">https://lw1.at</a>, if you want to see the code, everything is Open Source and can be found here: <a href="https://github.com/Findus23/se-simulator" rel="nofollow">https://github.com/Findus23/se-simulator</a><p>Please excuse the very minimal design, but after writing a lot of Single Page Applications I wanted to go the oposite way and write a website with less than 25KB.
Markov chains are so much fun. They produce believable relevant text that ultimately makes no sense, which is basically a definition of comedy. And they're also super simple to understand and implement. I can have lots of fun without having to do any wild natural language processing.
I have now written a bit more on how to hopefully get this to run locally and how everything works here:<p><a href="https://github.com/Findus23/se-simulator#se-simulator" rel="nofollow">https://github.com/Findus23/se-simulator#se-simulator</a>
Ham Radio:
<a href="https://se-simulator.lw1.at/q/which-mode-describes-this" rel="nofollow">https://se-simulator.lw1.at/q/which-mode-describes-this</a><p>> If you can already be synchronized when it comes through the use of your test. That's a switching powersupply. I disable AGC in my comments above as an antenna analyzer that works depends on the Pi transmit frequency that isn't necessary to send an SWL, but let's dig further by adding another radial... You have bigger problems.<p><i></i>Any sufficiently advanced technology is indistinguishable from magic.<i></i>
I'd like to see Stack Exchange moderator responses using Markov chains. Like "Stop answering this guy as he is posing useless questions", or "This duplicate is considered not relevant".
> <i>Remove Broken Lightbulb from the toe?</i><p>> <i>As it is horizontal? This means the color and material like them a few sheets of paper on the internet?</i><p>I don't know how much effort you put into this, but that alone was absolutely worth it.
> What is an open-commercial license?<p>> Why did they determine that he used / invisibility cloak technology?<p>> How did newton APPROXIMATE THE AREA UNDER THESE PARTICULAR CURVES
I am curious how would those results compare with RNN models, such as ones in Andrej Karpathy's "The Unreasonable Effectiveness of Recurrent Neural Networks" <a href="http://karpathy.github.io/2015/05/21/rnn-effectiveness/" rel="nofollow">http://karpathy.github.io/2015/05/21/rnn-effectiveness/</a><p>(E.g. as they are able to learn code grammar.)
Turns out the random mishmash of pseudoscience and conspiracy theories that is Skeptics.SE is hilarious fodder for a Markov chain:<p><a href="https://se-simulator.lw1.at/q/do-greeks-driving-affect-the-whaling-industry" rel="nofollow">https://se-simulator.lw1.at/q/do-greeks-driving-affect-the-w...</a>
I wonder if you split the corpus and only used low voted questions/answers for one corpus and edited questions + high-voted answers for the other ... could we tell the difference in the output chains?<p>SE often has translated questions, or questions of low quality, IME.