It's unfortunate you removed all the primary subreddits (or degree > 75), it makes a lot of the networks confusing (the gaming network, for example) because they seem to only show niche topics.
This is really well done! I would recommend putting the defaults back in and playing around with and finding a good value for "remove connections that have less than" using a number based on a percentage of the posts in sub instead of an arbitrary 8 connections.<p>For example, remove connections from /r/programming to subs that are cross referenced from /r/programming less than 5% of the time.<p>Again, very nice!
Not surprising that the drama bomb meta subreddits are the biggest connectors (TrueRedditDrama being the largest graphically).<p>I personally think the overall quality/SNR of the entire site would go up if meta subs (a sub who's sole purpose is to link to other subs) were banned outright. They don't seem to do anything but stir up strife and abuse.
With others, I recently played around with a custom built dataset, using unsupervised machine learning to cluster the subreddits, based on content.<p>You can play with it here:<p><a href="http://demo.xplr.com/xplr/umbreddit/" rel="nofollow">http://demo.xplr.com/xplr/umbreddit/</a><p>The system can be queried:<p><a href="http://demo.xplr.com/xplr/umbreddit/?search=cyclists" rel="nofollow">http://demo.xplr.com/xplr/umbreddit/?search=cyclists</a><p><a href="http://demo.xplr.com/xplr/umbreddit/?search=species" rel="nofollow">http://demo.xplr.com/xplr/umbreddit/?search=species</a><p>For the curious, more technical information is available here:
<a href="https://xplr.com/xplr-umbrella-dataviz-on-top-of-unsupervised-machine-learning/" rel="nofollow">https://xplr.com/xplr-umbrella-dataviz-on-top-of-unsupervise...</a>
and here:
<a href="https://xplr.com/a-subreddit-recommender-with-xplr/" rel="nofollow">https://xplr.com/a-subreddit-recommender-with-xplr/</a><p>This analysis is slightly different than the study of cross-posed links. Here the content of posted URLs (not yet comments)is analyzed and subreddits are put into clusters within a search engine.<p>This allows the easy building of a subreddit recommender for Reddit (Chrome only for now): <a href="https://chrome.google.com/webstore/detail/preddit/epicmjpmnmjgbmahjcigppkenngbdjbd" rel="nofollow">https://chrome.google.com/webstore/detail/preddit/epicmjpmnm...</a>
This is really useful to me, I'm always wondering about specific subreddits that may interest me. Now I can actually find them. I think there may be other ways to present this information to help people discover subreddits of interest.
Can anyone find the cluster of technical/programming subreddits? Closest ones I see are the Startup hub, and java_help. I know they're in there somewhere though.