TechEcho

5 comments

ptwobrussellalmost 11 years ago

This post highlights that there are indeed some significant untapped opportunities in mining GitHub user and repository data. As I was working on the 2nd Edition of Mining the Social Web last year, I observed the very same thing and introduced an entire chapter that models GitHub as a interest graph. (Think: users are interested in projects and programming languages by extension.) The IPython Notebook with all of the sample code is available with all of the other source [1] but really just begins to scratch the surface with some rudimentary centrality techniques. Like any other interest graph, the possibilities are fairly endless.<p>[1] <a href="http://nbviewer.ipython.org/github/ptwobrussell/Mining-the-Social-Web-2nd-Edition/blob/master/ipynb/Chapter%207%20-%20Mining%20GitHub.ipynb" rel="nofollow">http://nbviewer.ipython.org/github/ptwobrussell/Mining-the-S...</a>

评论 #7954938 未加载

idunningalmost 11 years ago

Tried with /JuliaLang/julia and got garbage results - my guess is that the build instructions in the README dominate. Trying something like /JuliaOpt/Optim.jl, which has a very on-topic README, faired slightly better but still had some bizzare things like /sergiotapia/go-style-guide

评论 #7954478 未加载

sitkackalmost 11 years ago

I like the idea of your project, but it seems like the algorithmic database version of wikipedia that you plan to profiteer off of?<p>Words like marketplace, crowdsourced, and open platform played well in 2005 but now they kinda smell like a scam.

andarsalmost 11 years ago

Another related site that attempts to do a similar thing: <a href="http://kare.progger.io" rel="nofollow">http://kare.progger.io</a>

评论 #7955157 未加载

hitlin37almost 11 years ago

Strangely, they didn't mention what kind of topic algorithm they are using. Is it LDA based?

评论 #7955418 未加载

5 comments

ptwobrussellalmost 11 years ago

评论 #7954938 未加载

idunningalmost 11 years ago

评论 #7954478 未加载

sitkackalmost 11 years ago

andarsalmost 11 years ago

Another related site that attempts to do a similar thing: <a href="http://kare.progger.io" rel="nofollow">http://kare.progger.io</a>

评论 #7955157 未加载

hitlin37almost 11 years ago

Strangely, they didn't mention what kind of topic algorithm they are using. Is it LDA based?

评论 #7955418 未加载

An algorithmic approach to GitHub exploration

5 comments

An algorithmic approach to GitHub exploration

5 comments