TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

An algorithmic approach to GitHub exploration

26 点作者 doppenhe将近 11 年前

5 条评论

ptwobrussell将近 11 年前
This post highlights that there are indeed some significant untapped opportunities in mining GitHub user and repository data. As I was working on the 2nd Edition of Mining the Social Web last year, I observed the very same thing and introduced an entire chapter that models GitHub as a interest graph. (Think: users are interested in projects and programming languages by extension.) The IPython Notebook with all of the sample code is available with all of the other source [1] but really just begins to scratch the surface with some rudimentary centrality techniques. Like any other interest graph, the possibilities are fairly endless.<p>[1] <a href="http://nbviewer.ipython.org/github/ptwobrussell/Mining-the-Social-Web-2nd-Edition/blob/master/ipynb/Chapter%207%20-%20Mining%20GitHub.ipynb" rel="nofollow">http:&#x2F;&#x2F;nbviewer.ipython.org&#x2F;github&#x2F;ptwobrussell&#x2F;Mining-the-S...</a>
评论 #7954938 未加载
idunning将近 11 年前
Tried with &#x2F;JuliaLang&#x2F;julia and got garbage results - my guess is that the build instructions in the README dominate. Trying something like &#x2F;JuliaOpt&#x2F;Optim.jl, which has a very on-topic README, faired slightly better but still had some bizzare things like &#x2F;sergiotapia&#x2F;go-style-guide
评论 #7954478 未加载
sitkack将近 11 年前
I like the idea of your project, but it seems like the algorithmic database version of wikipedia that you plan to profiteer off of?<p>Words like marketplace, crowdsourced, and open platform played well in 2005 but now they kinda smell like a scam.
andars将近 11 年前
Another related site that attempts to do a similar thing: <a href="http://kare.progger.io" rel="nofollow">http:&#x2F;&#x2F;kare.progger.io</a>
评论 #7955157 未加载
hitlin37将近 11 年前
Strangely, they didn&#x27;t mention what kind of topic algorithm they are using. Is it LDA based?
评论 #7955418 未加载