TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: anyone interested to build a hacker news with tags?

4 pointsby tucsonover 10 years ago
Could anyone help me build a hacker news with tags? I am asking only those who are interested to have it as well because I only have a budget for the hosting for this.<p>The point is to be able to search through the whole archive using tags&#x2F;keywords.<p>example of tags:<p>&#x27;security&#x27;<p>&#x27;crm&#x27;<p>&#x27;a&#x2F;b testing&#x27;<p>&#x27;optimization&#x27;<p>&#x27;http&#x27;, &#x27;ssl&#x27;, &#x27;domain name&#x27;<p>&#x27;scala&#x27;, &#x27;c++&#x27;, &#x27;php&#x27;, etc<p>&#x27;lua&#x27;<p>&#x27;sql&#x27;<p>&#x27;marketing&#x27;<p>&#x27;website&#x27;<p>&#x27;landing page&#x27;<p>=&gt; get all posts that relate to each tag (and combinations of tags) <i>sorted by points of individual posts&#x2F;comments</i>.<p>To do list: 1. import all hacker news database 2. insert in database all tags for all posts&#x2F;comments, using an algorithm similar to the Kaggle Keyword Extraction algo (https:&#x2F;&#x2F;www.kaggle.com&#x2F;c&#x2F;facebook-recruiting-iii-keyword-extraction), which will need to be refined. 3. create great user interface to the new database<p>------- or if no-one has the time, could anyone advise me on how to download the whole hacker news database?

2 comments

captn3m0over 10 years ago
1. You can download the dataset using <a href="http://hn.algolia.com/api" rel="nofollow">http:&#x2F;&#x2F;hn.algolia.com&#x2F;api</a>. Mind the rate-limits, though.<p>2. This has already been done quite a few times by various apps, most prominently here: <a href="http://algorithmia.com/demo/hn" rel="nofollow">http:&#x2F;&#x2F;algorithmia.com&#x2F;demo&#x2F;hn</a> (<a href="http://blog.algorithmia.com/post/86295023534/algorithmic-tagging-of-hackernews-or-any-other-site" rel="nofollow">http:&#x2F;&#x2F;blog.algorithmia.com&#x2F;post&#x2F;86295023534&#x2F;algorithmic-tag...</a>)
评论 #8420872 未加载
评论 #8426153 未加载
wanghqover 10 years ago
Can&#x27;t you just search the keywords? I wonder how useful it would be given that the information (tech articles, such as rails2.1, best features in jQuery1.0,...) will be out-of-date as time goes.<p>I think what&#x27;s useful is various tools if they are still alive. That&#x27;s why I want to build a toolbox which collects all the useful tools.<p><a href="https://news.ycombinator.com/item?id=8413016" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=8413016</a>
评论 #8426150 未加载