TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: Analysis of 2.5 Years of Frontpage Articles

23 pointsby miketabout 12 years ago

4 comments

johncooganabout 12 years ago
Huge fan of DiffBot and awesome projects like this. Really cool analysis, thanks for posting.<p>Would be possible for you to post / send me the original data? I have been very interested in working on more longitudinal analysis using DiffBot data and this seems like a fun and interesting place to start. I'm happy to open-source / clearly attribute DiffBot's contribution to whatever I find / hack together, and would feel a lot more comfortable about integrating DiffBot into larger projects in the future.<p>Please email me (in my profile) if this is a possibility. Thanks!
评论 #5609252 未加载
mayankabout 12 years ago
Funny, I just built a HN article catcher that uses Diffbot to collect and classify submissions from the /new page [1]. I've been a Diffbot fan for quite a while now (although their entity recognition/tag classifier needs a bit of work as you can see from the categorization on my catcher page below).<p>[1] <a href="http://lahiri.me/more" rel="nofollow">http://lahiri.me/more</a><p>I should add that their API is fantastic, and far better than using BeautifulSoup/NLTK for extracting textual content from webpages.
评论 #5609278 未加载
tliouabout 12 years ago
Had to figure out how to use it ... but interesting once you do! Android vs IPhone on Hackernews frontpage shows spike in iphone on launch dates, but mediocre to no activity for android. is it because android is less interesting and not as innovative? or not as fun to talk/read about?<p><a href="http://diffbot.com/robotlab/hackernews/#type=tags&#38;item=IPhone&#38;item=Android%20(operating%20system)&#38;item=" rel="nofollow">http://diffbot.com/robotlab/hackernews/#type=tags&#38;item=I...</a>
minimaxabout 12 years ago
Neat! Wish I could select by just domain name (i.e. just nytimes.com rather than dozen or so whatever.nytimes.com subdomains).