hacker_news_scraper | https://github.com/victoriastuart/hacker_news_scraper<p>A Python 3 script that scrapes the Hacker News feed and filters posts by<p>* number of points, and/or<p>* number of comments, and/or<p>* a keywords list that excludes posts {dead | flagged | youtube | wikipedia | ...}<p>Run via a ~/.bashrc alias or crontab.<p>Experimental code; I will monitor it locally for bugs (scheduled to run daily at {06:00 | 18:00}, starting 2020-04-14).
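For illustration only, here is a minimal sketch of the kind of filtering described above. It is not taken from the repository: the `Post` class, the `keep` function, the `match_any` switch, and the keyword list are all hypothetical names, and it assumes the posts have already been scraped into simple records.

```python
from dataclasses import dataclass


@dataclass
class Post:
    # Hypothetical record for one scraped post; not the repository's data model.
    title: str
    url: str
    points: int
    comments: int


# Assumed exclusion keywords, matched case-insensitively against title and URL.
EXCLUDE_KEYWORDS = ["[dead]", "[flagged]", "youtube", "wikipedia"]


def keep(post, min_points=None, min_comments=None, match_any=False,
         exclude=EXCLUDE_KEYWORDS):
    """Return True if the post survives the keyword and threshold filters."""
    text = f"{post.title} {post.url}".lower()
    # Keyword exclusion always applies first.
    if any(keyword.lower() in text for keyword in exclude):
        return False

    # Collect only the thresholds the caller actually set.
    checks = []
    if min_points is not None:
        checks.append(post.points >= min_points)
    if min_comments is not None:
        checks.append(post.comments >= min_comments)
    if not checks:
        return True

    # "and/or": match_any=True means either threshold is enough,
    # otherwise every threshold that was set must be met.
    return any(checks) if match_any else all(checks)


# Made-up sample data for the usage example.
posts = [
    Post("Show HN: A tiny Lisp", "https://example.com/lisp", 150, 12),
    Post("Some talk", "https://www.youtube.com/watch?v=abc", 300, 80),
    Post("Quiet but discussed", "https://example.org/q", 20, 95),
    Post("Old news [flagged]", "https://example.net/n", 500, 200),
]

for post in posts:
    if keep(post, min_points=100, min_comments=50, match_any=True):
        print(post.points, post.comments, post.title)
```

With `match_any=True` a post passes if it meets either threshold; leaving it at the default requires every threshold that was set. Treating "and/or" as a switch like this is a design assumption, not a description of how the script itself behaves.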
Not to discount your effort, but is there a benefit to scraping the data when HN offers an official API[1]? Does the API not expose all the data you need?<p>[1] <a href="https://github.com/HackerNews/API" rel="nofollow">https://github.com/HackerNews/API</a>