What infrastructure do news aggregators typically use to do the crawling, parsing, indexing, extracting, storing/format in database, etc.? Are there open source news aggregators so that I can learn how those problems are solved?
Hi! I'm just developing one at <a href="http://github.com/msmuenchen/skynetrss" rel="nofollow">http://github.com/msmuenchen/skynetrss</a> (demo instance at <a href="http://dtmp.vm-dg.de/rss/" rel="nofollow">http://dtmp.vm-dg.de/rss/</a> - don't rely on this one working 100% of the time;)).<p>The only other FOSS project I know about is TinyTinyRSS aka TTRSS.
There are a few still active or not : Gregarius, Lilina, TinTinyRSS, RSSLounge and the more recent selfoss from the same developper.
More generally you would search for new informations on the topic with something like "self hosted rss reader" in a search engine