Ha, there's a whole section on clones of the summarizer from Classifier4J.<p>I wrote that in 2003 (I think?) based on @pg's "A plan for spam" essay, and then "invented" the summarization approach (I'm sure others had done similar, but I thought it up myself anyway).<p>Turns out it was rather well tuned. The 2003 implementation, presumably downloaded from sourceforge(!) still wins comparisons on datasets which didn't even exist when I wrote it[1].<p>I much prefer the Python implementation though[2], which I hadn't seen before.<p>Also, Textacy on top of Spacy is awesome for any kind of text work.<p>[1] <a href="https://dl.acm.org/citation.cfm?id=2797081" rel="nofollow">https://dl.acm.org/citation.cfm?id=2797081</a><p>[2] <a href="https://github.com/thavelick/summarize/blob/master/summarize.py" rel="nofollow">https://github.com/thavelick/summarize/blob/master/summarize...</a>