TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: user generated TL;DRs with a smart scorer

19 pointsby quanabout 14 years ago

8 comments

quanabout 14 years ago
Hi HN,<p>Here's a little background about gistpoint, I was an FPGA developer working on DSP systems at a big company in D.C. Last May, bored of my job and frustrated at my career progress, I took the paths most travelled and quit my job to wander around the world. Along the way, I picked up ruby/rails and javascript because I want to transition into web programming. The learning curve was steep coming from the hardware world. Plus running from city to city also made it tough to maintain focus. 2 months ago, after many false starts and months of distraction, I finally decided to push it through completion.<p>gistpoint is a web app that lets you search and submit summaries for online articles. It's unique in that it has a scorer that rates how relevant a summary is compared to its fulltext. This is done by analyzing several factors such as information coverage, sentence grammar, word stemming, etc. So the best summary will rise to the top and gets shown as the default. No signup/login is needed to submit summaries, and quality is controlled without voting. When no user-submitted summary is available, it will extract the most relevant passage from the fulltext. From my experience if I just naturally summarize an article I just read, the scorer rates it highly compared to the extracted one.<p>Please let me know what you think. If anyone wants to know more about the scoring algorithm, I'll be happy to write a post about it. I definitely open source the summary extraction and grammar gems on github. They're already on rubygems <a href="https://rubygems.org/profiles/50055" rel="nofollow">https://rubygems.org/profiles/50055</a> for you to download, I just need to create the documentations.
评论 #2540989 未加载
评论 #2540962 未加载
kmfrkabout 14 years ago
Very interesting concept. I think it's necessary to find a way to gauge the most popular articles rather than trying to tl;dr the entire internet.<p>For instance, you could look into some APIs for the most popular submissions on Hacker News and reddit and present them as articles that need summaries.<p>You could also create a bookmarklet that users or authors can use to recommend an article to be summarized, which will then appear in some kind of summary queue on the main page. (Predefined) tags could also be included, so people who are only interested in summarizing certain articles, can cherry-pick these.<p>Another thing is to suggestion is to reward productive users and upvoted summaries with a link to the writer's flattr profile. People can do this, even though the person has a flattr profile now[1][2], and I think this might increase the incentive even more.<p>If this takes off, you could create some browser extensions similar to the Hacker News Sidebar[3], which displays a sidepanel on websites that have been submitted to Hacker News.<p>[1]: <a href="http://blog.flattr.net/2011/04/opening-the-floodgates" rel="nofollow">http://blog.flattr.net/2011/04/opening-the-floodgates</a><p>[2]: <a href="http://blog.flattr.net/2011/05/flattr-almost-anyone" rel="nofollow">http://blog.flattr.net/2011/05/flattr-almost-anyone</a><p>[3]: <a href="https://chrome.google.com/webstore/detail/hhedbplnihmkekhgma.." rel="nofollow">https://chrome.google.com/webstore/detail/hhedbplnihmkekhgma...</a>.
评论 #2541283 未加载
NathanKPabout 14 years ago
I like the concept, but I feel that your algorithm for generating an initial summary needs a lot of work. For example, I was experimenting with the API:<p>The input page:<p><a href="http://bookflavor.com/dukan-diet-2-steps-lose-weight-2-steps-forever-pierre-dukan-0307887960" rel="nofollow">http://bookflavor.com/dukan-diet-2-steps-lose-weight-2-steps...</a><p>The API results:<p><a href="http://gistpoint.com/get?u=http://bookflavor.com/dukan-diet-2-steps-lose-weight-2-steps-forever-pierre-dukan-0307887960" rel="nofollow">http://gistpoint.com/get?u=http://bookflavor.com/dukan-diet-...</a><p>Rather than choosing a snippet of the main text, it chose a few words from one of the menu items...<p>If you can fine tune the algorithmic generation of snippets in the API this could be very useful. Keep up the good work!
评论 #2541042 未加载
评论 #2541117 未加载
dfrankeabout 14 years ago
Apparently I can quickly produce extremely high-scoring gibberish using Emacs M-x dissociated-press.
rexreedabout 14 years ago
User-generated Cliff's Notes for online articles? I can see some value there, but in as much as Cliff's Notes has value. I worry sometimes about the intellectual shortcuts people are taking in not reading articles completely or depending on other's feedback / scores / comments to determine the value of the content. Since when was reading for comprehension a chore? I know people are short on time, but is it wise to shortcut really understanding something?
评论 #2541037 未加载
runjakeabout 14 years ago
I don't remember what tldr means. Someone told me it was something from Reddit, but that's as much as I can recall.<p>Can we use words than the general HN audience would understand?
评论 #2540917 未加载
评论 #2540926 未加载
brosephiusabout 14 years ago
nice idea, but what's the incentive for me to submit a summary?
评论 #2541290 未加载
dorkitudeabout 14 years ago
Comic Sans?
评论 #2540965 未加载