How do you get blurbs from a website?

1 点作者 amrithk大约 17 年前

When you type in a URL on websites like Facebook and Digg, they automatically pull up a blurb of the website that contains the first few sentances of the site.For example, when typing in cnn.com, the CNN blurb "Breaking news U.S., World, Weather, Entertainment & Video News" and the first few sentances on the CNN website automatically appear as well.How is this done? Is there some sort of crawler that goes to the link provided? Thanks all.

2 条评论

Readmore大约 17 年前

I believe what you're talking about is the title of the website.From www.cnn.com<html lang="en"><head><title>CNN.com - Breaking News, U.S., World, Weather, Entertainment & Video News</title>So you could just parse our the title of each site and display that.

评论 #170996 未加载

epi0Bauqu大约 17 年前

Yes, that is the work of crawling and parsing. On some sites, they are human edited as well.

评论 #170966 未加载