How do you get blurbs from a website?

1 point by amrithk, about 17 years ago
When you type a URL into websites like Facebook and Digg, they automatically pull up a blurb for that website containing its first few sentences.

For example, typing in cnn.com brings up the CNN blurb "Breaking news U.S., World, Weather, Entertainment & Video News" along with the first few sentences on the CNN website.

How is this done? Is there some sort of crawler that goes to the link provided? Thanks, all.

2 comments

Readmore, about 17 years ago
I believe what you're talking about is the title of the website.

From www.cnn.com:

<html lang="en"><head><title>CNN.com - Breaking News, U.S., World, Weather, Entertainment & Video News</title>

So you could just parse out the title of each site and display that.
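A minimal sketch of the title-parsing idea, assuming Python and only its standard-library `html.parser` module. The names `TitleParser` and `extract_title` are made up for illustration, and this is not necessarily how Facebook or Digg do it.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text of the first <title> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()  # character references are decoded by default
        self._in_title = False
        self._done = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        # Only the first <title> counts; later ones are ignored.
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self._parts.append(data)

    @property
    def title(self):
        # Collapse runs of whitespace and newlines into single spaces.
        return " ".join("".join(self._parts).split())


def extract_title(html):
    """Return the page title, or an empty string if there is none."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return parser.title
```

Given the page source as a string, `extract_title(page_source)` returns the text between `<title>` and `</title>`; downloading the page is a separate step.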
epi0Bauqu, about 17 years ago
Yes, that is the work of crawling and parsing. On some sites, the blurbs are human-edited as well.
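A rough sketch of what "crawling and parsing" could look like, again assuming Python's standard library. It prefers the page's `<meta name="description">` if there is one and otherwise falls back to the first few sentences found in `<p>` tags. The names `BlurbParser`, `make_blurb` and `fetch_html`, the sentence-splitting regex, and the 10-second timeout are all assumptions for illustration, not a description of any particular site's crawler.

```python
import re
import urllib.request
from html.parser import HTMLParser


class BlurbParser(HTMLParser):
    """Gathers the meta description and paragraph text (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.description = None
        self.paragraphs = []
        self._in_p = False
        self._current = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "description":
            if self.description is None:
                self.description = (attrs.get("content") or "").strip()
        elif tag == "p":
            self.flush_paragraph()  # a new <p> implicitly closes the last one
            self._in_p = True

    def handle_endtag(self, tag):
        if tag == "p":
            self.flush_paragraph()

    def handle_data(self, data):
        if self._in_p:
            self._current.append(data)

    def flush_paragraph(self):
        text = " ".join("".join(self._current).split())
        if text:
            self.paragraphs.append(text)
        self._current = []
        self._in_p = False


def make_blurb(html, max_sentences=2):
    """Return the meta description, else the first few paragraph sentences."""
    parser = BlurbParser()
    parser.feed(html)
    parser.close()
    parser.flush_paragraph()  # pick up a trailing unclosed <p>
    if parser.description:
        return parser.description
    text = " ".join(parser.paragraphs)
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text)
    return " ".join(sentences[:max_sentences])


def fetch_html(url, timeout=10):
    """The 'crawl' step: download a page and decode it to text."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")
```

Something like `make_blurb(fetch_html(url))` would then produce the blurb for a submitted link. A production crawler would also need to handle redirects, errors, rate limits and robots.txt, which this sketch leaves out.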