TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Posted Items on Facebook

3 pointsby sangguinealmost 17 years ago
Hi. I have a technical question about how Facebook does something using PHP. On Facebook, if you enter a link on Posted Items, Facebook automatically grabs the title, the thumbnail and the few first sentences. Do you know what they use? cURL maybe? If they do use cURL, do you think they would store this info in a database or pull this info every time?

4 comments

nertzyalmost 17 years ago
Here's how to find out:<p>Set up a page on a server you control that has a URL you know that Facebook has never seen.<p>Start making a posted item on Facebook with that URL.<p>Check your logs and see what User-Agent Facebook is using, and try to get your best guess as to what is going on.<p>Beyond the User-Agent (which might just be something like "Facebook") you could always dive deeper by investigating the actual packets sent and comparing them to those generated by something like wget or cURL.
wavealmost 17 years ago
The easiest way will be to use Alexa Site Thumbnail from AWS <a href="http://tinyurl.com/57zbp2" rel="nofollow">http://tinyurl.com/57zbp2</a> , which charges $0.0002/thumbnail.
bkrauszalmost 17 years ago
Title and first sentence are probably grabbed directly (probably cURL), it's trivial to snag a &#60;title&#62; tag and the first &#60;p&#62; tag. They definitely cache them somewhere, since pulling them every time would do terrible things to load time and is unreliable.
评论 #205817 未加载
ComputerGurualmost 17 years ago
cURL can't grab the thumbnails - you need to run the HTML through a rendering engine to do that.