TE
TechEcho
Home
24h Top
Newest
Best
Ask
Show
Jobs
English
GitHub
Twitter
Home
Facebook's robots.txt
40 points
by
sander
over 11 years ago
7 comments
perryh2
over 11 years ago
Collapse
<a href="http://disqus.com/humans.txt" rel="nofollow">http://disqus.com/humans.txt</a>
评论 #6822970 未加载
评论 #6821761 未加载
评论 #6821904 未加载
viana007
over 11 years ago
Collapse
<a href="http://www.google.com/robots.txt" rel="nofollow">http://www.google.com/robots.txt</a>
评论 #6823056 未加载
评论 #6821679 未加载
kr1m
over 11 years ago
Collapse
You don't scrape Facebook, Facebook scrapes you!
评论 #6821656 未加载
yalogin
over 11 years ago
Collapse
So what does it mean by facebook whitelisting a scraping service? Do they actively block scrapers?
评论 #6821643 未加载
pdfcollect
over 11 years ago
Collapse
Is there a way to replace this robots.txt with a null robots.txt? :)
评论 #6821603 未加载
bibstha
over 11 years ago
Collapse
What is a User Agent: Yeti?
评论 #6822826 未加载
decasteve
over 11 years ago
Collapse
Even Facebook's robots.txt has a hatred for my pseudo-anonymous browser settings. Facebook gives me this (for any page): "Sorry, something went wrong. We're working on getting this fixed as soon as we can."
评论 #6821610 未加载