TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.
Facebook's robots.txt

40 points by sander over 11 years ago

7 comments

perryh2 over 11 years ago
http://disqus.com/humans.txt
viana007 over 11 years ago
http://www.google.com/robots.txt
kr1m over 11 years ago
You don't scrape Facebook, Facebook scrapes you!
yalogin over 11 years ago
So what does it mean for Facebook to whitelist a scraping service? Do they actively block scrapers?
pdfcollect over 11 years ago
Is there a way to replace this robots.txt with a null robots.txt? :)
bibstha over 11 years ago
What is "User-agent: Yeti"?
decasteve over 11 years ago
Even Facebook's robots.txt has a hatred for my pseudo-anonymous browser settings. Facebook gives me this (for any page): "Sorry, something went wrong. We're working on getting this fixed as soon as we can."