TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Facebook employee responds on robots.txt controversy

54 点作者 petewarden将近 15 年前

9 条评论

finiteloop将近 15 年前
This is Bret Taylor, CTO of Facebook.<p>There are a couple of things I want to clarify. First, we genuinely support data portability: we want users to be able to use their data in other applications without restriction. Our new data policies, which we deployed at f8, clearly reflect this (<a href="http://developers.facebook.com/policy/" rel="nofollow">http://developers.facebook.com/policy/</a>):<p><pre><code> "Users give you their basic account information when they connect with your application. For all other data, you must obtain explicit consent from the user who provided the data to us before using it for any purpose other than displaying it back to the user on your application." </code></pre> Basically, users have complete control over their data, and as long as user gives an application explicit consent, Facebook doesn't get in the way of the user using their data in your applications beyond basic protections like selling data to ad networks and other sleazy data collectors.<p>Crawling is a bit of special case. We have a privacy control enabling users to decide whether they want their profile page to show up in search engines. Many of the other "crawlers" don't really meet user expectations. As Blake mentioned in his response on Pete's blog post, some sleazy crawlers simply aggregate user data en masse and then sell it, which we view as a threat to user privacy.<p>Pete's post did bring up some real issues with the way we were handling things. In particular, I think it was bad for us to stray from Internet standards and conventions by having an robots.txt that was open and a separate agreement with additional restrictions. This was just a lapse of judgment.<p>We are updating our robots.txt to explicitly allow the crawlers of search engines that we currently allow to index Facebook content and disallow all other crawlers. We will whitelist crawlers when legitimate companies contact us who want to crawl us (presumably search engines). For other purposes, we really want people using our API because it has explicit controls around privacy and has important additional requirements that we feel are important when a company is using users' data from Facebook (e.g., we require that you have a privacy policy and offer users the ability to delete their data from your service).<p>This robots.txt change should be deployed today. The change will make our robots.txt abide by conventions and standards, which I think is the main legitimate complaint in Pete's post.
评论 #1440191 未加载
评论 #1440197 未加载
评论 #1440577 未加载
评论 #1440374 未加载
评论 #1442733 未加载
评论 #1440192 未加载
评论 #1495215 未加载
评论 #1440251 未加载
jdrock将近 15 年前
In my personal opinion, the Facebook employee's response shows that he doesn't understand the company's high-level business goals. Their developers may think they are building an open web, but it's very clear the company is only interested in a closed web they control.<p>Based on my interaction with website API developers, most of them honestly believe that they are building an open web. But let's face it, the API is a benefit to them and creates a critical dependency for the user of the API. It's basically vendor lock-in.
评论 #1440026 未加载
Aaronontheweb将近 15 年前
Pete's right-on about this: "You've chosen to leave all that information out in the open so you can benefit from the search traffic, and instead try to change the established rules of the web so you can selectively sue anyone you decide is a threat. "<p>Speaking as someone who's working on leveraging the Facebook API in a commercial product, this leaves me feeling like I'm opening myself to a lot of legal exposure if Facebook subjectively decides that my service poses even a minor threat to them. Given that I'm bootstrapping, there's no way I'd be able to put up any sort of legal fight what so ever against a company as well-funded as Facebook.
评论 #1439815 未加载
gkoberger将近 15 年前
"Facebook has always been a closed system where developers are expected to live in a culture of asking permission before doing anything, and existing at the whim of the company's management. [...] The web I love is an open world where you are free to innovate as long as you stick to the mutually agreed rules."<p>Facebook's policies aside, it's interesting to note that this criticism was written in response to a letter from Blake Ross- the founder of Firefox.
curio将近 15 年前
this is a really important issue.<p>this new model that facebook is trying to push isn't scalable. it favors the big guys and it's bad for the open web.<p>you shouldn't have a TOS that contradicts your robots.txt. period.
jrussbowman将近 15 年前
Sorry, as implied to my comment on the first story about this posted to Hacker News, I'm siding with Facebook on this one.<p>Of course they tune their page so that search crawlers can best index the information, so does everyone else on the web.<p>However, they also provide an API, with clearly defined terms of use, which you may use to get information. Basically your complaint boils down to you don't want to use the methods they've set up for you to access their data, and you're complaining about it.<p>As for comments about fair, true spirit of the internet what have you... I don't think the true spirit of the internet has ever been everyone has to give everything away in every possible imaginable way. And, in the end, it is Facebook's data, they make that clear before you ever start adding data to their servers. So does just about everyone else who allows you to submit data to their servers.<p>This shouldn't be confused with how difficult it is to remove data and accounts from their system, which is a giant pain in the butt, or the fact that they've made drastic changes to the public nature of the data after keeping it private for so long. That's all just a giant mess.
评论 #1439874 未加载
评论 #1439936 未加载
评论 #1439848 未加载
评论 #1440246 未加载
评论 #1439907 未加载
ojbyrne将近 15 年前
Facebook isn't the only site with that language. In fact I think you'll have to look hard to find any major site that doesn't have it. Not that it doesn't suck.
tjmaxal将近 15 年前
this is essentially the same argument people use against the iOS platform.<p>But clearly from Apple's example it is possible to build a profitable product on a closed system.<p>Admittedly it doesn't seem "fair" or true to the spirit of the internet. but it shouldn't be any surprise that a company is going to do what is best for that company forsaking all others.
sd273将近 15 年前
I'm tired of hearing people complain about private companies' rules for using their information/platforms. Facebook and Apple can do whatever they want on their own platforms and if people don't like it they can leave/not use the service. That's how a free market and free web works. People complained about Microsoft's dominance years ago and now their ascendancy is ending due to this idea called the "market"...
评论 #1440257 未加载