TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Whitehouse.gov petitions are blocked from search results

33 点作者 rquantz超过 7 年前

4 条评论

samfriedman超过 7 年前
To anyone blaming the current administration, note that the robots.txt is identical before the election too: <a href="https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20161101000359&#x2F;https:&#x2F;&#x2F;petitions.whitehouse.gov&#x2F;robots.txt" rel="nofollow">https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20161101000359&#x2F;https:&#x2F;&#x2F;petitions...</a>
评论 #15450943 未加载
mrguyorama超过 7 年前
Isn&#x27;t there a built in search page for these petitions? What good would it be to have these petitions indexed by google? To be honest, I don&#x27;t really want petitions influenced by SEO
评论 #15458660 未加载
masukomi超过 7 年前
i would point out that robots.txt is optional you don&#x27;t have to follow it. It would be easy enough for one of us to extract the text of each petition, with a simple spider, put it on a web site with links back to the original, and let google search that. The petitions are public documents for public consumption. Even if white house tried to sue it wouldn&#x27;t be their content. it&#x27;s the content of the person who created it. Otherwise they would be legally suggesting that they are petitioning themselves.... then again IANAL just a human capable of reasoning through things logically, which rarely has any bearing on lawsuits. ;)
gremlinsinc超过 7 年前
Someone should create a scraper&#x2F;aggregator w&#x2F; links back and synopsis... So google does spider the content.