TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

A Curious Case of Disregarded Robots.txt

10 pointsby mikelabattabout 8 years ago

2 comments

sitkackabout 8 years ago
Robots.txt doesn&#x27;t confer copyright.<p>What about domains that have been sharked ? Does controlling robots.txt now give me the right to suppress all content ever originating from that domain, for as long as I control robots.txt?<p>Internet archive is right to spider the site, but defer showing. Collection != dissemenation.<p>The IA isn&#x27;t synthesizing, selling, cross referencing or afaict doing anything nefarious with the data.<p>You are literally picking on the last org on the internet that needs to get picked on.
评论 #14201749 未加载
ryandvmabout 8 years ago
Meh. I&#x27;m pretty ambivalent about voluntary restrictions like robots.txt. As far as I&#x27;m concerned it&#x27;s mostly useful as a way for site operators to document endless dynamic content or requests that are prohibitively expensive (but not so much that they restrict access).<p>I figure if it&#x27;s on the web and a human can read it, my computer ought to be able to read it too.
评论 #14199383 未加载