6 点作者 LinuxBender大约 1 个月前

2 条评论

maniacwhat大约 1 个月前

The ai companies have shown they don't care at all about the preferences of site owners by ignoring them.<p>I don't see why a new language to express preferences would make any difference here.

PeterStuer大约 1 个月前

Honestly, some sites are so ridiculously malconfigured in their anti-bot zeal that it becomes a Heisenberg like dilemma.<p>E.g. I want to pull in the rss. It is there specifically for m2m. If I dare get the robots.txt, i'm flagged as a bot, and denied the whole site. <i>including</i> not just the rss but even the parts that are not denied per the robots.txt

Copyright-ignoring AI scraper bots laugh at robots.txt

2 条评论

Copyright-ignoring AI scraper bots laugh at robots.txt

2 条评论