A key detail a lot of people are missing about "traditional" search vs. ChatGPT style search:<p>ChatGPT/LLMs can essentially crawl _anything_ they want, regardless of legality, license, consent, etc. These models are trained on anything that can be ingested. Once trained, you can release the model with plausible deniability. There's no 1:1 relation between ingested content and outputs. LLMs that "cheat" by ingesting content they shouldn't have will have an advantage over those that don't.<p>Google and other search engines don't have this luxury. If they serve a result, they have to make sure that they're not violating any license. If they crawl the wrong content, they have to make sure they don't serve it.