The New York Times Has Spent $10.8M in Its Legal Battle with OpenAI So Far

84 points by marban 3 months ago

11 comments

lesuorac 3 months ago
I still find it very (depressingly) hilarious how everybody sees this as a lawsuit about whether training on copyrighted content is legal or not.

Literally, the NYT claimed that OpenAI maintained a database of NYT's works and would just verbatim surface the content. This is not an AI issue, it's settled copyright law.
rustc 3 months ago
I hope they don't settle early and we finally get an answer to whether training AI on copyrighted content is fair use or not.
n0rdy 3 months ago
I like following the OpenAI vs. NYT case, as it's a great example of a controversial situation:

- OpenAI created their models by parsing the internet while disregarding copyrights, licenses, etc., or by looking for legal loopholes
- by doing that, OpenAI (alongside others) developed a new progressive tool that is shaping the world and seems to be the next “internet”-like (impact-wise) thing
- NYT is not happy about that, as their content is their main asset
- less democratic countries can apply even less ethical practices for data mining, as the copyright laws don't work there, so one might claim that it's a question of national defense, considering that AI is actively used in miltech these days
- while the ethical part is less controversial (imho, as I'm with NYT there), the legal one is more complicated: the laws might simply say nothing about this use case (think GPL vs. AGPL license), so the world might need new ones.

And so on...
screye 3 months ago
I can't imagine a scenario where pre-training on someone else's works is fair use, but distilling from a proprietary LLM isn't.
pkamb 3 months ago
Is anyone building a public domain repository / AI training ground for old newspapers? Anything before 1930 has no restrictions. Newspapers.com has pretty good content but the interface and search are extremely lacking. Google News was abandoned a decade ago. This seems like something where AI could really help, for once. Not in training chatbots or whatever but actually just providing great search for articles in books, newspapers, and magazines.
ViktorRay 3 months ago
Would anyone here be able to explain to me where this money is going? Are the lawyers working for the New York Times really this expensive? If so, these lawyers must be getting massive amounts of money...
nimish 3 months ago
NYT will lose:

Copyright only protects the actual text. LLMs have weights, not exact copies. In any case, saying "if I put in some input and get copyrighted output" is tantamount to copyright violations; if I use a generative tool and generate copyrighted info, is it the tool's fault?

An LLM is a dump of effectively arbitrary numbers that, when hooked up to a command line, uses one of the world's most awful programming languages to evaluate and execute.

OpenAI at most broke an EULA or some technicality on copyright w.r.t. local ephemeral copies. What's the damage to the NYT though?
gotoeleven 3 months ago
Are they paying the lawyers with government money? I'm seriously asking. Why is the government paying 10s of millions of dollars/year to the New York Times? How can they still claim to be a news organization without having disclosed this? If the government is paying the NYT, then don't their productions belong in the public domain?

https://x.com/stillgray/status/1887191056074350690
SebFender 3 months ago
"OpenAI asserts that training AI models using publicly accessible content, including material from The New York Times, is protected under longstanding fair use principles."

Incredible.

The foundation of fair use is a transformative and non-consumptive use of copyrighted material.
tester756 3 months ago
Why is it THAT expensive?
user3939382 3 months ago
My ideal solution would be to release anything NYT has written in the past into the public domain, turn it over to archive.org, and dismantle NYT so it’s no longer an issue in the future.