Microsoft will assume liability for legal copyright risks of Copilot

540 points by wgx, over 1 year ago

35 comments

tremon, over 1 year ago
Let Microsoft first publish a Copilot model that's trained on the internal codebases of Azure, Windows and Office. That's the only way Microsoft can convince me that they truly believe Copilot is non-infringing technology.
satvikpendem, over 1 year ago
It's likely that generative AI in general will be deemed fair use, due to its (generally) transformative nature. Sure, if you really coax it, you can get code or images out that look similar to existing ones, but the courts might see that, generally speaking, it produces new content that has not been seen before, especially in the case of images.

Google Books literally copied and pasted books to add to their online database and that was deemed fair use, so something much more transformative like generative AI will likely fall under much broader consideration for fair use. Google Books was, yes, non-commercial, but the courts generally have the provision that the more transformative something is, the less it needs to adhere to the guidelines laid out for determining such fair use.

https://ogc.harvard.edu/pages/copyright-and-fair-use
StewardMcOy, over 1 year ago
Are there any actual details on this? I get that this is a blog post, but the only links I see on the page are to other blog posts. It leaves a lot of questions.

Is this blog post a legally enforceable contract? Is Microsoft specifically indemnifying all users of Copilot against claims of copyright infringement that arise from use of Copilot?

The blog post says that "there are important conditions to this program", and it lists a few, but are those conditions exhaustive, or are there more that the blog post doesn't cover? For example, is it only in specific countries, or does it apply to every legal system worldwide?

What guarantees do users have that Microsoft won't discontinue this program? If Microsoft gets kicked in the teeth repeatedly by courts ruling against them, and they realize that even they can't afford to pay out every time Copilot license-launders large chunks of copyrighted code, what means do users have to hold Microsoft to its promises?
jtchang, over 1 year ago
This is a very clever move by Microsoft. In essence they are painting a giant bullseye on their back for any lawsuits that may arise. The idea being that they have the resources to challenge them (they aren't wrong).

The way AI is going I'm sure we'll see some landmark cases very soon. It is very much in Microsoft's interest to grow this market as fast as possible and be at the center of it. This removes one of the key impediments to adopting generated code for smaller orgs: "Will I get sued if this product generates code that is copyrighted?".
fsdavcaa, over 1 year ago
With a big asterisk-- "customers... must not attempt to generate infringing materials..."

It hinges on what *Microsoft* decides "attempting to generate infringing materials" means. You'd like it to mean that it only excludes use when you're doing something you know would infringe copyright, like "reproduce the entire Half-Life 2 source code." But who knows.
jacquesm, over 1 year ago
It may not be that simple: Microsoft may assume liability but an infringer can still be sued separately. MS may then be on the hook for the court costs. But you can't just categorically shield the users of a product from being sued.

This is the key bit:

"Specifically, if a third party sues a commercial customer for copyright infringement for using Microsoft’s Copilots or the output they generate, we will defend the customer and pay the amount of any adverse judgments or settlements that result from the lawsuit, as long as the customer used the guardrails and content filters we have built into our products."

The 'we will defend' is one important part. I assume that means that you will be using their lawyers rather than your own (which they have in house, and so are cheaper to use than the ones that bill you, the would-be defendant, by the hour).

The second part that matters is that there are conditions on how you are supposed to use the product and, crucially, you will have to document that this is how you used it.

But: interesting development. Clearly enterprise customers are a bit wary of accidentally engaging in copyright infringement by using the tool, and that may well have slowed down adoption.
lijok, over 1 year ago
Only so long as you have the guardrails enabled. One of the guardrails being that Copilot will not output any code that exists in any GitHub repo.

We tested Copilot with those guardrails enabled and it completely lobotomizes it.

This by the way is not a change. They already had this “Microsoft will assume liability if you get sued” clause in the Copilot Product Specific Terms: https://github.com/customer-terms/github-copilot-product-specific-terms
whitfieldsdad, over 1 year ago
I've received a lot of flak for this answer in other communities, but, if a statistical model is producing purely derivative works using a mathematical model that's basically a next best token predictor, is it really "stealing"?

Is it "stealing" to have a working understanding of the next best token, or even simply the token that shows up the most often (e.g. on GitHub)?

I'm sure that the argument could be made that all AI should be illegal as all ideas worth having have already been had, and all text worth writing has already been written, but, where would that leave us?

(e.g. your function for converting a string from uppercase to lowercase will probably look like a function that someone else on Earth has written, and the same goes for your error handling code, your state of the art technique for centering a div, etc.)
littlestymaar, over 1 year ago
I wonder how binding this kind of public commitment is. It's similar to how Musk recently said publicly that he'd cover the costs of anyone having work or legal issues over something they said on the platform (and now refuses to honor the commitment).
scj, over 1 year ago
If a codebase is infringing the GPL, the remedy is to publish the offending source code or terminate distribution. Neither is a case I suspect Microsoft cares about when talking about 3rd party code.

I don't know what case history is like for damages with open source projects, but I suspect it wouldn't be that big of a concern for Microsoft.

Otherwise stated, Microsoft's downside to this is committing their lawyers. And the upside is to improve their code generation tools.

IANAL though.
lewhoo, over 1 year ago
I'm just curious why everyone is talking about transformative nature while so little focus is given to:

*4. the effect of the use upon the potential market for or value of the copyrighted work* (wiki)

I don't know if this particular case is good for exploring all angles of fair use, but to me this certainly is a greater hurdle for commercial generative AI.
dataflow, over 1 year ago
Wouldn't you have to first prove that your content came from Microsoft services? Hopefully you track & certify the provenance of every line of code and content you paste? Microsoft surely won't just take your word for it that your content came from them, so how would this play out in practice, exactly?
indymike, over 1 year ago
I just had a horrible thought: what happens when there's a DMCA takedown request to remove an infringement in a widely used LLM? I've seen requests against training data, but never against the output of an LLM.
tpmx, over 1 year ago
Pinky promise. Where's the legal agreement? I'm sure there's a cap on their liability.
tboyd47, over 1 year ago
What is the financial upside Microsoft is seeing to this that no one else seems to see?
bobobob420, over 1 year ago
Copyright-related stuff is annoying. I can't see why anyone would care. If you publish something to the public domain, I don't understand why you get rights to your content that you can self-declare. It's completely ludicrous and only works at the corporate money level because they have liability and resources to sue. I wish people would use a little more common sense and understand the words ‘public domain’. Regardless of what people say, I can let you know that no one really cares about copyright, and in terms of AI, it's an unmovable mountain. Good luck wasting time on figuring out an issue that provides nothing to humanity.
coding123, over 1 year ago
Another way to look at this is:

Microsoft just became a code copyright insurance company. The premium is paid for with individual Copilot accounts for each developer. And the policy has its exceptions of course.

This is interesting.
soultrees, over 1 year ago
Has anyone noticed that Copilot will shade out its answers more often when it’s writing code now? Usually I’ll paste in React components and ask it to fix the Tailwind styling, but once it starts writing, it gets filtered out by some secondary filter about halfway through. I thought maybe the code it was outputting was too similar to copyrighted code and it triggered a liability filter of some sort.

In any case, it's super annoying to have that happen so consistently these days that I just use ChatGPT to fix my Tailwind styling now.
aldousd666, over 1 year ago
This has been a seemingly impassable Rubicon, and Microsoft is building a bridge across it and posting guards along the way.
alberth, over 1 year ago
Plot twist, generative AI wrote that blog post to convince people to use Copilot more.
elzbardico, over 1 year ago
Maybe it is just me, but I found the quality of Copilot suggestions so low that it is generally usable only in the most mundane and repetitive contexts. Why all the enthusiasm about it?
treprinum, over 1 year ago
Are they going to threaten all small devs with patents when they object to having their code in Copilot almost verbatim?
Havoc, over 1 year ago
Which is essentially open-ended liability...so their lawyers must be very darn sure there isn't much risk.
PeterStuer, over 1 year ago
Isn't this extremely gameable? Find someone's IP, split the gains.
matt3210, over 1 year ago
The on-prem people were right the whole time!
dirtyid, over 1 year ago
TLDR Microsoft will litigate against any suits until one side goes broke. That side is probably not Microsoft.
heavyset_go, over 1 year ago
You can now launder GPL code with the confidence that Microsoft's world class legal team will have your back if you're sued for it.
thesuperbigfrog, over 1 year ago
It used to be "Embrace, extend, and extinguish": https://en.wikipedia.org/wiki/Embrace,_extend,_and_extinguish

Now it is "Train, Task, Transform, and Transfer":

Train - Feed copyrighted works into machine learning model or similar system

Task - Machine learning model is tasked with an input prompt

Transform - Machine learning model generates hybrid output derived from copyrighted works, but usually not directly traceable to a given work in the training set

Transfer - Generated output provides the essence of the copyrighted works, but is legally untraceable to the originals
baz00, over 1 year ago
Having dealt with Microsoft for 30 years as both a power user and developer, "we believe in standing behind our customers when they use our products" is a lie.
tyingq, over 1 year ago
Yet they don't feed their own closed source assets to Copilot for training...why not?
jacquesm, over 1 year ago
A very relevant and recent posting:

GitHub Copilot and open source laundering

https://drewdevault.com/2022/06/23/Copilot-GPL-washing.html

Previously on HN, in case you missed it:

https://news.ycombinator.com/item?id=31848433
CameronNemo, over 1 year ago
Meanwhile they strike deals with news agencies to use their content to train on... This is going to be a hard fight, but I really hope this ends up costing MS.
sublinear, over 1 year ago
Yeah, is it becoming clear enough to some people yet that you can't replace software engineers, let alone really *help* them, with AI? This is only going to get worse, not better.

Copilot is such a flawed product from the start. It's not even a matter of its ability to write "good" code. The concept is just dumb.

Code is necessarily consumed by people first before it's executed by a computer in a production environment. There are many ways to get a computer to do something, but the approval process by experienced humans is vastly more important than the drafting of it. Software dev is already incredibly cheap and the last place to cut costs.

There is no AI threat other than the one posed by grifters trying to convince you that there is.
hulitu, over 1 year ago
> Microsoft will assume liability for legal copyright risks of Copilot

Extinguish.
naikrovek, over 1 year ago
This is one of the things people on this site have been saying that Microsoft should do if they really stand behind Copilot, and now that they've done it, you have again moved the goalposts and this announcement is entirely insufficient.

How dare they? Amirite?