Let Microsoft first publish a Copilot model that's trained on the internal codebases of Azure, Windows and Office. That's the only way Microsoft can convince me that they truly believe Copilot is non-infringing technology.
It's likely that generative AI in general will be deemed fair use, due to its (generally) transformative nature. Sure, if you really coax it, you can get code or images out that look similar to existing ones, but the courts may well find that, generally speaking, it produces new content that has not been seen before, especially in the case of images.

Google Books literally copied entire books into its online database and that was deemed fair use, so something far more transformative like generative AI will likely get even broader consideration for fair use. Google Books was, admittedly, not a clean-cut case on commerciality, but the courts generally hold that the more transformative a use is, the less strictly it needs to satisfy the other factors for determining fair use.

https://ogc.harvard.edu/pages/copyright-and-fair-use
Are there any actual details on this? I get that this is a blog post, but the only links I see on the page are to other blog posts. It leaves a lot of questions.

Is this blog post a legally enforceable contract? Is Microsoft specifically indemnifying all users of Copilot against claims of copyright infringement that arise from use of Copilot?

The blog post says that "there are important conditions to this program", and it lists a few, but are those conditions exhaustive, or are there more that the blog post doesn't cover? For example, does it apply only in specific countries, or in every legal system worldwide?

What guarantees do users have that Microsoft won't discontinue this program? If Microsoft gets kicked in the teeth repeatedly by courts ruling against them, and they realize that even they can't afford to pay out every time Copilot license-launders large chunks of copyrighted code, what means do users have to hold Microsoft to its promises?
This is a very clever move by Microsoft. In essence they are painting a giant bullseye on their own back for any lawsuits that may arise, the idea being that they have the resources to fight them (they aren't wrong).

The way AI is going, I'm sure we'll see some landmark cases very soon. It is very much in Microsoft's interest to grow this market as fast as possible and be at the center of it. This removes one of the key impediments to adopting generated code for smaller orgs: "Will I get sued if this product generates code that is copyrighted?"
With a big asterisk--
"customers... must not attempt to generate infringing materials..."<p>It hinges on what *Microsoft* decides "attempting to generate infringing materials" means. You'd like it to mean that it only excludes use when you're doing something you know would infringe copyright, like "reproduce the entire half life 2 source code." But who knows.
It may not be that simple: Microsoft may assume liability, but an infringer can still be sued separately, and MS may then be on the hook for the court costs. You can't just categorically shield the users of a product from being sued.

This is the key bit:

"Specifically, if a third party sues a commercial customer for copyright infringement for using Microsoft’s Copilots or the output they generate, we will defend the customer and pay the amount of any adverse judgments or settlements that result from the lawsuit, as long as the customer used the guardrails and content filters we have built into our products."

The 'we will defend' is one important part; I assume that means you will be using their lawyers rather than your own (they have lawyers in house, who are cheaper than the ones that bill you, the would-be defendant, by the hour).

The second part that matters is that there are conditions on how you are supposed to use the product, and crucially: you will have to document that this is how you used it.

But: interesting development. Clearly enterprise customers are a bit wary of accidentally engaging in copyright infringement by using the tool, and that may well have slowed down adoption.
Only so long as you have the guardrails enabled, one of the guardrails being that Copilot will not output any code that exists in any GitHub repo.

We tested Copilot with those guardrails enabled and it completely lobotomizes it.

This, by the way, is not a change. They already had this "Microsoft will assume liability if you get sued" clause in the Copilot Product Specific Terms: https://github.com/customer-terms/github-copilot-product-specific-terms
I've received a lot of flak for this answer in other communities, but if a statistical model is producing purely derivative works using a mathematical model that's basically a next-best-token predictor, is it really "stealing"?

Is it "stealing" to have a working understanding of the next best token, or even simply the token that shows up most often (e.g. on GitHub)?

I'm sure the argument could be made that all AI should be illegal because all ideas worth having have already been had and all text worth writing has already been written, but where would that leave us?

(E.g. your function for converting a string from uppercase to lowercase will probably look like a function that someone else on Earth has written, and the same goes for your error handling code, your state-of-the-art technique for centering a div, etc.)
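To make the "next-best-token" framing concrete, here is a deliberately toy sketch of greedy next-token prediction. This is my own illustration, not Copilot's actual architecture, which conditions on far longer contexts and samples from a learned probability distribution:

    from collections import Counter, defaultdict

    # Toy "next best token" model: count which token follows which in a
    # corpus, then always emit the most frequent successor. Real models
    # are vastly more sophisticated, but the statistical framing is the same.

    def train(corpus_tokens):
        successors = defaultdict(Counter)
        for prev, nxt in zip(corpus_tokens, corpus_tokens[1:]):
            successors[prev][nxt] += 1
        return successors

    def generate(successors, start, max_len=8):
        out = [start]
        for _ in range(max_len):
            candidates = successors.get(out[-1])
            if not candidates:
                break
            # Greedy decoding: always pick the most common next token.
            out.append(candidates.most_common(1)[0][0])
        return out

    corpus = "def to_lower ( s ) : return s . lower ( )".split()
    print(" ".join(generate(train(corpus), "def")))

The point of the toy: the model stores no training file verbatim, only statistics over it, which is why its output is usually a statistical blend rather than a copy unless a sequence was heavily memorized or deliberately coaxed out.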
I wonder how binding this kind of public commitment is. It's the same way Musk recently said publicly that he'd cover the costs of anyone having work or legal trouble over something they said on the platform, and now refuses to honor the commitment.
If a codebase infringes the GPL, the remedy is to publish the offending source code or terminate distribution. Neither is an outcome I suspect Microsoft cares about when talking about third-party code.

I don't know what the case history is like for damages with open source projects, but I suspect it wouldn't be that big of a concern for Microsoft.

Otherwise stated: Microsoft's downside to this is committing their lawyers, and the upside is improving their code generation tools.

IANAL though.
I'm just curious why everyone is talking about transformative nature while so little focus is given to:

*4. the effect of the use upon the potential market for or value of the copyrighted work* (wiki)

I don't know if this particular case is good for exploring all angles of fair use, but to me this is certainly a greater hurdle for commercial generative AI.
Wouldn't you have to first prove that your content came from Microsoft services? Hopefully you track & certify the provenance of every line of code and content you paste? Microsoft surely won't just take your word for it that your content came from them, so how would this play out in practice, exactly?
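One hypothetical answer: record provenance at commit time. A minimal sketch, assuming a per-repo log file and a Git trailer of my own invention (neither is an official Microsoft or GitHub mechanism):

    import datetime
    import json
    import subprocess

    # Hypothetical provenance record for AI-assisted hunks; the log file
    # name and the "Assisted-by" trailer are illustrative conventions,
    # not any kind of standard.

    def record_provenance(file_path, line_ranges, tool="github-copilot"):
        entry = {
            "file": file_path,
            "lines": line_ranges,  # e.g. [[10, 42]]
            "tool": tool,
            "recorded_at": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        }
        with open(".provenance.jsonl", "a") as log:
            log.write(json.dumps(entry) + "\n")

    def commit_with_trailer(message):
        # Git trailers are machine-readable via `git interpret-trailers`,
        # so an auditor could later enumerate AI-assisted commits.
        subprocess.run(
            ["git", "commit", "-m", message, "-m", "Assisted-by: github-copilot"],
            check=True,
        )

Whether Microsoft would accept such a log as evidence is exactly the open question here.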
I just had a horrible thought: what happens when there's a DMCA takedown request to remove an infringement in a widely used LLM? I've seen requests against training data, but never against the output of an LLM.
Copyright-related stuff is annoying. I can't see why anyone would care. If you publish something to the public domain, I don't understand why you get rights to your content that you can self-declare. It's completely ludicrous and only works at the corporate money level, because corporations have the liability and resources to sue. I wish people would use a little more common sense and understand the words 'public domain'. Regardless of what people say, I can tell you that no one really cares about copyright, and in terms of AI, it's an unmovable mountain. Good luck wasting time on an issue that provides nothing to humanity.
Another way to look at this:

Microsoft just became a code copyright insurance company. The premium is paid for with individual Copilot accounts for each developer. And the policy has its exceptions, of course.

This is interesting.
Has anyone noticed that Copilot will shade out its answers more often when it's writing code now? Usually I'll paste in React components and ask it to fix the Tailwind styling, but once it starts writing, the output gets blocked by some secondary filter about halfway through. I thought maybe the code it was outputting was too similar to copyrighted code and it triggered a liability filter of some sort.

In any case, it's super annoying to have that happen so consistently these days that I just use ChatGPT to fix my Tailwind styling now.
Maybe it is just me, but I find the quality of Copilot suggestions so low that it is generally usable only in the most mundane and repetitive contexts. Why all the enthusiasm about it?
It used to be "Embrace, extend, and extinguish": <a href="https://en.wikipedia.org/wiki/Embrace,_extend,_and_extinguish" rel="nofollow noreferrer">https://en.wikipedia.org/wiki/Embrace,_extend,_and_extinguis...</a><p>Now it is "Train, Task, Transform, and Transfer":<p>Train - Feed copyrighted works into machine learning model or similar system<p>Task - Machine learning model is tasked with an input prompt<p>Transform - Machine learning model generates hybrid output derived from copyrighted works, but usually not directly traceable to a given work in the training set<p>Transfer - Generated output provides the essence of the copyrighted works, but is legally untraceable to the originals
Having dealt with Microsoft for 30 years as both a power user and developer, *"we believe in standing behind our customers when they use our products"* is a lie.
A very relevant and recent post:

GitHub Copilot and open source laundering

https://drewdevault.com/2022/06/23/Copilot-GPL-washing.html

Previously on HN, in case you missed it:

https://news.ycombinator.com/item?id=31848433
Meanwhile they strike deals with news agencies to use their content for training... This is going to be a hard fight, but I really hope it ends up costing MS.
Yeah, is it becoming clear enough to some people yet that you can't replace software engineers, let alone really *help* them, with AI? This is only going to get worse, not better.

Copilot is such a flawed product from the start. It's not even a matter of its ability to write "good" code; the concept is just dumb.

Code is necessarily consumed by people first, before it's executed by a computer in a production environment. There are many ways to get a computer to do something, but the approval process by experienced humans is vastly more important than the drafting itself. Software development is already incredibly cheap, and it's the last place to cut costs.

There is no AI threat other than the one posed by grifters trying to convince you that there is.
This is one of the things people on this site have been saying Microsoft should do if they really stand behind Copilot, and now that they've done it, you have again moved the goalposts and this announcement is entirely insufficient.

How dare they? amirite?