TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

We've filed a lawsuit against GitHub Copilot

724 点作者 iworshipfaangs2超过 2 年前

88 条评论

an1sotropy超过 2 年前
Seems important to point out that the announcement on this page (<a href="https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;</a>) is a followup to <a href="https:&#x2F;&#x2F;githubcopilotinvestigation.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;githubcopilotinvestigation.com&#x2F;</a> previously discussed here: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33240341" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33240341</a> (with 1219 comments)
Cort3z超过 2 年前
I’m not a lawyer, but here is why I believe a class action lawsuit is correct;<p>“AI” is just fancy speak for “complex math program”. If I make a program that’s simply given an arbitrary input then, thought math operations, outputs Microsoft copyright code, am I in the clear just because it’s “AI”? I think they would sue the heck out of me if I did that, and I believe the opposite should be true as well.<p>I’m sure my own open source code is in that thing. I did not see any attributions, thus they break the fundamentals of open source.<p>In the spirit of Rick Sanchez; It’s just compression with extra steps.
评论 #33459862 未加载
评论 #33459314 未加载
评论 #33458947 未加载
评论 #33459145 未加载
评论 #33459134 未加载
评论 #33458507 未加载
评论 #33458523 未加载
评论 #33462292 未加载
评论 #33458511 未加载
评论 #33459799 未加载
评论 #33459379 未加载
评论 #33458525 未加载
评论 #33458441 未加载
评论 #33458657 未加载
评论 #33459975 未加载
blackbrokkoli超过 2 年前
I am sorry for not bringing any kind of legal perspective here, but:<p>*Jesus Christ*, I hope I live long enough to see copyright die. Here we are at the cusp of a new paradigm of commanding computers to do stuff for us, right at the beginning of the first AI development which actually impresses me.<p>And we are fucking bickering about how we were cheated out of $0.00034 because our repo from 2015 might have been used for training.<p>I am also deeply disappointed in HackerNews; where is that deep hatred of patent trolls and smug satisfaction whenever something gets cracked or pirated now?
评论 #33461515 未加载
评论 #33459455 未加载
评论 #33461421 未加载
评论 #33460211 未加载
评论 #33461568 未加载
评论 #33460229 未加载
评论 #33459893 未加载
评论 #33459457 未加载
评论 #33460188 未加载
评论 #33462113 未加载
评论 #33460737 未加载
评论 #33461314 未加载
评论 #33459536 未加载
评论 #33459527 未加载
评论 #33461051 未加载
评论 #33460973 未加载
评论 #33459450 未加载
评论 #33468952 未加载
评论 #33459632 未加载
评论 #33476461 未加载
评论 #33460285 未加载
评论 #33470876 未加载
CobrastanJorji超过 2 年前
As a non-lawyer, I am very suspicious of the claim that &quot;Plaintiffs and the Class have suffered monetary damages as a result of Defendants’ conduct.&quot; Flagrant disregard for copyright? Sure, maybe. The output of the model is subject to copyright? Who knows! But the copyright holders being damaged in some what? Seems doubtful. The best argument I could think of would be &quot;GitHub would have had to pay us for this, and they didn&#x27;t pay us, so we lost money,&quot; but that&#x27;d presumably work out to pennies per person.
评论 #33457643 未加载
评论 #33457558 未加载
评论 #33457511 未加载
评论 #33457496 未加载
评论 #33458028 未加载
评论 #33471346 未加载
r3trohack3r超过 2 年前
I&#x27;m not confident in this stance - sharing it to have a conversation. Hopefully some folks can help me think through this!<p>The value of copyleft licenses, for me, was that we were fighting back against the notion of copyright. That you couldn&#x27;t sell me a product that I wasn&#x27;t allowed to modify and share my modifications back with others. The right to modify and redistribute transitively though the software license gave a &quot;virality&quot; to software freedom.<p>If training a NN against a GPL licensed code &quot;launders&quot; away the copyleft license, isn&#x27;t that a good thing for software freedom? If you can launder away a copyleft license, why couldn&#x27;t you launder away a proprietary license? If training a NN is fair use, couldn&#x27;t we bring proprietary software into the commons using this?<p>It seems like the end goal of copyleft was to fight back against copyright, not to have copyleft. Tools like copilot seem to be an exceptionally powerful tool (perhaps more powerful than the GPL) for liberating software.<p>What am I missing?
评论 #33457998 未加载
评论 #33458392 未加载
评论 #33459030 未加载
评论 #33458158 未加载
评论 #33458429 未加载
评论 #33458529 未加载
评论 #33458023 未加载
评论 #33457970 未加载
评论 #33458276 未加载
评论 #33457993 未加载
adlpz超过 2 年前
It feels weird saying this but, for once, I hope the big evil corporation gets to keep selling their big bad product.<p>I find the pattern matching and repetitive code generation <i>really</i> helpful. And the library autocomplete on steroids, too.<p>Meh. Tricky subject.
评论 #33457562 未加载
评论 #33457940 未加载
评论 #33463489 未加载
评论 #33460585 未加载
评论 #33458307 未加载
albertzeyer超过 2 年前
I really don&#x27;t understand how there can be a problem with how Copilot works. Any human just works in the same way. A human is trained on lots and lots of of copyrighted material. Still, what a human produces in the end is not automatically derived work from all the human has seen in his life before.<p>So, why should an AI be treated different here? I don&#x27;t understand the argument for this.<p>I actually see quite some danger in this line of thinking, that there are different copyright rules for an AI compared to a human intelligence. Once you allow for such arbitrary distinction, it will get restricted more and more, much more than humans are, and that will just arbitrarily restrict the usefulness of AI, and effectively be a net negative for the whole humanity.<p>I think we must really fight against such undertaking, and better educate people on how Copilot actually works, such that no such misunderstanding arises.
评论 #33459186 未加载
评论 #33459361 未加载
评论 #33460196 未加载
评论 #33461993 未加载
评论 #33459714 未加载
herpderperator超过 2 年前
The title of the submitted PDF document: &quot;Microsoft Word - 2022-11-02 Copilot Complaint (near final)&quot;[0]<p>I&#x27;ve noticed this a lot and it&#x27;s quite funny seeing what the actual filename of the document was. Does this just get included as metadata by default when you export to PDF?<p>[0] <a href="https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;pdf&#x2F;1-0-github_complaint.pdf" rel="nofollow">https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;pdf&#x2F;1-0-github_complaint...</a>
评论 #33457876 未加载
评论 #33458837 未加载
评论 #33457827 未加载
评论 #33458255 未加载
deanjones超过 2 年前
This will fail very quickly. The licence that project owners publish with their code on Github applies to third parties who wish to use the code, but does not apply to Github. Authors who publish their code on Github grant Github a licence under the Github Terms: <a href="https:&#x2F;&#x2F;docs.github.com&#x2F;en&#x2F;site-policy&#x2F;github-terms&#x2F;github-terms-of-service" rel="nofollow">https:&#x2F;&#x2F;docs.github.com&#x2F;en&#x2F;site-policy&#x2F;github-terms&#x2F;github-t...</a><p>Specifically, sections D.4 to D.7 grant Github the right to &quot;to store, archive, parse, and display Your Content, and make incidental copies, as necessary to provide the Service, including improving the Service over time. This license includes the right to do things like copy it to our database and make backups; show it to you and other users; parse it into a search index or otherwise analyze it on our servers; share it with other users; and perform it, in case Your Content is something like music or video.&quot;
评论 #33458632 未加载
评论 #33458755 未加载
评论 #33459906 未加载
评论 #33463458 未加载
评论 #33459012 未加载
评论 #33458641 未加载
karaterobot超过 2 年前
Does everybody credit the author when using Stack Overflow code? I have, but don&#x27;t always. Not that I&#x27;m trying to steal, I just don&#x27;t take the time, especially in personal projects.<p>This isn&#x27;t exactly the same thing, but it seems to me that three of the biggest differences are:<p>1. Stack Overflow code is posted for people to use it (fair enough, but they do have a license that requires attribution anyway, so that&#x27;s not an escape)<p>2. Scale (true; but is it a fundamental difference?)<p>3. People are paying attention in this case. Nobody is scanning my old code, or yours, but if they did, would they have a case?<p>I dunno. I&#x27;m more sympathetic to visual artists who have their work slurped up to be recapitulated as someone else&#x27;s work via text to image models. Code, especially if it is posted publicly, doesn&#x27;t feel like it needs to be guarded. I&#x27;m not saying this is <i>correct</i>, just saying that&#x27;s my reaction, and I wonder why it&#x27;s wrong.
Imnimo超过 2 年前
On page 18, they show Copilot produces the following code:<p>&gt;function isEven(n) {<p>&gt; return n % 2 === 0;<p>&gt;}<p>They then say, &quot;Copilot’s Output, like Codex’s, is derived from existing code. Namely, sample code that appears in the online book Mastering JS, written by Valeri Karpov.&quot;<p>Surely everyone reading this has written that code verbatim at some point in their lives. How can they assert that this code is derived specifically from Mastering JS, or that Karpov has any copyright to that code?
评论 #33457549 未加载
评论 #33458340 未加载
评论 #33457829 未加载
评论 #33457681 未加载
评论 #33457706 未加载
评论 #33457951 未加载
评论 #33476619 未加载
评论 #33457650 未加载
评论 #33461555 未加载
评论 #33457899 未加载
celestialcheese超过 2 年前
Maybe I&#x27;m being too cynical, but this feels like it&#x27;s more a law firm and individual looking to profit and make their mark in legal history rather than an aggrieved individual looking for justice.<p>Programmer&#x2F;Lawyer Plaintiff + upstart SF Based Law Firm + novel technology = a good shot at a case that&#x27;ll last a long time, and fertile ground to establish yourself as experts in what looks to be a heavily litigated area over the next decade+.
评论 #33458093 未加载
评论 #33458165 未加载
评论 #33457945 未加载
评论 #33458164 未加载
评论 #33458230 未加载
评论 #33458220 未加载
评论 #33458249 未加载
评论 #33458510 未加载
评论 #33458235 未加载
评论 #33458277 未加载
评论 #33458311 未加载
评论 #33457881 未加载
评论 #33458205 未加载
xchip超过 2 年前
LOL we look like taxi drivers fighting Uber.<p>If Kasparov uses chess programs to be better at chess maybe we can use copilot to be better developers?<p>Also, anyone, either a person or a machine, is welcome to learn from the code I wrote, actually that is how I learnt how to code, so why would I stop others from doing the same?.
评论 #33461116 未加载
评论 #33458204 未加载
abouttyme超过 2 年前
I suspect this will be the first of many lawsuits over training data sets. Just because it is obscured by artificial neural networks doesn&#x27;t mean it&#x27;s an original work that is not subject to copyright restrictions.
评论 #33457520 未加载
naillo超过 2 年前
I&#x27;m kinda sceptical that this goes anywhere given that basically they say that whatever copilot outputs is your responsibility to vet that it doesn&#x27;t break any copyright (obviously that goes against the promise of it and the PR but that&#x27;s the small print that gets them out of trouble).
评论 #33457408 未加载
评论 #33457414 未加载
评论 #33458536 未加载
评论 #33457383 未加载
评论 #33458426 未加载
iworshipfaangs2超过 2 年前
It&#x27;s also a class action,<p>&gt; behalf of a pro­posed class of pos­si­bly mil­lions of GitHub users...<p>The appendix includes the 11 licenses that the plaintiffs say GitHub Copilot violates: <a href="https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;pdf&#x2F;1-1-github_complaint_appendix_a.pdf" rel="nofollow">https:&#x2F;&#x2F;githubcopilotlitigation.com&#x2F;pdf&#x2F;1-1-github_complaint...</a>
cmrdporcupine超过 2 年前
If Microsoft is so confident in the legality and ethics of Copilot, and that it doesn&#x27;t leak or steal proprietary IP... they should go train it on the MS Word and Windows and Excel source trees.<p>What&#x27;s that? They don&#x27;t want to do that? Why not?
评论 #33459687 未加载
评论 #33476710 未加载
评论 #33459329 未加载
jeffhwang超过 2 年前
Wow, this is interesting iteration in the ongoing divide between &quot;East Coast code&quot; vs. &quot;West Coast code&quot; as defined by Larry Lessig. For background, see <a href="https:&#x2F;&#x2F;lwn.net&#x2F;Articles&#x2F;588055&#x2F;" rel="nofollow">https:&#x2F;&#x2F;lwn.net&#x2F;Articles&#x2F;588055&#x2F;</a>
IceWreck超过 2 年前
I am not against this lawsuit but I&#x27;m against the implications of this because it can lead to disastrous laws.<p>A programmer can read available but not oss licensed code and learn from it. Thats fair use. If a machine does it, is it wrong ? What is the line between copying and machine learning ? Where does overfitting come in ?<p>Today they&#x27;re filing a lawsuit against copilot.<p>Tomorrow it will be against stable diffusion or (dall-e, gpt-3 whatever)<p>And then eventually against Wine&#x2F;Proton and emulators (are APIs copyrightable)
评论 #33457603 未加载
评论 #33457546 未加载
评论 #33457517 未加载
评论 #33457928 未加载
评论 #33457791 未加载
评论 #33457711 未加载
评论 #33457668 未加载
评论 #33457525 未加载
评论 #33457903 未加载
评论 #33457694 未加载
评论 #33457874 未加载
评论 #33457647 未加载
评论 #33457554 未加载
评论 #33473690 未加载
评论 #33457533 未加载
评论 #33458250 未加载
评论 #33457655 未加载
评论 #33457976 未加载
评论 #33457564 未加载
评论 #33457727 未加载
elcomet超过 2 年前
This is why we can&#x27;t have nice things. Copilot is the best thing that happened in developper tools since a long time, it increased a lot my productivity. Please don&#x27;t ruin it.
评论 #33462073 未加载
protomyth超过 2 年前
I really feel that Andy Warhol Foundation for the Visual Arts, Inc. v. Goldsmith[0] is going to have a big effect on this type of thing. They are basically relying on their AI magic to make it transformative. I&#x27;m starting to think the era of learning from material other people own without a license &#x2F; permission is going to end quickly.<p>0) <a href="https:&#x2F;&#x2F;www.scotusblog.com&#x2F;case-files&#x2F;cases&#x2F;andy-warhol-foundation-for-the-visual-arts-inc-v-goldsmith&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.scotusblog.com&#x2F;case-files&#x2F;cases&#x2F;andy-warhol-foun...</a>
topher6345超过 2 年前
Is it not in the agency of the developer to hit the save button?<p>It seems like GitHub Copilot can spit out copyrighted works all day but the person running the text editor has to &quot;choose&quot; which Copilot output to actually save&#x2F;commit&#x2F;deploy.<p>Does it really matter that much &quot;how&quot; the text in your text editor gets there? You write it yourself or copy&#x2F;paste it or have Copilot generate it. Ultimately the individual that &quot;approved&quot; it to be saved to the disk is the one violating the copyright, Copilot is just making a &quot;suggestion&quot;.
nullc超过 2 年前
I think if this is successful it will be very bad for the open world.<p>Large platforms like github will just stick blanket agreements into the TOS which grant them permission (and require you indemnify them for any third party code you submit). By doing so they&#x27;ll gain a monopoly on comprehensively trained AI, and the open world that doesn&#x27;t have the lever of a TOS will not at all be able to compete with that.<p>Copilot has seemed to have some outright copying problems, presumably because its a bit over-fit. (perhaps to work at all it must be because its just failing to generalize enough at the current state of development) --- but I&#x27;m doubtful that this litigation could distinguish the outright copying from training in a way that doesn&#x27;t substantially infringe any copyright protected right (e.g. where the AI learns the &#x27;ideas&#x27; rather than verbatim reproducing their exact expressions).<p>The same goes for many other initiatives around AI training material-- e.g. people not wanting their own pictures being used to train facial recognition. Litigating won&#x27;t be able to stop it but it will be able to hand the few largest quasi-monopolisits like facebook, google, and microsoft a near monopoly over new AI tools when they&#x27;re the only ones that can overcome the defaults set by legislation or litigation.<p>It&#x27;s particularly bad because the spectacular data requirements and training costs already create big centralization pressures in the control of the technology. We will not be better off if we amplify these pressures further with bad legal precedents.
评论 #33476876 未加载
bkuhn超过 2 年前
In case folks here were curious, we at the Software Freedom Conservancy have asked the Plaintiffs to endorse the Principles of Community-Oriented GPL enforcement: <a href="https:&#x2F;&#x2F;sfconservancy.org&#x2F;news&#x2F;2022&#x2F;nov&#x2F;04&#x2F;class-action-lawsuit-filing-copilot&#x2F;" rel="nofollow">https:&#x2F;&#x2F;sfconservancy.org&#x2F;news&#x2F;2022&#x2F;nov&#x2F;04&#x2F;class-action-laws...</a><p>… &amp; of course we again ask Microsoft&#x27;s GitHub to start respecting FOSS licenses, cooperate with the community, &amp; retract their incorrect claim that their behavior is “fair use”.<p>A few more links to our work on this issue:<p><a href="https:&#x2F;&#x2F;sfconservancy.org&#x2F;blog&#x2F;2022&#x2F;feb&#x2F;03&#x2F;github-copilot-copyleft-gpl&#x2F;" rel="nofollow">https:&#x2F;&#x2F;sfconservancy.org&#x2F;blog&#x2F;2022&#x2F;feb&#x2F;03&#x2F;github-copilot-co...</a> <a href="https:&#x2F;&#x2F;sfconservancy.org&#x2F;news&#x2F;2022&#x2F;feb&#x2F;23&#x2F;committee-ai-assisted-software-github-copilot&#x2F;" rel="nofollow">https:&#x2F;&#x2F;sfconservancy.org&#x2F;news&#x2F;2022&#x2F;feb&#x2F;23&#x2F;committee-ai-assi...</a>
foooobaba超过 2 年前
It seems like we should come to agreement on what the license is intended for, given that when the licenses were created in a time before AI like this existed. If the authors did not intend their code to be used like this, should we not respect it? Also, does it make sense to create new licenses which explicitly state whether using it for AI training is acceptable or not - or are our current licenses good enough?
solomatov超过 2 年前
The most important part of this is not whether the lawsuit will be won or lost by one of the parties, but what is the legality of fair use in machine learning, and language models. There&#x27;s a good chance that it gets to Supreme Court and there will be a defining precedent to be used by future entrepreneurs about what&#x27;s possible and what&#x27;s not.<p>P.S. I am not a lawyer.
warbler73超过 2 年前
It seems obvious that AI models are derivative works of the works they are trained on but it also seems obvious that it is totally legally untested whether they are derivative works in the formal legal sense of copyright law. So it should be a good case <i>assuming</i> we have wise and enlightened judges who understand all nuances and can guide us into the future.
buzzy_hacker超过 2 年前
Copilot has always seemed like a blatant GPL violation to me.
评论 #33498220 未加载
评论 #33457538 未加载
foooobaba超过 2 年前
If github or google indexes source code using a neural net to help you find it, given a query, is that also illegal? If you think of copilot as something that helps you find code you’re looking for, is it all that different, and if so, why?<p>In this case, wouldn’t the users of copilot be the ones responsible for any copyrighted code they may have accessed using copilot?
评论 #33457909 未加载
评论 #33458503 未加载
hu3超过 2 年前
A a GitHub user, is there a way to support GitHub against this lawsuit?<p>Obviously not financially as Microsoft has basically YES amounts of money.
评论 #33458296 未加载
awestroke超过 2 年前
If this leads anywhere I&#x27;ll be pissed. I love CoPilot.
评论 #33458067 未加载
评论 #33458231 未加载
still_grokking超过 2 年前
I hope MS used a lot of AGPL code to train Copilot… This would be fun.<p>But no matter how this goes, in case training AI with copyrighted inputs is &quot;fair use&quot; that&#x27;ll end up as the ultimate &quot;copyright laundry machine&quot; like this &quot;joke&quot; project here:<p><a href="https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20220104214929&#x2F;https:&#x2F;&#x2F;fairuseify.ml&#x2F;" rel="nofollow">https:&#x2F;&#x2F;web.archive.org&#x2F;web&#x2F;20220104214929&#x2F;https:&#x2F;&#x2F;fairuseif...</a><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=27796124" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=27796124</a> (302 points, 151 comments)
rafaelturk超过 2 年前
Like everything legally related: This is not about open source fairness, protecting innovation, it&#x27;s all about making money.
throwaway675309超过 2 年前
Even if this succeeds, you&#x27;ve already lost.<p>1. The ability to be able to run and train these models is going to eventually be perfectly plausible on a home machine.<p>2. It&#x27;s only a matter of time before models, e.g. a popular model scraped from all of the code on GitHub, is a publicly available torrent.<p>3. People will be able to just run it locally as an integrated plug-in in jet brains or VS code.<p>4. You&#x27;ll never know if somebody has lifted their code in violation of a license anymore than you would be able to tell if somebody used code from stack overflow without attribution in any commercial endeavor.<p>The End.
评论 #33460559 未加载
falcolas超过 2 年前
Crackpot Theory: Copilot (and by association many ML tools) is a form of probabilistic encryption. Once encoded, it&#x27;s virtually impossible to pull the code (plaintext) directly out of the raw ML model (the cyphertext), yet when the proper key is input (&#x27;&#x2F;&#x2F;sparse matrix transpose&#x27;), you get the relevant segment of the original function (the plaintext) back.<p>We&#x27;ve even seen this with stable diffusion image generation, where specific watermarks can be re-created (decrypted?) deterministically with the proper input.
评论 #33476807 未加载
spir超过 2 年前
The part of GitHub Copilot to which I object is that it&#x27;s trained on private repos. Where does GitHub get off consuming explicitly private intellectual property for their own purposes?
garfieldnate超过 2 年前
If GitHub ends up having to tweak their product to avoid ethical&#x2F;legal concerns, I actually imagine it could still be pretty cool. Right now Copilot is a black box that spits out code with no attributes; what if they worked on instead making it a glass box, where it always brings up snippets of other projects along with their licensing info so that you can decide how to incorporate the ideas fairly yourself? Or they could still output the same code suggestions, but always include attribution and license data along with it. Making the product more transparent would probably make more people comfortable with using it, anyway.
Cloudef超过 2 年前
Unless the copilot spits out complete programs or libraries that are 1:1 to someone elses who cares? Caring about random small code snippets is dumb.
bilsbie超过 2 年前
Laws need to change to match technology.<p>Did you know before airplanes were invented common law said you owned the air above your land all the way to the heavens.
评论 #33457578 未加载
brookst超过 2 年前
I wonder if the plaintiffs&#x27; code would stand up to scrutiny of whether any of it was copied, even unintentionally, from other code they saw in their years of learning to program? I know that I have more-or-less transcribed from Stack Overflow&#x2F;etc, and I have a strong suspicion that I have probably produced code identical to snippets I&#x27;ve seen in the past.
评论 #33457493 未加载
layer8超过 2 年前
Copilot reminds me of the Borg: You will be assimilated. We will add your technological distinctiveness to our own. Resistance is futile.
omegacharlie超过 2 年前
Think some of the negativity about Copilot may be the perception that if an individual or small startup attempted training an ML model from public source-code and commercialised a service from it they would be drowning in legal issues from big companies not happy with their code used in such a product.<p>In addition just because code is available publicly on GitHub does not necessarily mean it is permissively licensed to use elsewhere, even with attribution. Copyright holders not happy with their copyrighted works publicly accessible can use the DMCA to issue take-downs that GitHub does comply with but how that interacts with Copilot and any of its training data is a different question.<p>As much as the DMCA is bad law rather funny seeing Microsoft be charged in this lawsuit with the less known provision against &#x27;removal of copyright management information&#x27;. Microsoft does have more resources to mount at defence so it will probably end up different compared to a smaller player facing this action.
rolenthedeep超过 2 年前
Consider each repo on github to be a movie. What copilot does is to search for sequences of frames from any movie which line up to create a new coherent movie.<p>Individually, each frame is protected by the copyright of the movie it belongs to. But what happens if you take a million frames from a million different movies and just arrange them in a new way?<p>That&#x27;s the core question here. Is the new movie a new copyrightable work, or is it plagiarizing a million other works at once? Is it legal to use copyrighted works in this way?<p>The other question is if it is <i>right</i> to use copyrighted works this way. Is this within the spirit of open source software? Or is this just a bad corporation taking advantage of your good will?<p>I&#x27;m not sure where I stand on this, it&#x27;s a complicated problem for sure. Definitely interested to see how this plays out in court.
评论 #33476818 未加载
poulpy123超过 2 年前
&gt;By train­ing their AI sys­tems on pub­lic GitHub repos­i­to­ries (though based on their pub­lic state­ments, pos­si­bly much more) we con­tend that the defen­dants have vio­lated the legal rights of a vast num­ber of cre­ators who posted code or other work under cer­tain open-source licenses on GitHub.<p>I don&#x27;t know about the US laws in copyright so I can&#x27;t comment on the legal documents but this website is not complaining that copilot is reproducing copyrighted content but it was trained on copyrighted content. I don&#x27;t see how you can forbid someone or something to read and learn from something that is public (once again producing is another problem)
throwaway675309超过 2 年前
How much code is necessary to be considered a copyright infringement from an existing code base?<p>For example let&#x27;s say I&#x27;ll take a single frame of animation from a cartoon, The frame contains a mountain, house, and a couple characters although those characters are not integral to the actual cartoon maybe they&#x27;re extras (villagers and not named characters something like Mickey Mouse for example)<p>I draw a picture of a lake with a cabin next to it, then start to draw a frontiersman but I trace one of his arms from a villager of that previous frame of animation... Number one am I in danger of copyright infringement (have I hit some arbitrary threshold), and number two: am I causing monetary losses for the cartoon?
jasonladuke0311超过 2 年前
Merits of the case aside, I&#x27;m befuddled that a company with a legal team like Microsoft approved this product. Is their assumption that this would bring in more revenue than potentially defending it in court? The math doesn&#x27;t make sense to me.
RamblingCTO超过 2 年前
lol @ &quot;open-source soft­ware piracy&quot;<p>If I&#x27;m being honest I&#x27;m a bit annoyed at this. What&#x27;s the problem and what&#x27;s the point of this?
评论 #33458178 未加载
评论 #33457428 未加载
评论 #33457502 未加载
renewiltord超过 2 年前
It doesn&#x27;t make sense. If I make a piece of software that curls a random gist and then puts it into your editor am I infringing or are you infringing when you run it or are you infringing when you use that file and distribute it somewhere?
评论 #33458015 未加载
mezbot超过 2 年前
This issue seems to have an obvious solution that I fail to see anyone mention: Treat copilot simply as a tool, let it be trained on whatever without any consent requirements. However the outputs should be subject to copyright as with any other code produced by a human. Then on a case by case basis courts can decide if infringement has occurred. The idea of banning copilot or other AI models as a whole just seems like a collective case of sour grapes because innovation and automation is finally threatening some people who only expected these things to affect the working class
EMIRELADERO超过 2 年前
I think it&#x27;s a great time to explain why this won&#x27;t hit AI art such as Stable Diffusion, even if GitHub loses this case.<p>The crux of the lawsuit&#x27;s argument is that the AI unlawfully <i>outputs copyrighted material</i>. This is evident in many tests with many people here and on Twitter even getting <i>verbatim comments</i> out of it.<p>AI art, in the other hand, is not capable of outputting the images from its training set, as it&#x27;s not a collage-maker, but an artificial brain with a paintbrush and virtual hand.
评论 #33458088 未加载
评论 #33458020 未加载
评论 #33458149 未加载
评论 #33476830 未加载
评论 #33458354 未加载
fancyfredbot超过 2 年前
If a software developer learns how to code better by reading GPL software and then later uses the skills they developed to build closed source for profit software should they be sued?
评论 #33458614 未加载
评论 #33458300 未加载
评论 #33458233 未加载
评论 #33458292 未加载
hjroberts超过 2 年前
Whether it is legally wrong or not to scan OSS code (I think it <i>is</i> wrong), there has been a time-honored precedent for disallowing automated scanning:<p><pre><code> robots.txt </code></pre> This is exactly what is needed for source code, and the default (no robots.txt) should be &quot;disallow&quot;.<p>The fact that the Web <i>has</i> considered this moral issue should be a strong hint for the AI people not to take a purely legal stance but consider the OSS community that they are so heavily using.
atum47超过 2 年前
Forgive my ignorance, but who is going to benefit from this lawsuit? I have a lot of code on GitHub, can I, for instance, expect a check in the mail in case of a win?
评论 #33458751 未加载
datacruncher01超过 2 年前
I think the software is probably ok provided that, the sources are credited (ie, if co-pilot copies code from say SDL, then the relevant code sections need to be correctly attributed, the mandatory license readme copied to the project so all code is following the open source licenses used. That&#x27;s literally the purpose of open source licenses. If Copilot can&#x27;t be bothered to do that, then yeah it should be shut down.
cothrowaway88超过 2 年前
Made a throwaway since I guess this stance is controversial. I could not care less about how copilot was made and what kind of code it outputs. It&#x27;s useful and was inevitable.<p>I&#x27;m 1000% on team open source and have had to refer to things like tldrlegal.com many times to make sure I get all my software licensing puzzle pieces right. Totally get the argument for why this litigation exists in the present.<p>Just saying in general my friends I hope you have an absolutely great day. Someone will be wrong on the internet tomorrow, no doubt about it. Worry about something productive instead.<p>This one has the feel of being nothing more than tilting at windmills in the long run.
0cf8612b2e1e超过 2 年前
Is there any amount of public data&#x2F;code&#x2F;whatever I can make an offline backup of today in the event this gets pulled?
评论 #33457792 未加载
matthewwolfe超过 2 年前
I will never understand why people push code to public repos and then complain when someone or something uses that code. Code that you want to keep private or make money off of should be private. Only publish stuff to the public that you want other people to see and learn from. All the complaints about attribution… who cares.
评论 #33461393 未加载
pmarreck超过 2 年前
This will fail. Copilot is too good, and only suggests snippets or small functions, not entire classes for example.
User23超过 2 年前
Copilot is clearly a derivative work. So is every other similar model. How is this even up for discussion?
stovenctl超过 2 年前
The comparison I would draw is it&#x27;s a statistics based search engine for code.<p>Sometimes the query is the first half of a small statement that we can fill in with common patterns. Useful, fair.<p>Sometimes the query is a signature like `fn fast_inv_sqrt` that copies someone&#x27;s code and doesn&#x27;t attribute it.
nuc1e0n超过 2 年前
My own view is that it is not legal for humans to produce derivatives of copyrighted works currently. So therefore it is probably already not legal to train an artificial intelligence using copyrighted works to in order to produce derivatives either.
jjgon1781超过 2 年前
I am surprise in the amount of people that in favor in copilot being train with copyright data.
scoot超过 2 年前
The editorialized title isn&#x27;t correct. The lawsuit is against GitHub for Copilot not against GitHub Copilot, which is not a &quot;legal person&quot;.<p>A better shortening if the original title is simple &quot;We’ve filed a law­suit chal­leng­ing GitHub Copi­lot&quot;
reachableceo超过 2 年前
Let me (start or join the call) for federal investigation and the filing of criminal complaints in all relevant locales.<p>Grand theft , interstate wire fraud and conspiracy for same.<p>This is a criminal matter as well as civil. Intentional and knowing violation of the law.<p>We must not let our work be taken!
gcau超过 2 年前
As much as I love the little guy beating the big evil company, I hope the lawsuit doesn&#x27;t cause anything to happen to copilot. Maybe some changes, like better protection against emitting 1:1 licensed code or opting out your code from training.
vlovich123超过 2 年前
Can someone explain to me Microsoft’s decision here to use GPL code in the training set? It would seem like sticking to non-attribution &#x2F; non-viral licenses would have kept them in the clear. Was that an insufficient size data set?
评论 #33476853 未加载
eurasiantiger超过 2 年前
Maybe we just need to prompt it to include the proper licenses and attributions. &#x2F;s
评论 #33458120 未加载
thesuperbigfrog超过 2 年前
How original is the generated code?<p>Can the generated code be traced back to the code used for training and the original copyrights and licenses for that code?<p>If so, what attribution(s) and license(s) should apply to the generated code?
评论 #33457400 未加载
arpowers超过 2 年前
The proper way to think about these LLM is similar to plagiarism.<p>Seems to me the underlying data should be opt-in from creators and licenses should be developed that take AI into consideratiin.
Aeolun超过 2 年前
I find this whole subject exhausting. The only reason I’m glad there is a lawsuit is that we can finally put this thing to rest when either party wins.
Yahivin超过 2 年前
Copilot does include the licenses...<p>Start off a comment with &#x2F;&#x2F; MIT license<p>Then watch parts of various software licenses come out including authors&#x27; names and copyrights!
marmada超过 2 年前
All these people whining about copyright need to consider: is the issue Copilot, or is the issue copyright.
amelius超过 2 年前
Can Copilot reproduce Numerical Recipes in C?<p>(asking because I know the authors were kinda famous for being very litigious).
HeavyStorm超过 2 年前
&quot;Angry people brandish their fists against the incoming revolution&quot; is also a good title.
sensanaty超过 2 年前
I personally hope they win, and win big. Anything that ruins Micro$oft&#x27;s day is a boon to mine.
clusterhacks超过 2 年前
Did Microsoft use the source code of Windows (in whole or in part) as training input to Copilot?
评论 #33476865 未加载
machiste77超过 2 年前
bruh, come on! you&#x27;re gonna ruin it for the rest of us
kgarten超过 2 年前
on a tangent ... beautiful typography, I love Matthew Butterick&#x27;s work on legible fonts an his guide to practicle typography.<p>all the best with the lawsuit.
barelysapient超过 2 年前
MSFT to $0 anyone?
i_like_apis超过 2 年前
I love that this is going to loose.
SighMagi超过 2 年前
I did not see that coming.
SurgeArrest超过 2 年前
I hope this case will fail and establish a good precedent for all future AI litigations and may be even prevent new ones. Your code is open source - irregardless of license, one might read it as a text book and then remember or even copy snippets and re-use this somewhere else unrelated to the original application. If you don&#x27;t like this, don&#x27;t make your code open source. This was happening and is happening independent of any license all over the world by majority of developers. What Copilot and similar tools did was to make those snippets accessible for extrapolation in new applications.<p>If these folks win - we again throw progress under the bus.
评论 #33458148 未加载
评论 #33458107 未加载
评论 #33458200 未加载
评论 #33458227 未加载
评论 #33458269 未加载
评论 #33458299 未加载
评论 #33458270 未加载
评论 #33458316 未加载
ISL超过 2 年前
Can anyone with Copilot access give a short summary of its response to the prompts:<p><pre><code> function force=Gmmr2Array(mass1, mass2) </code></pre> and<p><pre><code> function [force, torque]=pointMatrixGravity(array1,array2) </code></pre> ?<p>I&#x27;d love to know if some of my GPL v3 code [1, 2] has landed in the training set<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;4kbt&#x2F;NewtonianEotWashToolkit&#x2F;blob&#x2F;master&#x2F;matlab-src&#x2F;Gmmr2Array.m" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;4kbt&#x2F;NewtonianEotWashToolkit&#x2F;blob&#x2F;master&#x2F;...</a><p>[2] <a href="https:&#x2F;&#x2F;github.com&#x2F;4kbt&#x2F;NewtonianEotWashToolkit&#x2F;blob&#x2F;master&#x2F;matlab-src&#x2F;pointMatrixGravity.m" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;4kbt&#x2F;NewtonianEotWashToolkit&#x2F;blob&#x2F;master&#x2F;...</a>
评论 #33457539 未加载
评论 #33457745 未加载
评论 #33457587 未加载
评论 #33457486 未加载
m00x超过 2 年前
The only people who gain out of class lawsuits are the lawyers.<p>This person (a lawyer) saw an opportunity to make money and jumped on it like a hungry tiger on fresh meat.
评论 #33458774 未加载
评论 #33459089 未加载
评论 #33457758 未加载
评论 #33457735 未加载
Entinel超过 2 年前
I don&#x27;t have a comment on this personally but I want to throw this out there because every time I see people criticizing Copilot or Dall-E someone always says &quot;BUT ITS FAIR USE! Those people don&#x27;t seem to grasp that &quot;Fair Use&quot; is a defense. The burden is not on me to prove what you are doing is not fair use; the burden is on you to prove what you are doing is fair use
评论 #33457955 未加载
VoodooJuJu超过 2 年前
As celestialcheese says [1], it seems like a manufactured case for the purpose of furthering someone&#x27;s legal career rather than seeking remittance for any violations made by Copilot.<p>But I like to put on my conspiracy hat from time to time, and right now is one such time, so let&#x27;s begin...<p>Though the motivations behind this case are uncertain, what is certain is that this case will establish a precedent. As we know, precedents are very important for any further rulings on cases of a similar nature.<p>Could it be the case that Microsoft has a hand in this, in trying to preempt a precedent that favors Copilot in any further litigation against it?<p>Wouldn&#x27;t put it past a company like Microsoft.<p>Just a wild thought I had.<p>[1] <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33457826" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33457826</a>
评论 #33458420 未加载
bugfix-66超过 2 年前
Ask HN: I want to modify the BSD 2-Clause Open Source License to explicitly prohibit the use of the licensed software in training systems like Microsoft&#x27;s Copilot (and use during inference). How should the third clause be worded?<p><pre><code> The No-AI 3-Clause Open Source Software License Copyright (C) &lt;YEAR&gt; &lt;COPYRIGHT HOLDER&gt; All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: 1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. 2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and&#x2F;or other materials provided with the distribution. 3. Use in source or binary forms for the construction or operation of predictive software generation systems is prohibited. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS &quot;AS IS&quot; AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. </code></pre> <a href="https:&#x2F;&#x2F;bugfix-66.com&#x2F;f0bb8770d4b89844d51588f57089ae5233bf67e8c0ace80303bfd66059a507c4" rel="nofollow">https:&#x2F;&#x2F;bugfix-66.com&#x2F;f0bb8770d4b89844d51588f57089ae5233bf67...</a>
评论 #33457782 未加载
评论 #33457895 未加载
评论 #33457943 未加载
评论 #33457563 未加载
评论 #33476924 未加载
评论 #33457816 未加载
评论 #33457497 未加载
60secs超过 2 年前
This is why we can&#x27;t have nice dystopias.
评论 #33458351 未加载