TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: What license to choose in an open source project with code from an AI?

2 点作者 elnatro4 个月前
I have been working for several days in a small package that could be useful for developers as it is useful for me.<p>The problem is that the package uses some obscure built-in API that I asked a coding AI to help me with and I worked on top of that. The structure is being the same, but I changed the internal behavior (think of a callback system having all the internal code re-implemented by me).<p>Could I release the project as open source? What would be a good license? Would it be ethical? How much different needs the code to be from the one that was outputted by the AI tool? What do you do in this cases?

3 条评论

Ukv4 个月前
In the US, to my understanding:<p>1. The Copyright Office requires human authorship for a work to be protected by copyright. So in theory, someone else could copy the parts of your project that were LLM-generated without obeying your license&#x27;s requirements (such as to give credit). Parts that you added and the work as a whole would still be protected<p>2. Regarding potential infringement of the training data: from progress in ongoing cases so far, copyright&#x27;s requirement for substantial similarity has been upheld[0] for model output. So if your code doesn&#x27;t resemble some protected work (and I think common coding LLM services have some attempts at preventing this) you should be in the clear there<p>So I don&#x27;t see anything against using whatever open-source license you normally would.<p>[0]: E.G: in Stable Diffusion case, Judge William H. Orrick agreed with the defendants that &quot;plaintiffs cannot plausibly allege the Output Images are substantially similar or re-present protected aspects of copyrighted Training Images, especially in light of plaintiffs&#x27; admission that Output Images are unlikely to look like the Training Images&quot;
eesmith4 个月前
The AI adds no copyright. It either adds no copyright or there is unattributed copyright remaining from whatever training set was used.<p>The rest is under your copyright as it is the result of your creative work.<p>1) Yes, you can release it as open source.<p>2) These days I release as GPL. My experience releasing MIT&#x2F;BSD has been joyless.<p>3) It&#x27;s a shade unethical. OTOH, given the popularity of these tools, it seems your future won&#x27;t be negatively affected.<p>4) Enough to convince a judge.<p>5) I don&#x27;t use AI tools. Legally speaking, does the AI-generated code show enough creativity (which is the term for the type of materials which can be under copyright) that it could be traced back to an originator? If not, no worries. If so, how much are you willing to deal with that happening?
评论 #42791518 未加载
not_your_vase4 个月前
It&#x27;s a legal mess, unfortunately no one really knows the answer, because it&#x27;s a brand new problem.<p>Once someone challenges the status quo[1] with a big splash, then we will have a precedent at least.<p>[1] - the status quo is kind of pretending that there is nothing to see here, and that it will stay like. AI code is kind of treated like open code on GitHub with random OSS license (without having an explicit license)
评论 #42791205 未加载