>For the 16 plaintiffs, the complaint indicates that they used ChatGPT, as well as other internet services like Reddit, and expected that their digital interactions would not be incorporated into an AI model.<p>I don't expect this lawsuit to lead anywhere. But if it does, I hope it leads to some clear laws regarding data privacy and how binding a TOS is. The recent ruling regarding web scraping makes the case against OpenAI a lot weaker. [1] Scraping publicly available data is legal. People's consent wasn't needed for their data to be used; there was an implicit assumption the moment the data was published to the public, like on Reddit or YouTube.<p>I keep seeing this idea recur in the suit:<p>>Plaintiff ... is concerned that Defendants have taken her skills and expertise, as reflected in [their] online contributions, and incorporated it into Products that could someday result in [their] professional obsolescence ...<p>Anyone is able to file a suit; I wish people would stop assuming that a news report automatically means it has merit.<p>1. <a href="https://www.natlawreview.com/article/hiq-and-linkedin-reach-proposed-settlement-landmark-scraping-case" rel="nofollow noreferrer">https://www.natlawreview.com/article/hiq-and-linkedin-reach-...</a>
I mean, it ingested all of the content from my blog. Without my permission. It's not a major part of their corpus of data, but still -- I wasn't asked and I don't really care to donate work to large corporations like that.<p>So the technology is cool, but I'm firmly of the stance that they cut corners and trampled peoples' rights to get a product out the door. I wouldn't be entirely unhappy if this iteration of these products were sued into the ground and were forced to start over on this stuff The Right Way.
Hard to understand how this is a crime, or how they came up with 3 billion dollars of damage.<p>Seems like if it's legal for a person to do, it should, for the most part, be legal for software to do.
Fishing expedition. Will probably get thrown out because no particular injury can be enunciated. OpenAI scraped HN as well, and I don't consider my HN posts private because anyone can come here and read them, including artificial intelligences.
If we dissect this case, it seems to revolve around two central questions: what constitutes 'public' data and to what extent can AI models leverage such data without infringing upon individual privacy. This lawsuit may well set a significant precedent in defining the boundaries of AI ethics and data privacy.
When this happened to Stable Diffusion, it was easy for me to consider it a necessary evil to progress humanity.<p>When this happens to closedAI, it just seems like a profit grab.<p>Not that it changes the legality of it. Just optics.<p>Wonder if that matters in court.
It’s okay, I’m sure everything is going to be fine when Microsoft and ChatGPT hot mic your next doctor appointment.<p><a href="https://news.ycombinator.com/item?id=36498294">https://news.ycombinator.com/item?id=36498294</a>
A tangential question...but does anyone know what software is used to generate legal documents that look like the PDF linked in the article? I’ve played with LaTeX templates a bit, but I seriously doubt law firms are futzing around with LaTeX for documents as complex as this. They must have some software that produces this formatting.
What I noticed is that the privacy setting which should prevent OpenAI from using my data for training purposes has already been deleted twice, and I had to set it again. No idea what that means, or whether the data I entered before I noticed the setting was gone is now owned by OpenAI. Anyway, it is obvious that privacy is not a priority for them. Also, it's known that YC companies are informally told they shouldn't worry about privacy while scaling up. OpenAI is not a YC company, but its culture is definitely derived from it.
I am not a lawyer, just a sysadmin; but with that said, the linked PDF of the complaint is absolutely fascinating to me. It's worth it (to me) for the list of resources it cites.
Do we think this is related to media platforms seemingly walling themselves off? Requiring accounts to view content, removing API access. It seems if they can silo data off and make it difficult to access at a large scale, then they are the gatekeeper of the data and can control usage and pricing.
We talk to a lot of companies, and many want to start using generative AI but are afraid of litigation. As long as it is not clear which data a given model was trained on, and whether that data was explicitly and permissively licensed by its owners, you can't be sure what can happen.<p>We are actually working on a tool to create billion-scale free-to-use Creative Commons image datasets and prepare them for training models like Stable Diffusion. There is a blog post about it here: <a href="https://blog.ml6.eu/ai-image-generation-without-copyright-infringement-a9901b64541c" rel="nofollow noreferrer">https://blog.ml6.eu/ai-image-generation-without-copyright-in...</a>
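The license-screening step that a dataset like that depends on can be sketched roughly as follows. This is a minimal illustration, not the actual ML6 pipeline; the `license` and `url` metadata fields and the particular set of acceptable licenses are my assumptions.

```python
# Sketch: keep only permissively licensed image records before training.
# Assumption: each record carries "url" and "license" metadata fields.

# Licenses treated as safe for training (assumed set: CC0, CC-BY, CC-BY-SA).
PERMISSIVE_LICENSES = {"cc0", "cc-by", "cc-by-sa"}

def filter_permissive(records):
    """Keep only records whose license tag is in the permissive set."""
    return [r for r in records
            if r.get("license", "").lower() in PERMISSIVE_LICENSES]

raw = [
    {"url": "https://example.org/a.jpg", "license": "CC-BY"},
    {"url": "https://example.org/b.jpg", "license": "all-rights-reserved"},
    {"url": "https://example.org/c.jpg", "license": "CC0"},
]
clean = filter_permissive(raw)
print([r["url"] for r in clean])
# Only a.jpg and c.jpg survive the filter.
```

In practice the hard part is trusting the license metadata itself (platforms let uploaders mislabel images), which is presumably where most of the real engineering effort goes.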
Rather than there being lawsuit after lawsuit of this sort, we wrote an op-ed this morning that says there should be a simple, compulsory licensing fee that AI companies pay to the public -- something we called the AI Dividend: <a href="https://www.politico.com/news/magazine/2023/06/29/ai-pay-americans-data-00103648" rel="nofollow noreferrer">https://www.politico.com/news/magazine/2023/06/29/ai-pay-ame...</a>
Only 3? They should go for the whole 10, and settle for 1.<p>Now that the gates are open, we'll probably be entering the "free money" cycle soon.
If anything major comes out of this, it's probably EVEN MORE prompts and popups asking for permission to use your data. Even with GDPR, data collection and sales never stopped; it just made things more annoying by transforming every webpage into a granular terms-of-service gauntlet while the same practices continued.<p>It isn't even turned off by default. Many sites just give you an "I accept" button, or even if you want to manage the preferences, the "accept all choices" button is where the "confirm my choices" should be.<p>Bigger companies will just append this to their TOS and push it down the customer's throat. That is, if MS doesn't settle out of court and the case doesn't get thrown out along with every other major opposition to the data mining.
Wow, I really don't get it. If I were to memorize billions of pages' worth of people's private messages and medical records, then recite them live on the Internet, would that be a crime?