50M Facebook profiles harvested for Cambridge Analytica in major data breach

558 pointsby tsneed290about 7 years ago

29 comments

olivermarksabout 7 years ago

My problem with this 'outing' of CA is that Facebook explicitly commercially exists to harvest user data for Procter & Gamble, Johnson & Johnson, Fidelity etc etc so they can profile us. A million dollars is chump change in the crazy US election game. This all seems overly selective - it's ok for some people to profile but not for others. I'm not in favor of any of it to be clear but there is a definite political bias going on here. Let's not forget FB itself has a formal political unit that exists to push propaganda in foreign elections, 'stifling opposition and stoking extremism'<a href="https://www.bloomberg.com/news/features/2017-12-21/inside-the-facebook-team-helping-regimes-that-reach-out-and-crack-down" rel="nofollow">https://www.bloomberg.com/news/features/2017-12-21/inside-th...</a>

评论 #16607965 未加载

评论 #16608854 未加载

评论 #16608552 未加载

评论 #16607900 未加载

评论 #16608646 未加载

评论 #16608417 未加载

评论 #16609641 未加载

评论 #16608772 未加载

评论 #16607877 未加载

评论 #16607990 未加载

评论 #16608312 未加载

评论 #16607796 未加载

评论 #16607910 未加载

评论 #16609115 未加载

评论 #16609511 未加载

gfodorabout 7 years ago

I remember when the Obama campaign hired data scientists and used targeted social networking tools to pursuade voters who were on the fence and it was heralded as brilliant and the future of politics.I worked for a company crawling Facebook data by creating viral apps the year the original API came out. By now I am sure this is done by many companies.Why is any of this news? My understanding is that companies harvesting social networking data via viral apps and then reselling it to perform targeted voter advertising is literally a 10 year old concept. Were any laws broken here? Were there any techniques used here that were novel or done by one political party and not the other? Why are we talking about this one firm and not the many others that surely exist that are trying to do the same thing for <insert political candidate of choice>

评论 #16610288 未加载

评论 #16611122 未加载

评论 #16611761 未加载

评论 #16610219 未加载

评论 #16611154 未加载

评论 #16610496 未加载

评论 #16619260 未加载

评论 #16610992 未加载

评论 #16611391 未加载

评论 #16626212 未加载

评论 #16612220 未加载

评论 #16610691 未加载

评论 #16610325 未加载

heckanoobsabout 7 years ago

I used to make fb apps, any app gets full access to fb's user graph as long as they request the relevant permissions.Users don't comprehend what permissions they are giving to apps they run. A quiz site getting full access is not surprising.Once an app has any amount of access the only thing stopping them from harvesting their own clone of your data is an agreement in the ToS that you won't store PII for more than x hours.These rules are like the bare minimum to stop good actors. If you're a bad actor fb does not do a single thing to protect users from you. As evident in this report fb is also not above blaming the users for the hostile environment fb created and placed them in.There must be countless copies of harvested fb data out there. My employer at the time once realized we were accidentally storing some PII permanently in a derived field. If good actors can't even keep above the law what do you think the ecosystem looks like in the shadows?IMO we aren't having the right conversation with fb over how they mistreat our PII and we should loosen the definition of that term when companies like the one in the article can infer our political preferences from the innocuous bits of our lives we tag on facebook.We should be asking why even an authorized API that can't stop you from copying the data doesn't count as a systemetized data breach.

评论 #16609478 未加载

评论 #16609846 未加载

patjaabout 7 years ago

I was curious how the figure leaped from the 270k cited in the Facebook press release to this 50M figure.It sounds like they never had full access to the Facebook profiles beyond the 270k who installed the app, but just harvested the friend lists of those 270k. This doesn't give the app developer full access to the friends' profile data, but I guess once you have the network of friend connections you can use other public data sources to fill in or infer the gaps. And of course some of those 50M will have FB profiles that are fully public open books ready for anyone to harvest.I will say as someone who has developed Facebook apps, the whole ecosystem is pretty much on the honor system for protecting user data. There are some seemingly random and capricious (and often erroneous) abuse detection algorithms, but once an app has access to user data who knows what they do with it and whether it was kept secure -- surely Facebook has no idea unless they perform invasive manual physical audits.

评论 #16607481 未加载

评论 #16607835 未加载

评论 #16610179 未加载

评论 #16608396 未加载

loxiasabout 7 years ago

Minor point of confusion -- this article refers multiple times to a "data breach". ("...one of the largest-ever breaches of Facebook data...", "At the time of the data breach...", "...first reported the breach...")As far as I can tell, there is no data breach, right? It sounds like CA got facebook data through an app they wrote, thisisyourdigitallife, which did some shady things.Also, "The New York Times is reporting that copies of the data harvested for Cambridge Analytica could still be found online".The link is: <a href="https://www.nytimes.com/2018/03/17/us/politics/cambridge-analytica-trump-campaign.html" rel="nofollow">https://www.nytimes.com/2018/03/17/us/politics/cambridge-ana...</a>Anyone know what they're talking about? I haven't heard of any 50-million-profile data dump, and I really like collecting corpora...

评论 #16610572 未加载

评论 #16610549 未加载

ENOTTYabout 7 years ago

One thing other commenters haven't mentioned is that Facebook asked the other parties to delete the data and promise never to use it again and the other parties even certified that they had done so, but the whistleblower is alleging they lied to Facebook.Maybe that's legally actionable.

urlwolfabout 7 years ago

OK, this feels like it will bring about the end. Of something. Facebook? Massive use of data for political campaigns? Anything?If we keep consuming news like this, and do nothing, it's going to scalate massively. Same way as when Snowden told people they were spyed on and they collectively shrugged and continued with their lives as if nothing had happened.We, people in tech, have a massive moral burden to educate 'normals' on the meaning of news like this!

评论 #16608088 未加载

评论 #16607966 未加载

评论 #16607998 未加载

评论 #16607924 未加载

734786710934about 7 years ago

This wasn't a data breach, it was a misuse of data by a third party.

评论 #16607136 未加载

评论 #16607290 未加载

评论 #16607285 未加载

评论 #16607117 未加载

评论 #16608170 未加载

评论 #16611308 未加载

评论 #16607751 未加载

评论 #16607311 未加载

评论 #16607080 未加载

mcintyre1994about 7 years ago

I think I finally understand what the point of Facebook apps is and why they've always felt in some way dodgy. It's been clear for years that Facebook apps can get your user data, and that of your friends, and that Facebook designed them that way and were aware of that. The Guardian article even mentions that one of the apps used by GSR to gather data for Cambridge Analytica triggered Facebook security protocols trying to pull too much data.What I didn't understand is why Facebook would grant this - maybe at some point they needed viral apps on the platform and giving user data away encouraged people to make them - but why did it still work a few years ago? But this article made it click: all you can really do to monetise or use millions of profiles of Facebook users is target them with ads, and Facebook is the only place you can target those ads effectively given Facebook user data, and the more data you have the more effective those ads are, the more you pay Facebook.Facebook don't sell user data, they've long said that - and it's true. They sell the ability to target advertising to their users, and you can do that a whole lot better if you have their user data. So they don't sell it, they give an API for their users to freely give it away, knowing that once you've done all your analysis on it you'll conclude that you should spend money paying Facebook to actually deliver your messages to those users.

fjsolwmvabout 7 years ago

> Facebook denies that the harvesting of tens of millions of profiles by GSR and Cambridge Analytica was a data breach. It said in a statement that Kogan “gained access to this information in a legitimate way and through the proper channels” but “did not subsequently abide by our rules” because he passed the information on to third parties.This is exactly how Facebook was designed. You get a stupid quiz or photo frame in exchange for a copy of your friends list. It's always worked that way, and it's why Facebook OAuth was more popular than Google+ and other Oauth since 5+ years ago -- because app devs can make more money from Facebook OAuth since it comes with a copy of your friends list, so they prefer to integrate Facebook.

评论 #16607758 未加载

gaiusabout 7 years ago

Facebook: "no-one herds our sheep but us, mmmkay?"

auntienomenabout 7 years ago

So... If I were in Cambridge Analytica's position, employed to influence the US election, one of the first things I'd do is match this data with any data I could find on voting patterns. Which reminds me, didn't some of the Russian APTs hack into state voter databases?

评论 #16608181 未加载

shiftfocustimeabout 7 years ago

I think it is much more important to focus on an investigation to make clear to the public how this data was used. That i think will lead into a much more interesting story. No one seems to want to go there and i don't understand why. Maybe because a lot of its clients are political parties/political individuals around the world and they do not want to be ousted for using "public opinion manipulation technology" on a wide scale.

评论 #16608470 未加载

dawhizkidabout 7 years ago

Think about all those apps where you connect your bank account via your online banking creds that have full access to everything you buy.

评论 #16609098 未加载

ceejayozabout 7 years ago

I wonder how many of the "see what you'll look like when you're 80" and "find out how you'll die" quiz apps are doing this behind the scenes.

评论 #16607336 未加载

评论 #16607147 未加载

评论 #16607177 未加载

megousabout 7 years ago

Whistleblower's account suspended.<a href="https://twitter.com/chrisinsilico/status/975335430043389952" rel="nofollow">https://twitter.com/chrisinsilico/status/975335430043389952</a>

andy_pppabout 7 years ago

This kind of work combining propaganda and disinformation with AI models and feedback into them to get a progressive change of belief is fascinating. I think of this as the first of many wars democracy will fight against AI and we are currently loosing.This comment is from the “Duped” article that has a different headline and more detail.

trhwayabout 7 years ago

For example, "Weev" got 3 years for downloading ATT user data. I wonder whether Bannon&Co would get anything ... So far it doesn't look like FB makes any push for CFAA case here. I wonder what would FB do if instead of Bannon it were a nobody like the above mentioned "weev".

myth_busterabout 7 years ago

50M doesn't strike much in FB scale, that's until...<pre><code> At the time, more than 50 million profiles represented around a third of active North American Facebook users, and nearly a quarter of potential US voters.</code></pre>

评论 #16610052 未加载

svbillabout 7 years ago

Nothing new about Campaign Data companies. In fact knew of a South San Francisco company called 'Campaign Data' in the '90s that ran a SAS on DECUnix. They collected voter registrar data from counties for targeted voting campaigns. Usually for passing more restrictive laws or raising taxes. Like raise property taxes for schools; send flyers to renters with kids and send nothing to homeowners with no kids. It was always in a way, unfair and evil.

allthenewsabout 7 years ago

Let's be realistic here. This headline is nothing but partisanship. The only reason this is exaggerated as a "data breech" is because of the connection to the Trump campaign.The real scandal is that such data is so easily harvested and freely available.I'd be interested in seeing how much of facebook's data repository was used in targeted political ads by all parties. Including Russian agitators who have been shown playing both sides.

评论 #16608542 未加载

评论 #16608430 未加载

aetherspawnabout 7 years ago

I hadn’t thought of it like this before, but from a political POV everyone’s vote, whether they are a dole bludger or a quantum physiscist, are worth the same. So really, to win an election .. take that as you will. Identifying these people is a very profitable area.Interesting side note .. in Australia we assign school funding based on the highest education received or wage class of the parent (classes A, B ... E or such).

inetknghtabout 7 years ago

1) Facebook collects and builds a profile about you 2) Facebook allows third parties to target advertisements based on the profile 3) Advertisements are tracked 4) Browsing habits and advertisement tracking reconstructs who was targeted

muddi900about 7 years ago

ITT: people who did not read the link Astrotrufing and conservative martyrs bleeding all over the site.

dretaabout 7 years ago

Why bother protecting any data, if you can put a footnote in your ToS.

whiddershinsabout 7 years ago

I don’t understand the use of the word “breach” in this headline.

hux_about 7 years ago

Can't wait for Sheryl Sandberg to write a new book now on garden soil or something.

评论 #16608124 未加载

matchagauchoabout 7 years ago

This is hardly news... Facebook ads cannot target specific users, they only target audience segments.It's actually far easier to create ads targeted at segments with likely political beliefs, and Marketers have access to aggregate numbers of niche segments today.There's no need to scrape people's profiles or get down to the individual level.

评论 #16622209 未加载

MechEStudentabout 7 years ago

China has more. They have enough that this is a drop in the bucket. While they might be as blatant and ineffective as Russia by interfering with an election, they want a low profile and to maximize capture of revenue, so they are more about making money than trying to put feces on the face of the American political process.You people should pick your battles. It would help if you knew the battlefield first.

评论 #16608935 未加载

评论 #16608426 未加载