How to De-Identify Your Data: Balancing Accuracy and Privacy

51 points by alanfranzoni over 9 years ago

4 comments

macobo over 9 years ago
De-identification has its limits and information can still be learned even from anonymized datasets. An alternative to this is something like Sharemind [1][2] where sound cryptography is used to make secure multi-party computation possible.

[1]: http://sharemind.cyber.ee/

[2]: https://www.youtube.com/watch?v=bAp_aZgX3B0
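
For readers unfamiliar with secure multi-party computation, here is a minimal sketch of additive secret sharing, one common building block of such systems. It is not Sharemind's API: the modulus, the three-party setup, and the function names `share`, `reconstruct`, and `secure_sum` are all illustrative assumptions.

```python
import secrets

# Illustrative ring size (an assumption); all arithmetic is done modulo this value.
MODULUS = 2**32


def share(value, parties=3):
    """Split `value` into `parties` random-looking shares that sum to it mod MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(parties - 1)]
    last = (value - sum(shares)) % MODULUS
    return shares + [last]


def reconstruct(shares):
    """Recombine shares; any strict subset of them reveals nothing about the value."""
    return sum(shares) % MODULUS


def secure_sum(values, parties=3):
    """Sum private inputs without any single party seeing an individual input."""
    per_party = [0] * parties
    for v in values:
        # Each data owner sends the i-th share of their input to party i.
        for i, s in enumerate(share(v, parties)):
            per_party[i] = (per_party[i] + s) % MODULUS
    # Parties publish only their local totals, which combine into the overall sum.
    return reconstruct(per_party)


if __name__ == "__main__":
    print(secure_sum([10, 20, 30]))
```

Each party only ever holds uniformly random shares, so the raw inputs never need to be de-identified or pooled in one place. Real systems add protocols for multiplication, comparison, and protection against misbehaving parties, which this sketch omits.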
m0nster over 9 years ago
While data de-identification surely has its limits, it is useful in many contexts.

If someone is interested in tools for data de-identification, ARX [1, 2] is open source software that (among other features) supports exactly the set of methods used in this study.

Full disclosure: I'm one of the developers of ARX.

[1] Website: http://arx.deidentifier.org

[2] Source: https://github.com/arx-deidentifier/arx
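
As a rough illustration of what de-identification tools measure, the sketch below checks k-anonymity, a privacy model that ARX supports. It does not use ARX's API and is not necessarily the method from the linked study; the records, the field names, and the helpers `k_anonymity` and `generalize` are hypothetical.

```python
from collections import Counter


def k_anonymity(records, quasi_identifiers):
    """Return the size of the smallest group sharing the same quasi-identifier values."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return min(groups.values()) if groups else 0


def generalize(record):
    """Coarsen ZIP code to a 3-digit prefix and age to a decade range (hypothetical rule)."""
    decade = record["age"] // 10 * 10
    return {
        **record,
        "zip": record["zip"][:3] + "**",
        "age": f"{decade}-{decade + 9}",
    }


# Made-up example records, for illustration only.
records = [
    {"zip": "02138", "age": 34, "diagnosis": "flu"},
    {"zip": "02139", "age": 36, "diagnosis": "asthma"},
    {"zip": "02141", "age": 52, "diagnosis": "flu"},
    {"zip": "02142", "age": 58, "diagnosis": "diabetes"},
]

if __name__ == "__main__":
    qi = ["zip", "age"]
    print(k_anonymity(records, qi))                           # every record is unique
    print(k_anonymity([generalize(r) for r in records], qi))  # groups of at least two
```

Generalizing the quasi-identifiers raises k at the cost of precision, which is the accuracy versus privacy trade-off named in the title.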
djyaz1200 over 9 years ago
Best research I've seen on the topic... http://latanyasweeney.org/work/identifiability.html
grflynn over 9 years ago
"Your data" assumes there is some sort of doppelganger attached to a data bundle, which is mostly hot air used to persuade those who buy from data brokers that the data is in fact correct. I know some FOIA pests who are purposefully polluting such datasets, then requesting the information and seeing some very skewed results. What if I sell back my data, since that's what they're after anyway? I keep more logs than brokerages do and would be happy to hand them over for a fee. One item of browsing history alone is probably worth upwards of $10,0,00