TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Using NYC Taxi Data to Identify Muslim Taxi Drivers

104 pointsby tshtfover 10 years ago

4 comments

rmxtover 10 years ago
If you dig through to the source reddit posting [1], you can see that the post only really puts 6 different individual taxi IDs out there for visualization. To me, only the first 4 seem like a good visual fit for the prayer times, and the overall trip heatmap [2] suggests that they may just be part of a larger pattern of eating&#x2F;taking a break at sunrise, noon, and sunset for all cab drivers. While the blog posting is begging the question of whether or not the data release contains personal data as a result of these findings, much more invasive findings have already been published (and posted to HN) with this NYC Taxi data, like corroborating individual trips by high profile people. [3] Notwithstanding the privacy issues and questionable methods used to obfuscate the data, I personally think that the release is a great step in the right direction for open data.<p>[1] <a href="https://www.reddit.com/r/dataisbeautiful/comments/2t201h/identifying_muslim_cabbies_from_trip_data_and/" rel="nofollow">https:&#x2F;&#x2F;www.reddit.com&#x2F;r&#x2F;dataisbeautiful&#x2F;comments&#x2F;2t201h&#x2F;ide...</a> [2] <a href="https://i.imgur.com/lyK0qTI.png" rel="nofollow">https:&#x2F;&#x2F;i.imgur.com&#x2F;lyK0qTI.png</a> [3] <a href="http://research.neustar.biz/2014/09/15/riding-with-the-stars-passenger-privacy-in-the-nyc-taxicab-dataset/" rel="nofollow">http:&#x2F;&#x2F;research.neustar.biz&#x2F;2014&#x2F;09&#x2F;15&#x2F;riding-with-the-stars...</a><p>EDIT: For better or worse, deducing ethnicity, country of origin, and&#x2F;or religion is probably much easier based on this data set [4]. People have come up with analyses like this [5]. The data analysis is great, but my fingers are crossed that tabloid newspapers and their ilk don&#x27;t pick this up and run off xenophobic, fear-mongering articles.<p>[4] <a href="https://data.cityofnewyork.us/Transportation/Medallion-Drivers-Active/jb3k-j3gp" rel="nofollow">https:&#x2F;&#x2F;data.cityofnewyork.us&#x2F;Transportation&#x2F;Medallion-Drive...</a> [5] <a href="http://vizual-statistix.tumblr.com/image/107987401281" rel="nofollow">http:&#x2F;&#x2F;vizual-statistix.tumblr.com&#x2F;image&#x2F;107987401281</a>
flexieover 10 years ago
In Copenhagen, it was revealed a few years ago that racist customers who didn&#x27;t want a muslim taxi driver told the central when ordering a cab that they were &#x27;bringing a large dog&#x27;, and the operators at the centrals knew and respected this as the secret code that the customer wanted an ethnic Danish driver. This is obviously illegal but I dont know whether they stopped the practise.
评论 #8930735 未加载
评论 #8932257 未加载
评论 #8931256 未加载
评论 #8931355 未加载
dansoover 10 years ago
Yeah, the OP has serious blinders here. Like the student who threw alarmist BS about how the data could be used to track visits to stripper clubs...this seems to be an example of cherry-picking the data. So, I&#x27;m not good at discerning deviation from thousands of small dots...what&#x27;s the numerical measure of the correlation here? What are the significance of those blue lines when they show a trend over empty black space?<p>And what does the analysis&#x2F;trendlines look like on drivers who can be discerned as <i>not</i> being Muslim?<p>I&#x27;m all for personal data protection, and the TLC erred in not anonymizing the data. But this weak analysis and alarmist bullshit by a PhD serves little but to give governments more excuse to hide public data.<p>edit: Linking to someone more well versed than me, regarding the cherry-picking of data in the &quot;Riding with the Stars&quot; article <a href="http://blogs.law.harvard.edu/infolaw/2014/11/21/the-antidote-for-anecdata-a-little-science-can-separate-data-privacy-facts-from-folklore/" rel="nofollow">http:&#x2F;&#x2F;blogs.law.harvard.edu&#x2F;infolaw&#x2F;2014&#x2F;11&#x2F;21&#x2F;the-antidote...</a>
评论 #8930680 未加载
rhino369over 10 years ago
Observant Muslims<i>