Clever. One of the more interesting challenges that I've run into in the last few years is just the sheer amount of raw data out there. It's mind-boggling how many problems could be solved if we could sift through that data quickly, from human trafficking down to weather. I'm particularly fascinated by her intuition that writing patterns and templates can identify pimps. I'm not sure how long it would have taken me to come to that conclusion, but now that it's out there, it's obvious.<p>I wonder what other problems we can solve with the same toolset.
"When I asked her how detectives differentiate Traffic Jam's data between trafficked victims and sex workers, she said that they rely on their intuition and knowledge of the community they protect."<p>It sounds like she's busting low class pimps, and hoping that a few of them are human traffickers.
This is interesting. The article doesn't talk about how she implemented the Traffic Jam program, but it does discuss how she came to 'know' sex ads, as a way to keep tabs on pimps.<p>"'I would literally just spend hours on these websites, looking at ads, getting a sense for what was the norm,' she said. She began to pick up the nuances of every post, understand how a template was made, and get a feel for the different voices behind these ads."<p>I don't know how this information fits with her implementation, but I was reminded of an old article by Paul Graham, "A Plan for Spam" (<a href="http://www.paulgraham.com/spam.html" rel="nofollow">http://www.paulgraham.com/spam.html</a>), where he talks about automating the process of detecting spam using Bayesian filtering.<p>"I think it's possible to stop spam, and that content-based filters are the way to do it. The Achilles heel of the spammers is their message. They can circumvent any other barrier you set up. They have so far, at least. But they have to deliver their message, whatever it is. If we can write software that recognizes their messages, there is no way they can get around that."<p>Substitute the sex ad for the spam message, and we're talking about the same thing. It would be an interesting exercise to try Bayesian filtering on sex ads, or any other kind of message, to see where it leads.
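To make the Bayesian filtering idea concrete, here is a minimal sketch of a word-count naive Bayes classifier with add-one smoothing. It is not the formula from "A Plan for Spam" (that essay combines the most "interesting" token probabilities), and it says nothing about how Traffic Jam actually works. The labels `template` and `other`, the `NaiveBayes` class, and the toy ad texts are all made up for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word-like tokens."""
    return re.findall(r"[a-z0-9$']+", text.lower())


class NaiveBayes:
    """Multinomial naive Bayes over word counts (illustrative only)."""

    def __init__(self):
        self.word_counts = {}        # label -> Counter of token counts
        self.doc_counts = Counter()  # label -> number of training texts
        self.vocab = set()           # every token seen during training

    def train(self, text, label):
        tokens = tokenize(text)
        self.word_counts.setdefault(label, Counter()).update(tokens)
        self.doc_counts[label] += 1
        self.vocab.update(tokens)

    def log_scores(self, text):
        """Return log P(label) + sum of log P(token | label) per label."""
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label, counts in self.word_counts.items():
            total_tokens = sum(counts.values())
            score = math.log(self.doc_counts[label] / total_docs)
            for tok in tokens:
                # Add-one (Laplace) smoothing so unseen tokens do not zero out a label.
                score += math.log((counts[tok] + 1) / (total_tokens + len(self.vocab)))
            scores[label] = score
        return scores

    def classify(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


def build_demo_classifier():
    """Train on a tiny, invented data set (hypothetical ad texts)."""
    nb = NaiveBayes()
    nb.train("new in town call now 100% real pics", "template")
    nb.train("new in town limited time call now", "template")
    nb.train("selling used bike good condition", "other")
    nb.train("vintage computer for sale works great", "other")
    return nb


if __name__ == "__main__":
    demo = build_demo_classifier()
    print(demo.classify("call now new in town"))
    print(demo.classify("used computer for sale"))
```

With real ads you would need far more training data and a much better tokenizer, and, as with spam, whoever writes the ads can adapt their wording once they know a filter exists.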
Fascinating article, trying to automate what the CIA would call an analyst. Back when I was building my old computer collection I would read hundreds of eBay listings to find the "good stuff" and started to recognize sellers that listed under a variety of user names, or buyers who were also sellers, just by the way they talked about the hardware: whether they called it by its "common" name or the product catalog name, etc. Never thought about making a research project out of it though.
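Spotting the same seller behind different user names by how they write can be roughly approximated by comparing character n-gram profiles of the listings. The sketch below is one common stylometry trick, not anything described in the article; the function names, seller names, and listing texts are hypothetical.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Count overlapping character n-grams after normalizing case and whitespace."""
    s = " ".join(text.lower().split())
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count profiles."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def most_similar(query, listings):
    """Return the key in `listings` whose text is stylistically closest to `query`."""
    q = char_ngrams(query)
    return max(listings, key=lambda name: cosine(q, char_ngrams(listings[name])))


if __name__ == "__main__":
    # Invented listings: one uses catalog names, the other casual wording.
    known = {
        "seller_a": "DEC PDP-11/23 CPU board, pulled from working system, untested",
        "seller_b": "old digital computer thing!! looks cool, no idea if works, sold as is",
    }
    unknown = "DEC VT100 terminal, pulled from working system, untested"
    print(most_similar(unknown, known))
```

Character n-grams pick up habits like stock phrases, punctuation, and catalog-style naming without needing a word list, but on short listings the signal is weak, so treat a high score as a hint rather than proof.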
I'm really uneasy with police analyzing our social media data; it's heading into thought-crime territory. But there's tons of money to be made off of it.