Greg Kidd, the founder of 3taps, did not have to keep fighting this fight - AT ALL. He is one of the top execs at Ripple Labs, was in the first round of Twitter (and Square), and doesn't have his net OR self worths tied up in 3taps. He continued this because he believes it was right - and I, for one, thank him for it.
Back when the first bubble burst in late 2001, I scraped a bunch of historical craigslist data from a secondary archive and built an interactive gnuplot webpage of post-traffic by category over time. At the time, it got slashdotted, a couple hundred thousand people looked at it and it was all fun and good.<p>So I thought afterwards, hey, the economy is kinda sketchy still and looking at this stuff sure is neat... I should build a real tool that robustly and respectfully logs daily post totals for more locales, and maybe build out a cool little graph portal. Maybe I can even do a little NLP to make it smarter. hey, it's craigslist, they're community minded.. they thank me when I post, they won't mind. They give pencils to teachers even.<p>So I email them, and Craig responds in a cc'd message with a 'hey cool, can this guy use our RSS feeds'? At which point, the assholes that worked there started inventing every excuse under the sun as to why doing so would totally damage their infrastructure (because you know, polling RSS every half an hour is total abuse.)<p>Anyway, that's when I realized that all the hippie-dippie stuff was just window dressing and that I really truly was dealing with a really special species of asshole.<p>I put the project down and walked away. The end.
<i>the same statute that led to the demise of Aaron Swartz</i><p>For fuck's sake.<p>CFAA criminal sentencing guidelines may very well have contributed to Swartz's suicide. They incentivized prosecutors to create complex, showy indictments cross-linking multiple felony charges (because exploiting unauthorized access in furtherance of other felonies is an accelerator in the CFAA). CFAA may be broken in several ways.<p><i>But CFAA is also the sole federal statute governing unauthorized access</i>. In civil litigation, CFAA is the only statute that provides a civil cause of action relating to unauthorized access to computers of any sort.<p>People like to write about civil CFAA as if it was some sort of nuclear option. But civil and criminal cases are worlds apart. If you're going to sue someone for misusing your computer systems, or even just violating your terms of use, CFAA is merely the statute that enables that. That has nothing whatsoever to do with overzealous prosecution.<p>Invoking Aaron Swartz in an argument over who's allowed to show apartment ads where is manipulative and grotesque.
We really need a non-profit organization that provides a data store with an api for common things like classified listings, sms messages, pictures, likes, etc.<p>That can help us move away from this sort of chicken and egg problem with user generated data. These companies are basically hogging it because they were able to build the user base.<p>If we can get the data in a non-profit store with a licensing scheme that basically says you must as a part of using this data add any user-generated data submitted to your website back to this store so other developers can build products on top of it, we could really innovate in classifieds and social networks.<p>Perhaps something like that can be funded by EFF or related organization... because then we can potentially apply governance to that user generated data which has not been possible with private companies.<p>The chicken and egg problem can be solved if big non-profit tech and civil rights brands like the ACLU, EFF, Wikipedia, etc. all get behind this and market it.
Excellent update on one of the hard cases EFF has been fighting.<p>There's a link to an interesting law review article on how the CFAA can make it a criminal act for arbitrarily banned users to even browse to a public webpage:
<a href="http://digitalcommons.law.umaryland.edu/cgi/viewcontent.cgi?article=3216&context=mlr" rel="nofollow">http://digitalcommons.law.umaryland.edu/cgi/viewcontent.cgi?...</a><p>It's an absurd result and frustratingly unaddressed by the courts.
> and will make its API source code, the settlement agreement, and other legal filings and public policy resources available.<p>This is interesging to me. A couple years ago being young and naive i received a cease and desist order from craigslist legal team demanding i remove my craigslist scraper from github. It was largely a toy project to play around with an html parser library i wanted to learn anx thought it could be useful. Of course I now understand it was against their tos and from an ethical standpoint, avoid scraping anything unless getting permission, but at the time I was terrified I'd be sued for a ton of money. It felt incredibly aggressive to go after me , a student at the time.<p>So I'm curious.. is it illegal to scrape but ok to release the source code? Where is the line drawn?
I don't understand.<p>>> The Court has ruled that users—not craigslist—own the copyrights in their postings.<p>>> ... Craigslist finally conceded in Court that no such harm or impairment ever occurred.<p>>> Craigslist completely rewrote its Terms of Use, removing many of the most abusive clauses.<p>Everything above seems to be against Craigslist. Then why does 3taps have to agree to a settlement to pay Craigslist $1 million?<p>And if there are other parts of the court ruling that went against 3taps which this blog post doesn't mention, then how can Craigslist be forced to forward that money to EFF?
It's not clear why they are shutting down if "the Court has ruled that users—not craigslist—own the copyrights in their postings."<p>Maybe "3taps lacks the resources to continue the fight" implies that the lawsuit has drained their bank accounts and they are out of money.
The actual lawsuit docket is here:<p><a href="http://www.plainsite.org/dockets/k5ulex5l/california-northern-district-court/craigslist-inc-v-3taps-inc-et-al/" rel="nofollow">http://www.plainsite.org/dockets/k5ulex5l/california-norther...</a>
Not sure if I understand this correctly. Does this mean that if Instagram or Twitter terms does not allow scraping of their user's generated content, any developer can just go ahead and do it because the copyright holder of the post is the user?<p>Since the site does not hold the copyright (and rightly so), the site owner does not have the rights to say what can be done with the data. That belongs to the users that generates it.<p>In that case, how do we know if all the individual users consent to the scraping? If you scrape 10,000 data, and one user complains, would you be in trouble? And does the user has the right to know who are accessing the data outside of the normal use (since if they don't know, they can't object)?
"3taps replied that it did not access craigslist and instead obtained the data from Google"
What does this mean? how do they get that data from Google?
Company A made a chocolate fountain for the "public" to see. People enjoyed it. Company B thought this is a great opportunity to make cakes out of it. Because the fountain is "public" they made this as the source of their cake business. Company A complained to take down the fountain and....<p>Well you know the rest of the story. :)
interesting outcome. Do users own the copyright to their pictures and postings on Facebook? twitter?<p>Can I build a Facebook scrapper and redistribute it to other sites?
Would a "cannot use for commercial purposes" clause have nipped this one in the bud? I'm still on the fence with this one. I think Craigslist could have played it a lot better but I find it hard to believe no-one here can empathise with the founder.
<i>3taps built a data exchange that aggregated user-generated data housed on various websites and then made that data available through an API to developers, including PadMapper and Lovely.</i><p>Craigslist discovered that it had become (has become) the "MLS" of rentals... and perhaps even more accurately -- it's a brokerage of _housing_ data -- both rentals and sales. So when property management companies (PMCs) discovered how darn easy it was, for example, to flood craigslist with multiple ads for the same unit, or to flood it with units that were never available to begin and thus alter market perception -- certain people got exactly what they wanted: hyperinflation in rents, or the subsequent upward pressure on housing prices, or both.<p><i>As recently as 2010, craigslist welcomed innovative uses of the publicly available data ... Over the next two years, as innovators like PadMapper and AirBnB began to thrive, craigslist reversed course, and punished the innovators it previously welcomed to use the data. In February 2012, craigslist rewrote its Terms of Use, abandoning its long-articulated position that users own their own content which was freely available on the “public” part of craigslist's website.</i><p>As outraged as everybody was about this, it is exactly what the real MLS does when you decide to sell your house. You sign a contract promising to pay some Realtor's brokerage company 6 percent of whatever your house goes for -- in that contract you are essentially giving them the "copyright" of your house listing; they own it on the MLS and that is why you have to pay them the big bucks. Never mind that they do basically NOTHING other than simple photography and data entry to post on the MLS... but now they require you give them ~$66K of your equity for their 3 hours of work. (Source: <a href="http://www.mercurynews.com/business/ci_28512250/report-silicon-valleys-housing-affordability-crisis-worsens" rel="nofollow">http://www.mercurynews.com/business/ci_28512250/report-silic...</a> Median price of "entry level" home in San Mateo County = $1.1M).<p>Same thing is happening in rentals / property management co's (PMCs), but slightly different symptoms.<p>Nobody is attacking the problem the right way, though. 42Floors tried the experiment and found it to be a failure, too. (Source: <a href="https://news.ycombinator.com/item?id=9881213" rel="nofollow">https://news.ycombinator.com/item?id=9881213</a>)<p>The market should be putting more pressure on brokers to compete with each other ... damn that 6 percent. (Right, but the NAR signed a non-compete agreement with itself so it gets to do that)<p>Hackers should stop building tools that make it easier and cheaper for the PMCs and real estate agents to steal everybody's equity.