One of the most interesting aspects of working on this project was the strategy we used for refining our primitive city detection. We gathered pairs of article titles & bodies (hoping we'd later accurately scrape them with the readability.js library...) and set them up as test fixtures. While TDD seems to go out the window during hackathons, it was crucial for us to quickly and efficiently iterate while making sure to catch as many edge cases as possible.<p>In typical hackathon fashion, the code got mangled when I had to switch the data store from leveldb (node) to flat json files (chrome extension) so the latest version in the repo isn't the most graceful. But here's a hacky node app for processing cities1000 data and then running mocha tests against a set of articles:<p><a href="https://github.com/dzhang50/rlt/tree/master/node" rel="nofollow">https://github.com/dzhang50/rlt/tree/master/node</a><p>Hope someone finds the technique and code useful, interesting, or at least amusing.
Looks great but you guys have a name conflict with a major bus provider up and down the east cost. <a href="http://www.vamoosebus.com/" rel="nofollow">http://www.vamoosebus.com/</a>