One thing that really puts me off Tensorflow is the large amount of work you have to do just to get your data in.<p>I recently trained a detector using Nvidia's detectnet and the DIGITS environment really makes it easy from dataset/database creation to model generation and training. The web interface handles all the padding and resizing for you when you import your images, and detectnet does online augmentation during training. It takes data in KITTI format which is just image + textfile with labels. No wrangling in Python needed.<p>The documentation is pretty terrible though, you have to pore over github issues to fix and modify things, but it does work.
I mean, this is neat but as someone who actually tried to build a computer vision product, can I just say Open Images data aren't quite enough? Also, computer vision isn't quite at "human level" yet. For your own project, building a model that has 90% accuracy on the test set is awesome but for an actual product to be released into the wild, it could have serious problems (not to mention adversarial examples).
A minor nit but the function to dedup the image ids and the corresponding comments seem to be off from a data structure/algorithmic POV.<p>"Looking at our deduplication function, it’s functional and performant, but not very descriptive. Essentially it checks a running set called seen, which is checked for originality as the deduplication script progresses. As python Dicts are essentially a hash map, the in check compares element hashes instead of each individual component of the dictionary object. This massively speeding up the deduplication process."<p>Uhm..I don't think so.<p>It is just extra work to check if the objects are already in the set vs just stuffing them all in and let the set handle uniqueness.<p>Am I missing something?<p><a href="https://gist.github.com/pstoll/ae73582763540051d321a4eb15304226" rel="nofollow">https://gist.github.com/pstoll/ae73582763540051d321a4eb15304...</a><p>Again, a minor thing. But seeing something like that makes my 'what else do I need to review' detectors go up.
live demo: <a href="https://algorithmia.com/algorithms/zeryx/openimagesDemo" rel="nofollow">https://algorithmia.com/algorithms/zeryx/openimagesDemo</a>