I have a chance to get 4 talented data scientists to work on a project for three months. But I don't know what will generate a good enough amount of value. Any ideas? Also, getting datasets might also be an interesting problem.
An open source version of amazon glue,<p>Esp. the classifiers <a href="https://youtu.be/4N_ktE4NFIk?t=6m11s" rel="nofollow">https://youtu.be/4N_ktE4NFIk?t=6m11s</a> and data extraction / schema heuristics