Skimming the slide set #1 (Systems) is highly useful, even for people that don't do machine learning.<p>For example, he covers the frequency of hardware failure, and also gives latencies for different operations (L1 cache read, disk read, etc.)<p>Slide 25 lists many different types of data on the web, categorized. This jumped out at me because, reading the list in one big picture got the gears in my head turning about potential data sources, and what could be done with them.