Two things are often vastly underappreciated in books and courses on data science: data wrangling and use of the shell command line. This book covers both, in an engaging manner. The chapter on parallel computing is a nice bonus. Highly recommended for the budding data scientist, and for quite a few of the experienced ones.