I participated in a organizational data challenge in 2018 and chose awk to process the massive data to solve several interesting challenges.<p>The repo is: https://github.com/ketancmaheshwari/SMC18<p>A report detailing the approach and results is here: https://github.com/ketancmaheshwari/SMC18/blob/master/report/SMC18_DataChallenge4.pdf