I've been participating in the data bounties from Dolt and I suggested some of the schema changes on the Discord chat. I do think this is a turning point for dealing with these files. Many 100's of TB gets in to real money, really quick. Just the S3 data charges alone are six figures a year. Not to mention the memory and compute to chew through all of it. If the final data set is more in the 1TB range, this data will get a lot more use. We might even get closer to a real marketplace for healthcare services.