I'd love to see a GIL-less python that competes head-for-head with serial python. I am in the same boat as the author (I use python for command-and-control of C++ libraries that have their own concurrency and I spend too much time trying to figure out which bit of training or data processing code is keeping the system from using its 96 cores, 384GB RAM, and 8 GPUs).