In my experience, Ray on AWS is a good way to utilize resources badly and waste a lot of money (as is generally anything cloud or anything Python; when you do both, it multiplies).<p>I'd rather have a real HPC cluster.
I do not quite get this. How does this enable someone to run Ray or Metaflow on a typical batch-scheduled HPC system (Slurm or the like)? Inter-node communication is done via the Lustre file system, right?