GH actions has been great for us for CI but incredibly opaque when you're trying to diagnose problems. Simple jobs running on the gh hosted runners are fine, but we have a large pool of self-hosted runners running more complex workflows, and whenever things go sideways the answer is usually "did you try restarting all of them?"
Hopefully this is helpful, at least someone is investing in making gh actions work for larger organizations.