"It introduces two extra round trips" describes a flaw in one particular implementation, not a property of the strategy itself. Nothing stops servers from periodically advertising their load measures to clients, or from stapling a current load report to each query response. The client can then cache those reports and reuse them for a period of time, so no additional round trip is needed to choose a server.
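
To make the stapling idea concrete, here is a minimal sketch of the client side. It assumes each query response carries a single numeric load value (lower is better) and that a report is trusted for a fixed TTL. The names `LoadCache`, `record`, `fresh_load`, and `pick`, and the 5-second TTL, are all hypothetical and chosen only for illustration.

```python
import time
from dataclasses import dataclass


@dataclass
class LoadReport:
    load: float         # hypothetical load measure; lower means less loaded
    received_at: float  # client clock reading when the report arrived


class LoadCache:
    """Client-side cache of load reports stapled to query responses."""

    def __init__(self, servers, ttl=5.0, clock=time.monotonic):
        self.servers = list(servers)
        self.ttl = ttl        # seconds a report is considered fresh (assumed value)
        self.clock = clock    # injectable so the cache can be tested without sleeping
        self.reports = {}

    def record(self, server, load):
        # Call this whenever a response arrives with a stapled load report.
        self.reports[server] = LoadReport(load, self.clock())

    def fresh_load(self, server):
        # Return the cached load, or None if it is missing or older than the TTL.
        report = self.reports.get(server)
        if report is None or self.clock() - report.received_at > self.ttl:
            return None
        return report.load

    def pick(self):
        # Choose a server using cached data only, with no extra round trip.
        # Servers without a fresh report sort first, so they are queried
        # and their report gets refreshed as a side effect.
        def sort_key(server):
            load = self.fresh_load(server)
            return (load is not None, 0.0 if load is None else load)

        return min(self.servers, key=sort_key)
```

In use, the client calls `pick()` before each query and `record(server, load)` after each response, so load data rides along with traffic that would have been sent anyway. Preferring servers with no fresh report is one possible policy; a real client might instead fall back to random choice or add jitter to avoid herding onto the same server.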