We enable locality aware load balancing, which means that the local endpoint is given a higher priority in the envoy endpoints list. When it needs to do a retry after an error from an endpoint envoy will by default retry higher priority endpoints ...