Hello,
I was seeing the same issue: when upstream node is down, healthcheck correctly flags it as down yet subsequent requests are still routed to that node (even though others are healthy).
Tried this patch with 1.0.13 and .15 - no relief: https://github.com/liseen/healthcheck_nginx_upstreams/blob/master/healthcheck.patch
Is the above supposed to work when ip_hash directive is used?
Thank you,
-nikita
My config:
upstream admin-cluster {
ip_hash; # clientIP-based session affinity
healthcheck_enabled;
healthcheck_send "GET /running HTTP/1.0" "Host: www.mydomain.com" "User-Agent: FooBar/1.0 nginx" "Connection: close";
healthcheck_delay 2000;
healthcheck_failcount 2;
healthcheck_timeout 2000;
server admin1.staging.mydomain.com:8080 max_fails=1 fail_timeout=5; #max sec to connect to upstream host
}