I have an internal service (REST APIs) running in AKS with an HPA configured for it. It works fine when it needs to scale out to handle a period of higher traffic. However, when AKS decides it can scale in, I start having issues: clients get a lot of 503 errors, and a pod keeps receiving requests even after it has started its termination process.
I found a very similar two-year-old issue involving ingress: https://github.com/Azure/application-gateway-kubernetes-ingress/issues/641
Is anyone else seeing the same issue? Does anyone have an idea of how to solve it?