On the Azure portal for our app service, we've recently seen a recommendation to scale out to 3 instances (we're currently running 2 as minimum + autoscale rules). However, we're not able to find any official documentation on how app service instances are getting distributed and how that would prevent any downtime. The recommendation also states that "since you have only two instances you can expect a downtime of upto 50% when the App Service platform is upgraded".

Have a few questions in this particular scenario:
1) When the platform upgrade happens and one of the instances starts the upgrade process, isn't that instance supposed to be brought down and the remaining 1 instance would handle all of the requests, given that the load balancer is supposed to treat the instance as not available and direct all traffic to the remaining 1 instance? That would technically still serve any incoming requests and prevent downtime or is that not the case? Need an explanation.
2) How does the instance distribution happen? If we run 3 instances as a minimum is the 3rd instance hosted on a different server? Is that always the case?
3) If the 3rd instance is in a different server and one of the instances starts the upgrade process, doesn't that still give us a 75% success rate? How does that prevent any downtime?
4) How long does the platform upgrade process actually last on average? How long would the downtime be or is it just a matter of the application restart?
Any links to official documentation are welcome, thank you.