Hello All,
I'm looking for some insight into why all the query performance issues on my SQL Server 2016 SP2 CU5 (Enterprise) instance were fixed by nothing more than a SQL Server restart.
Some background:
We had a network degradation that started at 7:30 AM and was resolved by 10:30 AM. Unfortunately, the queries kept behaving badly afterwards and got worse once the load (throughput) on the SQL Server increased.
We have a 5-node Always On cluster with 3 nodes in a physical data center and 2 nodes in AWS. The network degradation happened between the physical data center and AWS. (The writer node is in the physical data center, and 30% of the traffic comes from AWS.)
I got the following error during the network issue at 10 AM:
DESCRIPTION: A fatal error occurred while reading the input stream from the network. The session will be terminated (input error: 10060, output error: 0).
Apart from this particular error, there were no other errors pointing to a specific degradation such as memory pressure or disk I/O.
While CPU was high on the DB, all I could see was that even the smallest queries (against tables with only a few rows) were taking a long time.
There were no memory pressure alerts or errors, and the disk was performing optimally.
IMO it seems to be related to the plan cache, which got cleared by the restart, but how can I prove it?
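
For reference, this is the kind of check I assume would help test the plan cache theory if it happens again. It is only a sketch and I did not run it during the incident. It relies on the standard DMVs sys.dm_exec_query_stats, sys.dm_exec_sql_text, sys.dm_exec_query_plan and sys.dm_os_wait_stats. The TOP (20) cut-off and the column aliases are my own arbitrary choices.

```sql
-- Assumption: run this WHILE the slowdown is happening, before any restart.

-- 1) Cached plans with the highest average CPU per execution.
--    A small query with a recent creation_time and a very high avg_cpu_us
--    would be a candidate for a bad plan compiled during the incident.
SELECT TOP (20)
    qs.creation_time,                                           -- when this plan was compiled
    qs.execution_count,
    qs.total_worker_time  / qs.execution_count AS avg_cpu_us,     -- microseconds
    qs.total_elapsed_time / qs.execution_count AS avg_elapsed_us, -- microseconds
    st.text       AS batch_text,
    qp.query_plan
FROM sys.dm_exec_query_stats AS qs
CROSS APPLY sys.dm_exec_sql_text(qs.sql_handle)    AS st
CROSS APPLY sys.dm_exec_query_plan(qs.plan_handle) AS qp
ORDER BY avg_cpu_us DESC;

-- 2) Top waits, to see whether the time is going to CPU or to something else.
--    These counters are cumulative since the last restart (or manual clear),
--    so two snapshots taken a few minutes apart are more useful than one.
SELECT TOP (20)
    wait_type,
    waiting_tasks_count,
    wait_time_ms,
    signal_wait_time_ms
FROM sys.dm_os_wait_stats
ORDER BY wait_time_ms DESC;

-- 3) Less drastic than a restart: clear only the plan cache and compare.
--    This forces every query to recompile, so it is left commented out.
-- DBCC FREEPROCCACHE;
```

If clearing the plan cache alone brought the queries back to normal, that would support the plan cache theory. If it made no difference, the restart must have reset something else.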