Monitor server health metrics in Lifecycle Services

Completed

Environment monitoring is a simple way to access current activity on environments in your finance and operations apps implementation. By using the Environment Monitoring and SQL Insights that come in a user-friendly dashboard, you can see measurements and diagnostics of your environment's health. Within the Health metrics dashboard, some issues are reported directly to the Microsoft Service Engineering Team and some are mitigated immediately. If there are any concerns regarding performance, this is a good way to gain insight into potential issues. This unit examines how to view the health concerns for environments and determine how to mitigate many issues before they happen.

Types of health checks performed with Health metrics

Several types of health checks can be performed within the Health metrics interface, which you can use to make observations about your environment. The health checks span various services and components of the environments, including the following:

  • AOS
  • Batch frameworks
  • Data Import/Export framework
  • Microsoft Azure SQL
  • Management reporter

The checks are performed based on multiple data sources, like the telemetry that is collected from the environments, the checks that are done by a continuously monitoring watchdog service, CP counters, along with other system-level counters that the environment produces. As previously mentioned, some health checks are reported to the Microsoft Service Engineering team for direct investigation.

Viewing the activity log and the user load in the activity monitor

On the Environment Monitoring page, select the Health metrics tab to view the monitoring dashboard. Health metrics are collected for all machines and components that relate to the environment that you selected in Lifecycle Services. The health metrics that you can view are CPU usage, available memory, errors logged per second, and batch heartbeat. You will be alerted about any abnormal activities in the metrics. Some alerts are taken care of by the system, but others will need you to take action to mitigate them.

The Activity tab contains several different charts and sections. The user interaction chart shows user activities on various machines in the environment, as well as the SQL utilization trend. The activity load section shows the various activities that were performed on each machine, while the user load section shows all the system users and the time that the user spent on a specific machine. If you hover over an activity in the activity load section, you can see an advanced look into the user's specific actions. You can also use filters on the page to narrow the information logs. Time duration, user, and search terms are only some of the filters that you can use to peer into the information for more comprehensive statistics. Note that the page doesn't load any data by default, and you must select a time duration and then select Submit time for the data to show up. The activity monitoring tool retains data for 30 days.