Azure health monitoring runs by default on all our resource groups. If you go to a resource group page and select Insights (Preview) from under Monitoring on the left had menu you can see a bunch of stuff. Check the Show Azure resource health box for current resource health: https://portal.azure.com/#@034gc.onmicrosoft.com/resource/subscriptions/fde9cf97-7bf0-4373-9fbd-eba18c6e0dfc/resourceGroups/TC-PLT-EGIS-RG/e2emonitoring
Alerts get triggered if these health metrics fail. Alerts are just listed in portal by default unless we define another action. eGIS has worked with the cloud team to send email alerts for all default health metrics in the pilot environment.
Default health metrics
Azure’s default health metrics can be found here:
https://docs.microsoft.com/en-us/azure/azure-monitor/insights/vminsights-health
Enabling email alerts for default health metrics
Only the subscription owner can enable these alerts, and it was done via script. Contact the cloud team for support.
TODO summarize email exchange with cloud team on scripting the actions.