What all should I be monitoring, alerting, etc. in production?

David Thielen 2,776 Reputation points
2024-07-08T18:01:19.3466667+00:00

Hi all;

At my past company there were DevOps people who took the code we wrote and pushed it up to Azure and insured it was running fine. And then pulled us in if there was a problem.

Now I'm on my own and need to handle all this myself. I have the app running fine on Azure. But now I need to make sure it stays running. (It is a Blazor Interactive Server app using Azure SQL Database and Azure Storage - BLOBs.) So...

  1. What should I set up to monitor and/or have alerts on?
  2. I have Health Checks set up and running.
    1. Can I get an alert when they fail?
    2. Can I have them write to the log when they fail?
  3. What alerts should I have for the App Service?
  4. What alerts should I have for the database?
  5. Is there anything I should be proactively checking?
  6. Anything else???

thanks - dave

Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,359 questions
0 comments No comments
{count} votes

Accepted answer
  1. SnehaAgrawal-MSFT 20,396 Reputation points
    2024-07-09T06:49:27.29+00:00

    @David Thielen Thanks for asking question!

    It's great to hear that your app is running fine on Azure. To ensure that it stays running, there are several monitoring and alerting options available to you.

    Firstly, you mentioned that you have Health Checks set up and running. That's a great start! You can set up alerts for when the Health Checks fail by using Azure Monitor. Azure Monitor allows you to create alerts based on metrics, logs, and events. You can create an alert rule that triggers when the Health Check fails. You can also configure the alert to write to the log when it fails.

    In addition to Health Checks, you should also set up alerts for the App Service and the database. For the App Service, you can set up alerts for CPU usage, memory usage, and HTTP errors. For the database, you can set up alerts for high CPU usage, high memory usage, and long-running queries.

    It's also a good idea to proactively check the logs for any errors or warnings. You can use Azure Monitor to collect and analyze logs from your app and database. This will help you identify any issues before they become critical.

    Finally, you may want to consider setting up a backup and disaster recovery plan for your app and database. This will ensure that you can quickly recover from any unexpected downtime or data loss.

    Refer to below doc links for details-

    I hope this helps! Let me know if you have any other questions.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful