Setting up the first SLO - Vsceptre

10 May 2023

This is the final piece of the three-part series “The path to your first SLO”.

We discussed the basics of what to observe and how to get the relevant metrics in part 1 and part 2 of this series. This time we take a quick look at how to set up a simple service availability monitoring SLO with Nobl9 and SolarWinds Pingdom.

Nobl9 is used for building the SLO dashboards in this example because it supports many data sources, with options to gather metrics through the Nobl9 agent or through direct API integration. To save some time, we use SolarWinds Pingdom to run a script and monitor a web URL.

You can sign up for a free trial of SolarWinds Pingdom, and Nobl9 offers a 30-day trial as well. Together they give you a nice playground for this exercise. Point Pingdom at an important service URL and run a simple availability test once per minute. SolarWinds Pingdom will return uptime as well as service response time metrics for you.

Setting up the data source in Nobl9 to connect to SolarWinds is a breeze. We do not want to repeat the details here. If you are interested, you can follow this nice tutorial. At the end you can get a nice dashboard similar to this.

We set an SLO target of 99% of requests to this slow endpoint responding within 8 seconds over a 1-day rolling window (Satisfactory), and a similar SLO target of 99% for responses within 6 seconds (Optimal). Based on the above, we can comfortably commit to an SLA with the end user of response time < 8s for 99% of the time on a 1-day rolling window, while leaving some room for system downtime or for pushing new releases to production.
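To make the numbers concrete, here is a minimal sketch of the arithmetic behind these two objectives. It assumes one check per minute over the 1-day window. The `Check` record, the helper names, and the sample data are all hypothetical illustrations; they are not Pingdom's or Nobl9's API or data format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Check:
    """One synthetic probe result (hypothetical shape, not a vendor schema)."""
    up: bool
    response_seconds: float


def sli(checks, threshold_seconds):
    """Fraction of checks that were up and answered within the threshold."""
    if not checks:
        raise ValueError("no checks in the window")
    good = sum(1 for c in checks if c.up and c.response_seconds <= threshold_seconds)
    return good / len(checks)


def error_budget_minutes(target, window_minutes):
    """Minutes in the window that may miss the objective before the SLO is breached."""
    return (1 - target) * window_minutes


WINDOW_MINUTES = 24 * 60  # 1-day rolling window, one check per minute (assumed)
TARGET = 0.99             # 99% target used for both objectives

# Made-up sample day: 1430 fast checks, 6 slower ones, 3 too slow, 1 failed check.
checks = (
    [Check(True, 2.5)] * 1430
    + [Check(True, 7.0)] * 6
    + [Check(True, 9.5)] * 3
    + [Check(False, 0.0)] * 1
)

satisfactory = sli(checks, threshold_seconds=8.0)  # objective: within 8 seconds
optimal = sli(checks, threshold_seconds=6.0)       # objective: within 6 seconds
budget = error_budget_minutes(TARGET, WINDOW_MINUTES)

print(f"Satisfactory (within 8 s): {satisfactory:.2%}, met: {satisfactory >= TARGET}")
print(f"Optimal (within 6 s): {optimal:.2%}, met: {optimal >= TARGET}")
print(f"Error budget per day: {budget:.1f} minutes")
```

With a 99% target, the 1-day window leaves 14.4 minutes of error budget, which is the room for downtime or releases mentioned above. In this made-up sample both objectives are met, with the Optimal one sitting much closer to its limit.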

This just scratches the surface of how to use SLOs for service reliability tracking. You can also build composite SLOs, set alerts, or change the time windows of an SLO. Of course you can build all these dashboards with other tools, but Nobl9 can make your life a bit easier. The whole process can also be set up as SLO as code with Terraform or OpenSLO.

Hope you enjoyed the series “The path to your first SLO”. If you need to revisit your observability practice, feel free to reach out and talk to us. Our team of consultants at Vsceptre can help you with different aspects of your observability journey, from monitoring, log aggregation, and DevOps integration to SRE practice and data consolidation.
New to SLO?
#SLOconf is a free, virtual event focused on #SLOs! 🔥
Whether you work in SRE, SLOs, DevOps, or Ops, or as a developer, SLOconf is the perfect platform to share insights and ideas on the latest trends and developments in SRE/SLO.
Vsceptre is a sponsor at SLOconf 2023, hosted by Nobl9! 📢
For more details & speaker lineup, register here: 👇
www.sloconf.com

