How to monitor PerfectScale Agent
PerfectScale psc_exporter metrics and associated alerts overview
Last updated
PerfectScale psc_exporter metrics and associated alerts overview
Last updated
PerfectScale Exporter installed in your Kubernetes cluster provides helpful for monitoring, reviewing, and alerting on PerfectScale platform behavior. These metrics will instantly detect abnormal behavior and notify you through .
The Exporter presents metrics in the Prometheus format and utilizes two categories of metrics:
Counter - the increasing metric
Gauge - system state at a specific time point. This metric can go down and can go up.
PerfectScale Exporter takes and as inputs and uploads as output.
These metrics monitor the communication between the Exporter and the KSM in your cluster.
Metric Name: psc_exporter_ksm_instances_scraped
Description: The number of KSM instances the exporter tried to scrape during the latest scraping round. Usually, you have only 1.
Type: Gauge
Metric Name: psc_exporter_ksm_scraping_errors_total
Description: A counter that grows incrementally to keep track of errors during KSM scraping.
Type: Counter
cAdvisor component provides the Exporter with information on specific containers and their resource usage.
Metric Name: psc_exporter_cadvisor_instances_scraped
Description: The number of cAdvisor instances the exporter tried to scrape during the latest scraping round.
Type: Gauge
Metric Name: psc_exporter_cadvisor_scraping_errors_total
Description: A counter that grows incrementally to keep track of errors during cAdvisor scraping.
Type: Counter
PerfectScale provides an additional group of metrics that pertains to exporters communicating with the PerfectScale SaaS platform.
Metric Name: psc_exporter_time_windows_upload_errors_total
Description: Total number of failed attempts to upload time windows to PerfectScale.
Type: Counter
Metric Name: psc_exporter_auth_errors_total
Description: Total number of PerfectScale exporter authorization errors.
Type: Counter
Metric Name: psc_exporter_upload_policy_errors_total
Description: Total number of errors when updating the upload policy.
Type: Counter
To enable or configure the Alerts, update the Helm values according to your requirements.
Alert Name: PerfectScale Exporter High KSM Scraping Error Rate
Description: Within the last 5 minutes, over 30% of kube-state-metrics scraping attempts have failed.
Alert Name: PerfectScale Exporter High cAdvisor Scraping Error Rate
Description: Over a specified percentage of cAdvisor scraping attempts have failed within the specified time range.
Alert Name: PerfectScale exporter Time Windows Upload Error Rate
Description: Within 1 hour, three or more time windows upload errors to PerfectScale occurred.
Alert Name: PerfectScale exporter Authorization Errors
Description: Within 1 hour, two or more PerfectScale Exporter authorization errors occurred.
Alert Name: PerfectScale exporter Upload Policy Refresh Errors
Description: Within 1 hour, two or more errors occurred when updating the upload policy.
Alerts are useful in immediately informing about the abnormal behavior of the Exporter based on the described above.