Rules

CPU-load

13.222s ago

8.186ms

Rule State Error Last Evaluation Evaluation Time
alert: HighNodeCPU expr: rate(node_load15[5m]) * 100 > 85 for: 5m labels: severity: warning annotations: description: |- CPU load is high VALUE = {{ $value }} LABELS: {{ $labels }} summary: High CPU load (instance {{ $labels.instance }}) ok 13.225s ago 8.176ms

Cpu-usage

8.777s ago

709.3us

Rule State Error Last Evaluation Evaluation Time
alert: HostHighCpuLoad expr: 100 - (avg by(instance) (rate(node_cpu_seconds_total{mode="idle"}[2m])) * 100) > 95 for: 3m labels: severity: warning annotations: description: |- CPU load is > 95% VALUE = {{ $value }} LABELS = {{ $labels }} summary: Host high CPU load (instance {{ $labels.instance }}) ok 8.777s ago 700.7us

Disk-usage

365ms ago

1.864ms

Rule State Error Last Evaluation Evaluation Time
alert: Low data disk space expr: ceil(((node_filesystem_size_bytes{mountpoint!="/boot"} - node_filesystem_free_bytes{mountpoint!="/boot"}) / node_filesystem_size_bytes{mountpoint!="/boot"} * 100)) > 95 labels: severity: critical annotations: description: 'Partition : {{$labels.mountpoint}}' host: '{{$labels.instance}}' summary: Disk usage is `{{humanize $value}}%` title: Disk Usage ok 365ms ago 1.856ms

Disk-usage-check

8.46s ago

1.35ms

Rule State Error Last Evaluation Evaluation Time
alert: Host Out Of DiskSpace expr: (node_filesystem_avail_bytes * 100) / node_filesystem_size_bytes < 10 and on(instance, device, mountpoint) node_filesystem_readonly == 0 for: 5m labels: severity: warning annotations: description: |- Disk is getting full (< 10% left) Available Space is only= {{ $value }} % LABELS = {{ $labels }} summary: Host out of disk space (instance {{ $labels.instance }}) title: Low Disk space ok 8.461s ago 1.341ms

Memory-usage

9.365s ago

333.8us

Rule State Error Last Evaluation Evaluation Time
alert: HostOutOfMemory expr: node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes * 100 < 8 for: 10m labels: severity: warning annotations: description: |- Node memory is filling up (< 8% left) VALUE = {{ $value }} LABELS = {{ $labels }} summary: Host out of memory (instance {{ $labels.instance }}) ok 9.365s ago 324.5us

node-exporter.rules

7.357s ago

4.464ms

Rule State Error Last Evaluation Evaluation Time
record: instance:node_num_cpu:sum expr: count without(cpu) (count without(mode) (node_cpu_seconds_total{job="node"})) ok 7.357s ago 276us
record: instance:node_cpu_utilisation:rate5m expr: 1 - avg without(cpu, mode) (rate(node_cpu_seconds_total{job="node",mode="idle"}[5m])) ok 7.357s ago 149.7us
record: instance:node_load1_per_cpu:ratio expr: (node_load1{job="node"} / instance:node_num_cpu:sum{job="node"}) ok 7.357s ago 260.6us
record: instance:node_memory_utilisation:ratio expr: 1 - (node_memory_MemAvailable_bytes{job="node"} / node_memory_MemTotal_bytes{job="node"}) ok 7.357s ago 124.1us
record: instance:node_vmstat_pgmajfault:rate5m expr: rate(node_vmstat_pgmajfault{job="node"}[5m]) ok 7.357s ago 67.88us
record: instance_device:node_disk_io_time_seconds:rate5m expr: rate(node_disk_io_time_seconds_total{device!="",job="node"}[5m]) ok 7.357s ago 1.836ms
record: instance_device:node_disk_io_time_weighted_seconds:rate5m expr: rate(node_disk_io_time_weighted_seconds_total{device!="",job="node"}[5m]) ok 7.355s ago 1.212ms
record: instance:node_network_receive_bytes_excluding_lo:rate5m expr: sum without(device) (rate(node_network_receive_bytes_total{device!="lo",job="node"}[5m])) ok 7.354s ago 146.4us
record: instance:node_network_transmit_bytes_excluding_lo:rate5m expr: sum without(device) (rate(node_network_transmit_bytes_total{device!="lo",job="node"}[5m])) ok 7.354s ago 116.8us
record: instance:node_network_receive_drop_excluding_lo:rate5m expr: sum without(device) (rate(node_network_receive_drop_total{device!="lo",job="node"}[5m])) ok 7.354s ago 137.5us
record: instance:node_network_transmit_drop_excluding_lo:rate5m expr: sum without(device) (rate(node_network_transmit_drop_total{device!="lo",job="node"}[5m])) ok 7.354s ago 116.6us