Managing Alert Policies
Also available as:
PDF

Component Types and Metrics for Alert Policies

You create an alert policy for a component type. The component type drives the list of metrics to select for creating a threshold.

The following table lists the component types and metrics for an alert policy:
Table 1. Component Types and Metrics
Component Type Metric Description
Topic UNDER REPLICATED PARTITIONS COUNT Total number of partitions that are under replicated for a topic.
BYTES IN PER SEC Bytes per second coming in to a topic.
BYTES OUT PER SEC Bytes per second going out from a topic. It does not count the internal replication traffic.
OUT OF SYNC REPLICA COUNT Total number of replicas that are not in sync with the leader for a topic.
Producer IS PRODUCER ACTIVE Checks whether a producer is active.
Cluster ACTIVE CONTROLLER COUNT Number of brokers in the cluster reporting as the active controller in the last interval.
ONLINE BROKER COUNT Number of brokers that are currently online.
UNCLEAN LEADER ELECTION COUNT Number of unclean partition leader elections in the cluster reported in the last interval.
UNDER REPLICATED PARTITIONS COUNT Total number of topic partitions in the cluster that are under replicated.
LEADER ELECTION PER SEC Rate of partition leader elections.
OFFLINE PARTITIONS COUNT Total number of topic partitions, in the cluster, that are offline.
Consumer CONSUMER GROUP LAG How far consumer groups are behind the producers.
IS CONSUMER ACTIVE Checks whether a consumer is active.
Broker BYTES IN PER SEC Number of bytes per second produced to a broker.
ZOOKEEPER SESSION EXPIRATION PER SEC Rate at which brokers are experiencing Zookeeper session expirations per second.
TOTAL PRODUCE REQUESTS PER SEC Total number of produce requests to a broker per second.
PARTITION IMBALANCE PERCENTAGE The partition imbalance for a broker. It is calculated as: (abs(average_no_of_partitions_per_broker - actual_no_of_partitions_per_broker) / average_no_of_partitions_per_broker) * 100
BYTES OUT PER SEC Number of bytes per second fetched from a broker. It does not count the internal replication traffic.
IS BROKER DOWN Checks whether a broker is down.
TOTAL PRODUCE REQUEST LATENCY Latency of produce requests to this broker at the 99th percentile (in ms).
ISR SHRINKS PER SEC Rate at which brokers are experiencing InSync Replica Shrinks (number of shrinks per second).
TOTAL FETCH CONSUMER REQUEST LATENCY Latency of fetch consumer requests to this broker at 99th percentile (in ms).