Auto Data Balancing in Confluent Platform¶

The confluent-rebalancer tool balances data so that the number of leaders and disk usage are even across brokers and racks on a per topic and cluster level while minimizing data movement. It also integrates closely with the replication quotas feature in Apache Kafka® to dynamically throttle data-balancing traffic.

The tool is part of Confluent Platform and can also be installed on its own using the confluent-rebalancer package.

Important

The Manage Self-Balancing Kafka Clusters in Confluent Platform feature is the preferred alternative to Auto Data Balancer, starting in Confluent Platform 6.0.0. For a detailed feature comparison, see Self-Balancing vs. Auto Data Balancer.
Auto Data Balancer and Self-Balancing cannot be used together. If you want to run Auto Data Balancer, you must first make sure that Self-Balancing is off.
If Configure Multi-Region Clusters in Confluent Platform is enabled and a topic is created with a replica placement policy, Auto Data Balancer will redistribute the preferred leaders among all racks that have replicas.