Auto Data Balancing in Confluent Platform
The confluent-rebalancer tool balances data so that the number of leaders and disk usage are even across brokers and racks, at both the per-topic and cluster level, while minimizing data movement. It also integrates closely with the replication quotas feature in Apache Kafka® to dynamically throttle data-balancing traffic.
The tool is part of Confluent Platform and can also be installed on its own using the confluent-rebalancer package.
Important
- Starting in Confluent Platform 6.0.0, Self-Balancing Clusters (see Manage Self-Balancing Kafka Clusters in Confluent Platform) is the preferred alternative to Auto Data Balancer. For a detailed feature comparison, see Self-Balancing vs. Auto Data Balancer.
- Auto Data Balancer and Self-Balancing cannot be used together. If you want to run Auto Data Balancer, you must first make sure that Self-Balancing is off.
- If multi-region clusters are enabled (see Configure Multi-Region Clusters in Confluent Platform) and a topic is created with a replica placement policy, Auto Data Balancer redistributes the preferred leaders among all racks that have replicas.
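As an illustration of the note about turning Self-Balancing off before running Auto Data Balancer, the fragment below is a minimal sketch of a broker properties file. It assumes the brokers are configured through a static properties file (the file name server.properties is an assumption) and that Self-Balancing is controlled by the confluent.balancer.enable property; check the Self-Balancing configuration reference for your Confluent Platform version before relying on it.

```properties
# Sketch only: broker configuration file (assumed to be server.properties).
# Turn Self-Balancing off so that Auto Data Balancer can be run instead.
confluent.balancer.enable=false
```

The setting would need to be applied consistently on every broker in the cluster, since the two features cannot be used together.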