Scale Confluent Platform Clusters and Balance Data¶
Scale Kafka cluster¶
At a high level, adding brokers to a cluster involves a few key steps:
- Define the configuration for each of the new brokers.
- Provision storage, networking, and compute resources to the brokers.
- Start the brokers with the defined configurations and provisioned resources.
- Reassign partitions across the cluster so that the new brokers share the load and the cluster’s overall performance improves.
To automate the above process, Confluent for Kubernetes (CFK) leverages Self-Balancing, which is enabled by default with CFK.
If you need to manually enable Self-Balancing, see Disable or re-configure Self-Balancing for the steps.
Scale up Kafka cluster¶
To scale up a Kafka cluster:
Increase the number of Kafka replicas using one of the following options:
Use the
kubectl scale
command:kubectl scale kafka <Kafka-CR-name> --replicas=N
Increase the number of Kafka replicas in the Kafka custom resource (CR) and apply the new setting with the
kubectl apply
command:spec: replicas:
Ensure that proper DNS records are configured for the new brokers, and ensure that the CFK can resolve the new broker hostname, using a command such as
nslookup
.If you are using
hosts
file instead of a DNS service, updatehosts
file with the new brokers information. For example:Get the new broker IP addresses:
kubectl get services -n <namespace>
Refer to the existing broker host names with the broker prefix, and derive the hostnames of the new brokers.
Add the new broker hosts to the
/etc/hosts
file, and inject the updated file to the CFK pod as described in Adding entries to Pod /etc/hosts.
Scale down Kafka cluster¶
You have two options to scale down Kafka clusters:
Automatic scale down
This option is supported for Confluent Platform versions 7.x and later.
Manual scale down
This option is supported for Confluent Platform versions 6.x and later.
With either option above, do not decrease the number of Kafka brokers to less than the largest replication factor of any topic in your Kafka cluster. CFK sets a default replication factor of 3 for all Kafka topics.
Automatically scale down Kafka cluster¶
With Confluent Platform 7.x and later, you can have CFK scale down Kafka clusters.
CFK leverages the Self-Balancing feature to automate the shrinking process. Self-Balancing is enabled by default with CFK.
To have CFK automatically scale down your cluster, the following requirements must be satisfied:
Set up the Admin REST Class as described in Manage Confluent Admin REST Class. CFK uses the KafkaRestClass resource in the namespace where the Kafka cluster is running.
If the Admin REST Class is set up with the basic authentication for the REST client, the first user listed in
basic.txt
will be used to shrink the cluster. See Basic authentication for details onbasic.txt
.This first user must have a role that is listed under
spec.services.kafkaRest.authentication.basic.roles
in the Kafka custom resource (CR).If the Kafka brokers use the
DirectoryPathInContainer
property to specify the credentials to authenticate to Confluent Admin REST Class, you need to set up Vault and add the required Vault annotations to the CFK Helm values before you deploy CFK.If updating an existing CFK pod, you need to roll the CFK pod after updating the CFK Helm values. See Provide secrets for Confluent Platform operations without CRs for details.
To automatically scale down a Kafka cluster:
Make sure the Kafka cluster is stable.
Enable the feature through annotation:
kubectl annotate <Kafka CR name> platform.confluent.io/enable-shrink="true"
Decrease the number of brokers in the Kafka CR and apply the change using the
kubectl apply
command:spec: replicas:
replicas:
should not be set to less than3
.CFK triggers the workflow to shrink the Kafka cluster according to the value of
replicas
updated in the Kafka custom resource (CR).
Manually scale down Kafka cluster¶
With Confluent Platform 6.x, you must scale down Kafka clusters one broker at a time. Scaling down Kafka clusters multiple brokers at a time is not supported.
When CFK scales down a Kafka cluster by one broker, it deletes the broker pod and the backing persistentVolume. This deletes any partitions that were stored on that broker.
To manually scale down a Kafka cluster:
Trigger a broker removal using the kafka-remove-brokers command available as a part of Self-Balancing.
<id>
specifies the identifier for the broker you are removing:kafka-remove-brokers --bootstrap-server <bootstrap-broker-endpoint>:<port> \ --command-config <client.properties> \ --broker-id <id> \ --delete
The broker id (
<id>
) should be the one that will likely be removed. For example, with 5 brokers (0
-4
), you want to remove broker id4
(with the--broker-id 4
flag) as the replica scale down will remove broker-4 in the next step.You can then monitor the removal using the below command:
kafka-remove-brokers --bootstrap-server <bootstrap-broker-endpoint>:<port> \ --command-config <client.properties> \ --broker-id <id> \ --describe
Wait an appropriate time for partitions that were on that broker to be replicated to other existing brokers.
You can check if the partitions were replicated by looking at the Kafka metric,
Total Under Replicated Partitions Across Kafka Brokers
, in Control Center UI or in your monitoring solution. The value should be0
.Depending on the cluster size and the number of topics, this time could vary.
Decrease the number of Kafka brokers by one using one of the following options. The number of Kafka brokers should not be set to fewer than
3
.The examples scale down Kafka from
5
to4
.Use the
kubectl scale
command:kubectl scale kafka <Kafka-CR-name> --replicas=4
Decrease the number of Kafka replicas in the Kafka CR and apply the new setting with the
kubectl apply
command:spec: replicas: 4
Repeat the above steps, decreasing the number of Kafka brokers by one, waiting until the desired broker count is achieved.
Disable or re-configure Self-Balancing¶
The Self-Balancing feature is enabled by default in Confluent for Kubernetes.
To balance the load across the cluster whenever an imbalance is detected, set
confluent.balancer.heal.uneven.load.trigger
to ANY_UNEVEN_LOAD
. The default is EMPTY_BROKER
.
kafka:
configOverrides:
server:
- confluent.balancer.heal.uneven.load.trigger=ANY_UNEVEN_LOAD
For a complete list of available settings you can use to control Self-Balancing, see Configuration Options and Commands for Self-Balancing Clusters.
Scale other Confluent Platform clusters¶
Use the below command to scale up or down other Confluent Platform components:
kubectl scale <CP-component-CR-kind> <component-CR-name> --replicas=N