Plan for Confluent Platform Deployment Using Confluent for Kubernetes

This topic contains the supported and recommended options to consider when you plan to deploy Confluent Platform using Confluent for Kubernetes (CFK).

Deployment workflow

At the high level, the workflow to configure, deploy, and manage Confluent Platform using CFK is as follows:

Review this topic and review the required and recommended options for the deployment environment.
Prepare your Kubernetes environment.
For details, see Prepare Kubernetes Cluster for Confluent Platform and Confluent for Kubernetes.
Deploy Confluent for Kubernetes.
For details, see Deploy Confluent for Kubernetes.
Configure Confluent Platform.
For details, see Configure Confluent Platform for Deployment with Confluent for Kubernetes.
Deploy Confluent Platform.
For details, see Deploy Confluent Platform using Confluent for Kubernetes.
Manage Confluent Platform.
For details, see Manage Confluent Platform with Confluent for Kubernetes.

Deployment checklist

The following is a deployment checklist to prepare for a Confluent Platform deployment:

Kubernetes platform
Cluster sizing for Confluent Platform
Docker registry
Storage
Kubernetes security
Confluent security
Network
Use Observer container (available starting with CFK 3.2.0)
Upgrades and updates

Supported environments and prerequisites

Review and address the following prerequisites before you start the installation process.

Kubernetes

Confluent for Kubernetes 3.2 supports the Kubernetes distributions with Cloud Native Computing Foundation (CNCF) conformant offerings. See the Supported Versions section for the specific versions supported.
Install kubectl.
Configure the kubeconfig file for your cluster.

If you are using Red Hat OpenShift as your Kubernetes distribution, you need to determine how you work with Red Hat’s Security Context Constraint (SCC). See this documentation set to understand how.

Hardware

The underlying processor architecture of your Kubernetes worker nodes must be a supported version for the Confluent Platform version you plan to deploy.

Currently, Confluent Platform supports x86 and ARM64 hardware architecture. For supported hardware for a specific version of Confluent Platform, see Hardware Requirements for Confluent Platform.

Note

Starting with version 3.2.0, on Linux s390x architecture, the Confluent for Kubernetes image is available only for Managing Flink Applications. CFK does not suppot deploying or managing Confluent Platform on Linux s390x. While Confluent provides CFK Docker images for Linux s390x, Confluent does not support issues specific to this architecture. Confluent provides support only if the issue also occurs on a supported architecture, such as Linux AMD64 or Linux ARM64. To discuss third-party support options for Linux s390x, contact Confluent support.

Operating systems

The Operating System (OS) of your Kubernetes worker nodes should match the supported Linux kernel version for the Confluent Platform version you plan to deploy.

Supported Linux kernels are the kernels that the supported Linux OS distributions are built upon. For supported OSs for a specific version of Confluent Platform, see OS Requirements for Confluent Platform.

Even if an OS is not on the Confluent Platform supported list, you can still deploy CFK on the container-optimized OS for the Kubernetes worker nodes if that OS is based on the same Linux kernel.

For example, you can deploy Confluent Platform 7.9 with CFK 2.11 on BottleRocket because the OS is built on the same Linux kernel 6.1 as AWS Linux 2023 certified for Confluent Platform.

Supported Linux kernel versions in CFK 3.2 are:

OS Distro and Version	Linux Kernel Version	Reference
AWS Linux 2023	6.1	AL2023 kernel changes from AL2 - Amazon Linux 2023
Debian 12	6.1
Debian 10	4.19
Ubuntu 22	5.15	Kernels covered by Livepatch
Ubuntu 20	5.4	Kernels covered by Livepatch
RHEL 9	5.14	Red Hat Enterprise Linux
RHEL 8	4.18	Red Hat Enterprise Linux

Helm

Helm 3 is required for Confluent for Kubernetes.

Confluent Platform

The following table shows the compatibility information with CFK, Confluent Platform, and Kubernetes.

Note that the Kubernetes Versions column has the Kubernetes and OpenShift versions that are supported for the first release of that CFK version, for example, 2.10.0 for CFK 2.10.x. For the supported Kubernetes and OpenShift versions of the subsequent patch releases, see the release notes.

For the CFK image tag of a specific patch release, see Confluent for Kubernetes image tags.

CFK Operator Version	Confluent Platform Versions	Kubernetes Versions	Release Date	Standard End of Support
CFK 3.2.x	7.4.x - 8.2.x	1.27 - 1.35 (OpenShift 4.14 -4.21)	Mar 11, 2026	Mar 11, 2028
CFK 3.1.x	7.3.x - 8.1.x	1.26 - 1.34 (OpenShift 4.13 -4.19)	Oct 15, 2025	Oct 15, 2027
CFK 3.0.x	7.2.x - 8.0.x	1.25 - 1.33 (OpenShift 4.12 -4.18)	Jun 13, 2025	Jun 13, 2027
CFK 2.11.x	7.1.x - 7.9.x	1.25 - 1.32 (OpenShift 4.12 -4.18)	Feb 21, 2025	Feb 21, 2027
CFK 2.10.x	7.0.x - 7.8.x	1.25 - 1.31 (OpenShift 4.11 - 4.17)	Dec 4, 2024	Dec 4, 2026
CFK 2.9.x	7.0.x - 7.7.x	1.25 - 1.30 (OpenShift 4.11 - 4.16)	Jul 30, 2024	Jul 30, 2026

Starting with CFK 2.9, the standard support policy of CFK is for 2 years from the first patch release (.0) date.
Platinum tier support is not offered for CFK.
You can apply your Confluent Platform Platinum tier support contract to the Confluent Platform components deployed by CFK when both of the following conditions are true:
- You are on a currently supported version of CFK.
- The Confluent Platform version you want to use is compatible with a currently supported version of CFK.
CFK supports the same versions of Confluent Control Center as corresponding Confluent Platform versions support.

Kafka inter-broker protocol version

When you perform a greenfield installation of Confluent Platform using CFK, set the Kafka inter-broker protocol (IBP) version to make sure you use the correct version of IBP and avoid issues during deployments and future upgrades.

Set the IBP version in the Kafka custom resource (CR) according to the version of Kafka you are installing. For the correct IBP version, refer to the table in the Confluent Platform Upgrade Guide.

kind: Kafka
spec:
  configOverrides:
    server:
      - inter.broker.protocol.version=<current Kafka broker protocol version>

The example below is for installing Confluent Platform 8.2. inter.broker.protocol.version is set to 4.2 that corresponds to 8.2.

kind: Kafka
spec:
  configOverrides:
    server:
      - inter.broker.protocol.version=4.2

Confluent for Kubernetes image tags

The Confluent Platform and Confluent for Kubernetes images are hosted in the confluentinc repository in Docker Hub.

The following table shows the mapping among Confluent for Kubernetes (CFK) versions corresponding image tags, and the custom resource definition (CRD) versions. Note that these image tag versioning is available in:

2.9.5 and later versions of 2.9.x
2.10.1 and later versions of 2.10.x
2.11.0 and later versions

CFK Operator Version	CFK Image Tag	CRD App Version	CRD Chart Version
3.2.0	0.1514.1	3.2.0	v0.1514.1
3.1.1	0.1351.59	3.1.1	v0.1351.59
3.1.0	0.1351.24	3.1.0	v0.1351.24
3.0.3	0.1263.105	3.0.3	v0.1263.105
3.0.2	0.1263.79	3.0.2	v0.1263.79
3.0.1	0.1263.34	3.0.1	v0.1263.34
3.0.0	0.1263.8	3.0.0	v0.1263.8
2.11.3	0.1193.70	2.11.3	v0.1193.70
2.11.2	0.1193.47	2.11.2	0.1193.47
2.11.1	0.1193.34	2.11.1	v0.1193.34
2.11.0	0.1193.1	2.11.0	v0.1193.1
2.10.3	0.1145.71	2.10.3	v0.1145.71
2.10.2	0.1145.50	2.10.2	v0.1145.50
2.10.1	0.1145.35	2.10.1	v0.1145.35
2.9.7	0.1033.110	2.9.7	v0.1033.110
2.9.6	0.1033.87	2.9.6	v0.1033.87
2.9.5	0.1033.71	2.9.5	v0.1033.71

To get the list of your current CFK CRDs and the versions, install the CFK plugin and run the following plugin option:

kubectl confluent cluster list-crd

The following table shows the mapping among Confluent for Kubernetes (CFK) versions (2.9.4, 2.10.0, and earlier) corresponding image tags, and the CRD versions.

CFK Operator Version	CFK Image Tag	CFK CRD Version
2.10.0	0.1145.6	controller-gen.kubebuilder.io/version: v0.15.0
2.9.4	0.1033.43	controller-gen.kubebuilder.io/version: v0.15.0
2.9.3	0.1033.33	controller-gen.kubebuilder.io/version: v0.15.0
2.9.2	0.1033.22	controller-gen.kubebuilder.io/version: v0.15.0
2.9.1	0.1033.10	controller-gen.kubebuilder.io/version: v0.15.0
2.9.0	0.1033.3	controller-gen.kubebuilder.io/version: v0.15.0
2.8.5	0.921.77	controller-gen.kubebuilder.io/version: v0.15.0
2.8.4	0.921.63	controller-gen.kubebuilder.io/version: v0.15.0
2.8.3	0.921.40	controller-gen.kubebuilder.io/version: v0.15.0
2.8.2	0.921.20	controller-gen.kubebuilder.io/version: v0.9.2
2.8.1	N/A	controller-gen.kubebuilder.io/version: v0.9.2
2.8.0	0.921.2	controller-gen.kubebuilder.io/version: v0.9.2
2.7.5	0.824.84	controller-gen.kubebuilder.io/version: v0.14.0
2.7.4	0.824.61	controller-gen.kubebuilder.io/version: v0.9.2
2.7.3	0.824.40	controller-gen.kubebuilder.io/version: v0.9.2
2.7.2	0.824.33	controller-gen.kubebuilder.io/version: v0.9.2
2.7.1	0.824.17	controller-gen.kubebuilder.io/version: v0.9.2
2.7.0	0.824.2	controller-gen.kubebuilder.io/version: v0.9.2

Cluster sizing for Confluent Platform

Review the sizing guidelines and recommendations in this section before creating your Confluent Platform cluster.

The following table provides the guidance on the minimum cluster sizing:

Important

Starting with Confluent Platform version 8.0, ZooKeeper is no longer part of Confluent Platform.

Important

Starting with Confluent Platform version 8.0, Confluent Control Center (Legacy) is no longer supported with Confluent Platform. Use Control Center with Confluent Platform 8.0 and later.

Cluster Type	Production				Minimum
Resource	Pods	CPU	Memory	Disk	Pods	CPU	Memory	Disk
ZooKeeper	5	4	4 GB	100 GB	3	2	4 GB	100 GB
KRaft Controllers	5	5	4 GB	64 GB; use SSD	3	3	4 GB	64 GB; use SSD
Kafka Brokers	3	24	64 GB	12 TB	3	4	16 GB	1 TB
Connect Workers	2	12	24 GB	50 GB ^[1]	2	4	16 GB	50 GB ^[1]
Schema Registry	2	2	4 GB	N/A ^[2]	2	2	4 GB	N/A ^[2]
Control Center ^[3]	1	4	8 GB	200 GB; preferably SSD	1	4	8 GB	200 GB; preferably SSD
Confluent Control Center (Legacy)	1	12	32 GB	300 GB; preferably SSD	1	4	16 GB	250 GB; preferably SSD
ksqlDB	2	4	32 GB	100 GB; use SSD ^[4]	2	4	20 GB	100 GB; use SSD ^[4]
Confluent REST Proxy	2	16	1 GB+ ^[5]	N/A ^[2]	2	16	1 GB+ ^[5]	N/A ^[2]

^[1] The disk storage requirement for Connect workers depends on how many Connect plugins you are downloading per Connect clusters.
^[2] Only required for installation.
^[3] The resource requirement for Control Center should be set at the pod level. It is not required to set container level resources, specifically for Prometheus and Alertmanager.
^[4] The disk storage requirement for ksqlDB depends on the number of concurrent queries and the aggregation performed.
^[5] 1 GB overhead plus 64 MB per producer and 16 MB per consumer.

Additional considerations for Confluent Platform

Avoid placing multiple replicas of the same component, such as KRaft, Kafka, ZooKeeper, on a single Kubernetes node.
At least three Kafka brokers are required for a fully functioning Confluent Platform deployment. A one- or two-broker configuration is not supported and should not be used for development, testing, or production.
For more comprehensive, workload-based system requirements for Control Center, see Control Center System Requirements.

Considerations for Kubernetes worker nodes

The number of Kubernetes worker nodes required in your cluster depends on whether you are deploying a development testing cluster or a production-ready cluster.

Production Cluster

Review the default capacity values in the Confluent Platform component custom resources (CRs). Determine how these values affect your production application and build out your nodes accordingly.

You can also use the on-premises System Requirements to determine what is required for your cloud production environment. Note that the on-premises storage information provided is not applicable for cloud environments.

Development Testing Cluster

Each node should typically have a minimum of 2 or 4 CPUs and 7 to 16 GB RAM. If you are testing a deployment of CFK and all Confluent Platform components, you can create a 10-node cluster with six nodes for Apache ZooKeeper™ and Apache Kafka® pods (three replicas each) and four nodes for all other components pods.

Confluent Platform component cluster sizing and resource requirements

In CFK, you specify resource requirements using the limits and requests properties for custom resources (CR). The Confluent Platform pods are configured with the following default CPU and memory resources:

resources:
  limits:
    cpu: 500m
    memory: 512Mi
  requests:
    cpu: 100m
    memory: 256Mi

The above default resource values are sufficient for most use cases. If you plan to manage a high number of day-2 application custom resources (CRs), such as 100 or more, refer to manage resources about increasing the number of application CRs and the corresponding memory and CPU.

Docker registry

Confluent for Kubernetes pulls Confluent Docker images from a Docker registry and deploys those on to your Kubernetes cluster.

By default, Confluent for Kubernetes deploys publicly-available Docker images hosted on Docker Hub from the confluentinc repositories.

If you choose to use your own Docker registry and repositories, you need to pull the images from the Confluent repositories and upload to your Docker registry repositories.

See Use Custom Docker Registry for Confluent Platform Using Confluent for Kubernetes for details on using custom private registry.

Storage

You need to provide dynamic persistent storage for all Confluent Platform components with block-level storage solutions, such as AWS EBS, Azure Disk, GCE Disk, Ceph RBD, and Portworx.

See Configure Storage for Confluent Platform Using Confluent for Kubernetes for details on storage configuration options.

Kubernetes security

With Kubernetes Role-based access control (RBAC) and namespaces, you can deploy Confluent Platform in one of two ways:

(Recommended) Provide Confluent for Kubernetes with access to provision and manage Confluent Platform resources in one specific namespace.
Provide Confluent for Kubernetes with access to provision and manage Confluent Platform resources across all namespaces in the Kubernetes cluster.

Both options above require Kubernetes role bindings configuration. See Configure Kubernetes RBAC and Custom Resource Definitions for details.

Confluent security

Confluent supports the following processes to enforce security.

Authentication
Authorization
Network encryption
Configuration secrets

For production deployments, Confluent recommends the following security mechanisms:

Enable one of the following methods for Kafka client authentication:
- mTLS
- SASL/PLAIN
- SASL/PLAIN with LDAP
  For SASL/PLAIN, the identity can come from your LDAP server.
Enable Confluent Role Based Access Control (RBAC) for authorization, with user/group identity coming from the LDAP server.
Enable TLS for network encryption for both internal traffic between Confluent Platform components and external traffic from clients to Confluent Platform components.

See Production recommended secure setup for a tutorial scenario to configure these security settings.

Network

Confluent Platform components can be accessed by users and client applications from:

Inside the Kubernetes network
Outside of the Kubernetes network

The following are the options to externally expose Confluent Platform:

Load balancers
- For Kafka, a Layer 4 load balancer that supports TLS passthrough is required.
- For other Confluent components with HTTP endpoints, a Layer 4/7 load balancer is required.
Kubernetes node ports
Static external access with host-based or port-based routing
OpenShift routes

Default ports in Confluent for Kubernetes

CFK uses the following default ports for Confluent Platform components. You can override the default ports in the component custom resources.

7203: JMX port
7777: Jolokia port
7778: Prometheus port
8081: Schema Registry default port
9081: Schema Registry internal listener port
8082: Confluent REST Proxy port
8083: Connect port
8088: ksqlDB default port
9088: ksqlDB internal listener port
8090: MDS default port
9090: MDS internal listener port
9021: Control Center port
9071: Kafka Internal port
9072: Replication port
9073: Token port
9092: Kafka External port

Internet Protocol versions

You can use CFK to deploy Confluent Platform on Kubernetes clusters on the following Internet Protocol (IP) versions:

IPv4
IPv6
Dual-stack with both IPv4 and IPv6.

The following are the requirements and considerations for IP versions.

AWS supports IPv6 only clusters and does not support dual stack clusters.
Google Cloud supports dual stack clusters and does not support IPv6 only clusters.
Your load balancers and identity providers must be configured to support the network protocol which your cluster is configured with.
IPv6 or dual stack clusters can be enabled only on clusters using Java 11 or above.
IPv6 or dual stack clusters can be enabled only on clusters using KRaft.

Confluent Platform cluster log retention

When deploying Confluent Platform through Confluent for Kubernetes (CFK), it’s essential to plan a log retention management early in the planning phase, and should have a working log retention in the production environment.

When a Kubernetes pod is restarted, whether due to a crash or manual restart, the logs that were generated during the pod’s previous lifecycle are not automatically retained by the pod. This is because pods are designed to be ephemeral, and data stored within them can be lost when they are restarted or deleted. However, the historic Confluent Platform pod logs are crucial for troubleshooting purposes.

There are several common approaches available for Kubernetes pod log retention, such as node-level logging, sidecar containers, or centralized Logging Solutions.

Centralized Logging is one of the solutions that forwards the Confluent Platform pod logs to a centralized logging system, such as, Elasticsearch, Logstash, and Kibana (ELK), or to a log management tool, such as Grafana, Loki. It is important to note that the integration with these third-party log management tools is not provided by Confluent, but its recommended that you implement a Confluent Platform pod log retention solution in the production environment.

Upgrades and updates

CFK provides a declarative API and configuration automation for running Confluent Platform on Kubernetes. You can update configurations and upgrade versions for an existing deployment by applying the updated declarative specs in custom resource files.

However, the following are configuration scenarios that cannot be enabled or changed for an existing deployment:

Confluent RBAC
You cannot disable Confluent RBAC on an existing cluster set up with RBAC.
As a workaround, you can grant ClusterAdmin role to a root group containing all users.
TLS certificates mechanism
You cannot change the mechanism for how TLS certificates are provided between auto-generated certificates and user-provided certificates.
TLS encryption
You cannot enable TLS encryption on a TLS disabled cluster.
You cannot enable TLS encryption on a TLS disabled Kafka REST API server (kafka.spec.services.kafkaRest.tls).
Kafka listener authentication
You cannot enable the authentication mechanism on an existing Kafka and KRaft controller listeners.
Also, you cannot change the authentication mechanism used for an existing Kafka and KRaft controller listeners.
Kafka metrics TLS and authentication configurations
External network access mechanism for Kafka brokers
- You cannot change the external network access mechanism for Kafka brokers among load balancer, node ports, static ingress controller based routing.
- Changing the configuration of the external access mechanism, such as updating the domain or port, may require a manual restart of the Kafka cluster for the configuration to take effect. Refer to the specific external access configuration guide for more details.
Storage class for persistent storage
You cannot change the storage class used to create Persistent Volume Claims for Confluent components.
Configuration secrets mechanism
You cannot change the configuration secrets mechanism between using Kubernetes secrets and using the directory in path containers feature.

Support for Kubernetes ecosystem

The Confluent for Kubernetes (CFK) product encapsulates a set of Kubernetes Controllers and business logic that automate configuring, deploying, and managing multiple aspects of the Confluent Platform on a Kubernetes of your choice.

CFK uses the standard Kubernetes API. It calls this Kubernetes API, and it expects that the Kubernetes API does what it needs to do. CFK does not manage aspects beyond the Kubernetes API - such as vendor implementations like Amazon LoadBalancers, or StorageClasses like Amazon EBS.

The following examples illustrate the support boundaries of CFK.

For storage, the Kubernetes API implements and provides the APIs for StorageClass, PersistentVolumes, PersistentVolumeClaims. When CFK configures and deploys Kafka brokers, it takes a user-provided StorageClass and uses that to create a PersistentVolumeClaim for the Kafka broker storage. CFK does not check and validate what’s in the user-provided StorageClass. CFK does not check whether the Persistent Volume is created - CFK relies on the storage vendor implementation for that.
For networking with load balancers, the Kubernetes API implements and provides the APIs for Network Service and LoadBalancer. When CFK configures and deploys Kafka with a load balancer, CFK creates a LoadBalancer type service, one for every Kafka broker. Kafka then relies on LoadBalancer vendor implementation for the actual LoadBalancer instance to be configured and deployed. Amazon ELB, Google LB, Azure LB, MetalLB are all examples of such LoadBalancer vendor implementations.

Within the described boundary, CFK is tested to ensure that it invokes the Kubernetes API in the right way and creates the correct Kubernetes objects. CFK tests do validate that a LoadBalancer type service is created, with the configurations that the user specified in the Kafka broker custom resource. CFK does not test that a Google load balancer is properly configured and deployed to route traffic to Kafka brokers. Confluent depends on the Google implementation of the load balancer to do the right thing.

With the above points in mind, when you are deploying CFK, consider the following guidelines in order to efficiently and effectively run a production system:

Identify the architecture you want to use - the Confluent components and the Kubernetes runtime - and validate core deployment and management functions in your environment.
Develop a runbook and troubleshooting steps for your deployment. Ensure your team is familiar with this runbook before you go to production.
When you need to get support for issues in your deployment, be prepared to pull in the respective vendors to cover the entire architecture. If there is a networking issue in your CFK deployment, be prepared to pull in the Kubernetes vendor and the vendor for the networking service you are using along with Confluent.

Custom deployment pipelines and support scope

Confluent documents and validates standard workflows for installing and upgrading Confluent for Kubernetes (CFK) and Confluent Platform. For most deployments, Confluent recommends following these workflows:

Many customers integrate these documented workflows into their own continuous integration/continuous delivery (CI/CD) systems. Confluent supports using CI/CD to orchestrate documented workflows, provided the workflows are not modified in unsupported ways.

A custom deployment pipeline is any installation or upgrade flow that significantly changes or replaces the documented workflows, examples include:

Generating or modifying CFK or Confluent Platform manifests in non-standard ways,
Inserting additional proprietary automation steps that alter the order, scope, or behavior of the documented workflow, or
Chaining multiple tools in a way that changes how CFK or Confluent Platform are deployed or upgraded.

Confluent Support is responsible for CFK and Confluent Platform behavior when you deploy using the documented workflows, whether they are run manually or orchestrated by your CI/CD system. Confluent Support cannot own or troubleshoot the design, implementation, or operation of your custom CI/CD tooling and end-to-end pipelines, and issues isolated to those pipelines are outside the scope of standard Confluent Support.