Configuration Reference for Splunk Sink Connector for Confluent Platform

To use this connector, specify the name of the connector class in the connector.class configuration property.

connector.class=com.splunk.kafka.connect.SplunkSinkConnector

Connector-specific configuration properties are described below.

Note

These are properties for the self-managed connector. If you are using Confluent Cloud, see Splunk Sink Connector for Confluent Cloud.

splunk.hec.token

Splunk Http Event Collector (HEC) token.

  • Type: password
  • Importance: high
splunk.hec.uri

Splunk HEC URIs. Either a list of FQDNs or IPs of all Splunk indexers, separated with a ,, or a load balancer. The connector load balances to indexers using round robin. Splunk Connector round robins to this list of indexers: https://hec1.splunk.com:8088,https://hec2.splunk.com:8088,https://hec3.splunk.com:8088

  • Type: string
  • Importance: high
splunk.hec.ssl.trust.store.password

Password for the trust store.

  • Type: password
  • Default: [hidden]
  • Importance: high
splunk.hec.ssl.trust.store.path

Path on the local disk to the certificate trust store.

  • Type: string
  • Default: “”
  • Importance: high
splunk.hec.total.channels

Total HEC Channels used to post events to Splunk. When enabling HEC ACK, setting to the same or 2X number of indexers is generally good.

  • Type: int
  • Default: 2
  • Importance: high
splunk.header.custom

This setting enables looking for Record headers with these values and adding them to each event if present. Multiple headers are separated by comma. For example: custom_header_1,custom_header_2,custom_header_3.

  • Type: string
  • Default: “”
  • Importance: medium
splunk.header.host

Header to use for Splunk Header Host.

  • Type: string
  • Default: splunk.header.host
  • Importance: medium
splunk.header.index

Header to use for Splunk Header Index.

  • Type: string
  • Default: splunk.header.index
  • Importance: medium
splunk.header.source

Header to use for Splunk Header Source.

  • Type: string
  • Default: splunk.header.source
  • Importance: medium
splunk.header.sourcetype

Header to use for Splunk Header Sourcetype.

  • Type: string
  • Default: splunk.header.sourcetype
  • Importance: medium
splunk.header.support

This setting enables Kafka Record headers to be used for meta data override.

  • Type: boolean
  • Default: false
  • Importance: medium
splunk.hec.ack.enabled

When set to true, the connector polls event ACKs for POST events before check-pointing the Kafka offsets. This setting enables guaranteed delivery and prevents data loss but may result in lower overall throughput.

  • Type: boolean
  • Default: false
  • Importance: medium
splunk.hec.ack.poll.interval

Controls the event ACKs polling interval. This setting is only applicable when splunk.hec.ack.enabled is set to true. By default, this setting is 10 seconds.

  • Type: int
  • Default: 10
  • Importance: medium
splunk.hec.ack.poll.threads

Controls how many threads should be spawned to poll event ACKs. This setting is used for performance tuning and is only applicable when splunk.hec.ack.enabled is set to true. By default, this is set to 2.

  • Type: int
  • Default: 2
  • Importance: medium
splunk.hec.backoff.threshhold.seconds

The amount of time the connector waits before attempting to resend failed events to Splunk.

  • Type: int
  • Default: 60
  • Importance: medium
splunk.hec.event.timeout

This setting determines how long the connector will wait for an event to be acknowledged before timing out and attempting to resend the event. This setting is applicable when splunk.hec.ack.enabled is set to true. By default, this is set to 300 seconds.

  • Type: int
  • Default: 300
  • Importance: medium
splunk.hec.http.keepalive

This setting enables or disables HTTP connection keep-alive. By default, this is set to true.

  • Type: boolean
  • Default: true
  • Importance: medium
splunk.hec.max.batch.size

The maximum batch size when posting events to Splunk. The size is the actual number of Kafka records, not the byte size. By default, this is set to 500.

  • Type: int
  • Default: 500
  • Importance: medium
splunk.hec.max.http.connection.per.channel

The maximum number of HTTP connections pooled for one HEC Channel when posting events to Splunk.

  • Type: int
  • Default: 2
  • Importance: medium
splunk.hec.max.outstanding.events

The maximum amount of unacknowledged events kept in memory by the connector. When the threshold is exceeded, a back pressure event is triggered to slow the collection of events. By default, this threshold is set to 1000000 events.

  • Type: int
  • Default: 1000000
  • Importance: medium
splunk.hec.max.retries

The maximum number of retries for a failed batch before the task is killed. When set to -1 (the default) the connector retries indefinitely.

  • Type: int
  • Default: -1
  • Importance: medium
splunk.hec.raw

Enable this setting to ingest data using the /raw HEC endpoint instead of the /event HEC endpoint. By default, this setting is false and the /event HEC endpoint is used.

  • Type: boolean
  • Default: false
  • Importance: medium
splunk.hec.raw.line.breaker

This setting is used to specify a custom line breaker to help Splunk separate events correctly. For example, you can specify ##### as a special line breaker and Splunk will split events on those characters. This is only applicable when splunk.hec.raw is set to true.

  • Type: string
  • Default: “”
  • Importance: medium
splunk.hec.ssl.validate.certs

Enables or disables HTTPS certification validation. By default, this is set to true.

  • Type: boolean
  • Default: true
  • Importance: medium
splunk.hec.use.record.timestamp

When set to true, the timestamp is retrieved from the Kafka record and passed to Splunk as a HEC meta-data override. This indexes events in Splunk with the record timestamp. By default, this is set to true.

  • Type: boolean
  • Default: true
  • Importance: medium
splunk.indexes

Splunk index names for Kafka topic data separated by a comma for multiple topics to indexers. Example: “prod-index1,prod-index2,prod-index3”

  • Type: string
  • Default: “”
  • Importance: medium
splunk.sources

Splunk event source metadata for Kafka topic data. The same configuration rules as indexes apply. If unconfigured, the default source binds to the HEC token.

  • Type: string
  • Default: “”
  • Importance: medium
splunk.sourcetypes

Splunk event source type metadata for Kafka topic data. The same configuration rules as indexes apply here. If unconfigured, the default source binds to the HEC token. Only configure this when using the JSON Event endpoint (splunk.hec.raw=false).

  • Type: string
  • Default: “”
  • Importance: medium
splunk.hec.json.event.enrichment

This setting is used to enrich raw data with extra metadata fields. It contains a list of key value pairs separated by ,. The configured enrichment metadata will be indexed along with raw event data by Splunk. This is only applicable to the /event HEC endpoint (splunk.hec.raw=false). Data enrichment for the /event HEC endpoint is only available in Splunk Enterprise 6.5 and above. By default, this setting is empty.

  • Type: string
  • Default: “”
  • Importance: low
splunk.hec.json.event.formatted

This setting ensures events are preformatted into the proper HEC JSON format and have metadata and event data so that they are indexed correctly by Splunk. Set this property to true for events that are already in HEC format.

  • Type: boolean
  • Default: false
  • Importance: low
splunk.hec.socket.timeout

The maximum duration in seconds to read/write data to network before an internal TCP Socket timeout occurs. By default, this is set to 60 seconds.

  • Type: int
  • Default: 60
  • Importance: low
splunk.hec.threads

Controls how many threads are spawned to perform data injection through HEC in a single connector task.

  • Type: int
  • Default: 1
  • Importance: low
splunk.hec.track.data

When set to true, data loss and data injection latency metadata will be indexed along with raw data. This setting only works in conjunction with /event HEC endpoint (splunk.hec.raw=false).

  • Type: boolean
  • Default: false
  • Importance: low