Kafka Connect MapR DB Connector

The MapR DB Connector exports data from a Kafka topic and writes it to a MapR DB cluster.

Install MapR DB Connector

Prerequisite

This connector requires that the MapR Client is installed and working properly on the host running the Connect worker process. The Kafka Connect worker process must be started with the following JVM options:

-Dmapr.home.dir=/opt/mapr -Djava.library.path=/opt/mapr/lib/

You can do this by exporting the KAFKA_OPTS environment variable before starting Kafka Connect. For example:

export KAFKA_OPTS="-Dmapr.home.dir=/opt/mapr -Djava.library.path=/opt/mapr/lib/"

You can install this connector by using the Confluent Hub client (recommended) or you can manually download the ZIP file.

Confluent Hub

Navigate to your Confluent Platform installation directory and run this command to install the latest connector version. The connector must be installed on every machine where Connect will be run.

confluent-hub install confluentinc/kafka-connect-maprdb:latest

You can install a specific version by replacing latest with a version number. For example:

confluent-hub install confluentinc/kafka-connect-maprdb:1.0.0-preview

Download

Download the ZIP file and extract it into a directory that is listed on the plugin path of the Connect worker configuration properties (e.g. plugin.path=/usr/local/share/kafka/plugins). This must be done on each of the installations where Connect will be run. For more information, see Installing Plugins.
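The manual install can be sketched as the small shell helper below. This is only an illustration: install_connector_zip is a hypothetical function name, the plugin directory is the example value from the paragraph above, and the sketch assumes the unzip utility is available on the host.

```shell
# Directory listed in plugin.path of the Connect worker properties
# (example value from this page; adjust to your installation).
PLUGIN_DIR="/usr/local/share/kafka/plugins"

# install_connector_zip is a hypothetical helper, not part of any Confluent tool.
#   $1: path to the downloaded connector ZIP file
#   $2: plugin directory (optional, defaults to PLUGIN_DIR)
install_connector_zip() {
  zip_file="$1"
  target_dir="${2:-$PLUGIN_DIR}"
  # Create the plugin directory if it does not exist yet.
  mkdir -p "$target_dir" || return 1
  # Extract the archive; the connector ends up in its own subdirectory.
  unzip -q "$zip_file" -d "$target_dir"
}
```

Run install_connector_zip with the path of the downloaded ZIP file on every Connect host, then restart the worker so that it rescans the plugin path.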

Usage Notes

The table on the MapR DB cluster is selected based on the topic name. To write to a different table, use the RegexRouter transformation to rewrite the topic name before the record is sent to MapR DB.
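As a rough sketch of the renaming described above, the following writes a standalone connector configuration that applies RegexRouter. The topic name orders-raw, the transform alias route, and the file name maprdb-sink.properties are assumptions made for illustration; only the connector class and the RegexRouter class with its regex and replacement settings are taken from the connector and from Kafka Connect.

```shell
# Write a standalone sink configuration that renames the topic before
# the connector uses it to select the MapR DB table.
# Hypothetical values: topic "orders-raw", alias "route", output file name.
# The heredoc delimiter is quoted so that $1 is written literally.
cat > maprdb-sink.properties <<'EOF'
name=MapRDBSinkConnector1
connector.class=io.confluent.connect.mapr.db.MapRDBSinkConnector
tasks.max=1
topics=orders-raw
transforms=route
transforms.route.type=org.apache.kafka.connect.transforms.RegexRouter
transforms.route.regex=(.*)-raw
transforms.route.replacement=$1
EOF
```

With this configuration, records consumed from orders-raw reach the connector with the topic name orders, because the regex must match the whole topic name and $1 refers to its first capture group.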

Examples

This configuration is typically used with standalone workers.

name=MapRDBSinkConnector1
connector.class=io.confluent.connect.mapr.db.MapRDBSinkConnector
tasks.max=1
topics=< Required Configuration >

This configuration is typically used with distributed workers. Write the following JSON to connector.json, configure all of the required values, and use the command below to post the configuration to one of the distributed Connect workers. For more information, see the Kafka Connect REST API documentation.

Connect Distributed REST example
{
  "name" : "MapRDBSinkConnector1",
  "config" : {
    "name" : "MapRDBSinkConnector1",
    "connector.class" : "io.confluent.connect.mapr.db.MapRDBSinkConnector",
    "tasks.max" : "1",
    "topics" : "< Required Configuration >"
  }
}

Use curl to post the configuration to one of the Kafka Connect workers. Change http://localhost:8083/ to the endpoint of one of your Kafka Connect workers.

Create a new connector
curl -s -X POST -H 'Content-Type: application/json' --data @connector.json http://localhost:8083/connectors
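Update an existing connector. This endpoint takes the flat configuration map as its request body, so first save only the contents of the "config" object to a separate file (connector-config.json is a placeholder name).
curl -s -X PUT -H 'Content-Type: application/json' --data @connector-config.json http://localhost:8083/connectors/MapRDBSinkConnector1/config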
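After creating or updating the connector, it can help to confirm that it is running. The sketch below uses the status endpoint of the Kafka Connect REST API; the helper names connector_status_url and connector_status are hypothetical, and the base URL is the example endpoint used above.

```shell
# Base URL of one Connect worker (example endpoint from this page).
CONNECT_URL="http://localhost:8083"

# Build the status URL for a connector name (hypothetical helper).
connector_status_url() {
  printf '%s/connectors/%s/status\n' "$CONNECT_URL" "$1"
}

# Query the worker and print the JSON status document it returns
# (hypothetical helper; requires a reachable Connect worker).
connector_status() {
  curl -s "$(connector_status_url "$1")"
}
```

Calling connector_status MapRDBSinkConnector1 returns a JSON document with the state of the connector and of each of its tasks.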
