Skip to content

strcrzy/flume-kinesis

Repository files navigation

flume-kinesis

Amazon Kinesis Source and Sink for Apache Flume

This was originally forked from: https://github.com/pdeyhim/flume-kinesis. This version is updated for newer version of the AWS API.

Building and installation

mvn compile assembly:single
cp target/*.jar FLUME_HOME_DIR/lib

Configuration

Check the examples under conf/ for specific examples. All values without defaults are required.

Kinesis Source Options

Name Default Description
endpoint https://kinesis.us-east-1.amazonaws.com endpoint to access kinesis
accessKeyId null AWS Access Key ID
secretAccessKey null AWS Secret Access Key
streamName null name of Kinesis stream
applicationName null name of Kinesis application
initialPosition TRIM_HORIZON strategy to set the initial iterator position

Kinesis Sink Options

Name Default Description
endpoint https://kinesis.us-east-1.amazonaws.com endpoint to access kinesis
accessKeyId null AWS Access Key ID
secretAccessKey null AWS Secret Access Key
streamName null name of Kinesis stream
numberOfPartitions 1 number of Kinesis partitions. Set this much higher than actual number of shards to get better uniforimity when sinking across shards.
batchSize 100 max number of events to send per API call to Kinesis. Must be between 1 and 500.
maxAttempts 100 max number of times to attempt to send events. After this the batch will be considered failed. Must be >= 1.
rollbackAfterMaxAttempts false whether to roll back the flume transaction if events cannot be sent after max attempts

About

Flume source and sink for Amazon Kinesis

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages