StreamBase Library

The streambase library is an enabler for event-driven processing and provides a layer of abstraction over messaging brokers such as Kafka while preserving portability. It allows a service to bootstrap a stream processing application using the Kafka Streams API. It provides device messaging capability to services over the MQTT protocol, including the ability to switch between the Paho MQTT client and the HiveMQ client, and supports TLS for secure communication with the MQTT broker. It offers event storage via MongoDB, retry capabilities via Redis, health monitoring of the topics and partitions in the Kafka cluster, and Prometheus integration for publishing metrics.

It exposes a LauncherProvider interface to services, which in turn provides the capability to create a stream processing application. The Launcher class, in coordination with the StreamProcessorDiscoveryService, creates a topology as a chain of StreamProcessor instances in the Kafka Streams application.


It provides the following capabilities to streaming services:

  1. Implements a discovery service, StreamProcessorDiscoveryService, to create the chain of processors (preprocessors and postprocessors along with service processors) in the stream.
  2. Implements a KafkaTopicsHealthMonitor to monitor the health of the topics.
  3. Implements a BackdoorKafkaConsumer to create customized consumers along with callback capabilities.
  4. Utilizes both in-memory and RocksDB state stores depending upon the configuration, and provides metrics reporting for the RocksDB state store.
  5. Manages the state of the topics, partitions, and offsets in the database.
  6. Provides TLS capabilities by integrating with Vault.
  7. Integrates with both the Paho MQTT client and the HiveMQ MQTT client, allows dynamic switching between the two with zero scheduled downtime, and uses MqttDispatcher to dispatch messages to the MQTT client.


Getting Started

To build the project locally after it has been forked/cloned, run:

mvn clean install

from the command line interface.

Prerequisites

  1. Maven
  2. Java 17
  3. Redis
  4. MongoDB instance
  5. HiveMQ instance

Installation

How to set up maven

Install Java

How to Install Redis

How to Install MongoDB

How to Install HiveMQ

Running the tests

mvn test

Or run a specific test

mvn test -Dtest="TheFirstUnitTest"

To run a method from within a test

mvn test -Dtest="TheSecondUnitTest#whenTestCase2_thenPrintTest2_1"

Deployment

The streambase project serves as a library for services. It is not meant to be deployed as a standalone service in any cloud environment.

Usage

Add the following dependency to the target project:

<dependency>
  <groupId>org.eclipse.ecsp</groupId>
  <artifactId>streambase</artifactId>
  <version>1.X.X</version>
</dependency>

Creating a Stream Processor

To create a Kafka stream processor, the service needs to create an object of type StreamProcessor and specify the Serdes. Two sets of Serdes are required: one for the incoming key and value types and the other for the outgoing key and value types.

Ex:

/**
 * This interface needs to be implemented by all Ignite-Auto Stream Processors
 * henceforth. The signature is IgniteKey and IgniteEvent which will be common
 * across the stream processors in Ignite systems. Also, this will be key for
 * processor chaining.
 *
 */
public interface IgniteEventStreamProcessor extends StreamProcessor<IgniteKey<?>, IgniteEvent, IgniteKey<?>, IgniteEvent> {

}

The IgniteEventStreamProcessor interface needs to be implemented by all Ignite-Auto Stream Processors.
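
As a rough illustration of an implementation, consider the sketch below. It is not taken verbatim from the library: the callback names (name, init, process, close), the StreamProcessingContext type parameters, and the forward call are assumptions modeled on typical streambase processors, so verify them against the StreamProcessor interface of the version in use.

// Hypothetical sketch; method names and types are assumptions to be checked
// against the StreamProcessor interface in your streambase version. The
// org.eclipse.ecsp imports for IgniteKey, IgniteEvent, and
// StreamProcessingContext are omitted here.
import org.apache.kafka.streams.processor.api.Record;

public class EchoServiceProcessor implements IgniteEventStreamProcessor {

    private StreamProcessingContext<IgniteKey<?>, IgniteEvent> ctx;

    @Override
    public String name() {
        // Unique name under which this processor appears in the topology.
        return "echo-service-processor";
    }

    @Override
    public void init(StreamProcessingContext<IgniteKey<?>, IgniteEvent> spc) {
        this.ctx = spc;
    }

    @Override
    public void process(Record<IgniteKey<?>, IgniteEvent> kafkaRecord) {
        // A real processor would transform or enrich the event here before
        // forwarding it to the next processor in the chain.
        ctx.forward(kafkaRecord);
    }

    @Override
    public void close() {
        // Release any resources held by this processor.
    }
}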

These stream processors can be classified into the following categories:

  • Preprocessor
  • Service Processor
  • Postprocessor

As highlighted above, the discovery service is responsible for creating the Kafka topology with the above set of processors. The service processor(s) will be sandwiched between the preprocessors and postprocessors by the discovery service.

discovery.impl.class.fqn=org.eclipse.ecsp.analytics.stream.base.discovery.PropBasedDiscoveryServiceImpl
launcher.impl.class.fqn=org.eclipse.ecsp.analytics.stream.base.KafkaStreamsLauncher
pre.processors=org.eclipse.ecsp.analytics.stream.base.processors.TaskContextInitializer,org.eclipse.ecsp.analytics.stream.base.processors.ProtocolTranslatorPreProcessor,org.eclipse.ecsp.digitalkey.sp.processor.VehicleStatePreProcessor,org.eclipse.ecsp.platform.dff.agent.processors.DFFAgentPreProcessor,org.eclipse.ecsp.analytics.stream.base.processors.DeviceMessagingAgentPreProcessor
service.stream.processors=org.eclipse.ecsp.digitalkey.sp.DigitalKeyMessageProcessor
post.processors=org.eclipse.ecsp.digitalkey.sp.processor.IgniteEventWithSignaturePostProcessor,org.eclipse.ecsp.analytics.stream.base.processors.SchedulerAgentPostProcessor,org.eclipse.ecsp.analytics.stream.base.processors.DeviceMessagingAgentPostProcessor,org.eclipse.ecsp.platform.dff.agent.processors.DFFAgentPostProcessor

Here, we have specified PropBasedDiscoveryServiceImpl as the discovery service because we want to create the topology from the specified properties. We have also used KafkaStreamsLauncher as the launcher class, which the discovery service utilizes to create the topology.

The source topics and the sink topics are provided by the application with these properties:

source.topic.name=spaak,activation,dk-termination-response
sink.topic.name=https-integ-high

All the processors (pre, service, post) will be subscribed to the source topics and will publish to the sink topics. If no sinks are specified, the last processor in the chain will publish to the sink topics.

Topics Health Monitoring

The topics whose health needs to be monitored can be specified in the properties file as shown below:

kafka.topics.file.path=/data/topics.txt

The structure of the file specifies the topic name along with the number of partitions and the replication factor as shown below:

    spaak|25|2
    activation|25|2
    dk-termination-response|25|2
    digital-key-sp-dlq|25|2
    dk_service_provisioning|25|2
    device-status-digital-key-sp|25|2
    dev-stolenvehicle|25|2
    vehicle-profile-modified-authorized-users|25|2
    vehicle-profile-modified-serial-no|25|2

NOTE: The above topics must already exist in the Kafka cluster.
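
Since the monitor only checks topics that already exist, the topics can be created up front. The standalone sketch below (plain Kafka AdminClient, not part of streambase) creates one topic with the partition count and replication factor taken from a line of the file above:

import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewTopic;

public class TopicBootstrap {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "127.0.0.1:9092");
        try (AdminClient admin = AdminClient.create(props)) {
            // Mirrors the line "spaak|25|2": 25 partitions, replication factor 2.
            NewTopic topic = new NewTopic("spaak", 25, (short) 2);
            admin.createTopics(List.of(topic)).all().get();
        }
    }
}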

State stores

The following state stores are supported by the streambase library:

  • In-memory state store
  • RocksDB state store

To specify the type of state store to be used by the service along with other state store properties:

state.dir=/tmp/kafka-streams
state.store.changelog.enabled=false
state.store.type=map

NOTE: A StreamProcessor must not provide an implementation of the createStateStore method if it wants a HashMap as its state store instead of the RocksDB store.

Customizing state store and state store type

The state store can be customized while creating an instance of StreamProcessor by overriding the createStateStore method. As noted above, a StreamProcessor must not override createStateStore if it wants the HashMap state store instead of the RocksDB store.

Example:

    @Override
    public HarmanPersistentKVStore createStateStore() {
        // Build and return the customized persistent (RocksDB-backed)
        // state store here.
    }

In-memory cache

The streambase library maintains various in-memory caches for its internal use, along with metrics for each. Some of them are:

  • RetryRecord cache.
  • RetryBucket cache.
  • Connection status cache.
  • Shoulder Tap RetryRecord cache.
  • Shoulder Tap RetryBucket cache.

Kafka configuration

To connect to Kafka, the following properties need to be specified in the properties file, along with the optional TLS properties:

#Comma separated list of kafka brokers
bootstrap.servers=127.0.0.1:9092
#Comma separated list of zookeepers
zookeeper.connect=127.0.0.1:2181

# TLS properties
kafka.ssl.enable=false
kafka.ssl.client.auth=required
kafka.client.keystore=keystore.jks
kafka.client.keystore.password=****************
kafka.client.key.password=****************
kafka.client.truststore=truststore.jks
kafka.client.truststore.password=****************
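
For orientation, the kafka.client.* keys above carry the same information as the standard Kafka client SSL settings. The sketch below shows those standard client properties; the exact mapping performed inside streambase is an assumption, not taken from its source.

import java.util.Properties;

public class KafkaTlsProps {
    // Standard Kafka client SSL settings that the custom keys above
    // presumably correspond to (assumed mapping).
    public static Properties tlsClientProps() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "127.0.0.1:9092");
        props.put("security.protocol", "SSL");
        props.put("ssl.keystore.location", "keystore.jks");
        props.put("ssl.keystore.password", "<keystore-password>");
        props.put("ssl.key.password", "<key-password>");
        props.put("ssl.truststore.location", "truststore.jks");
        props.put("ssl.truststore.password", "<truststore-password>");
        return props;
    }
}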

Vault Configuration

Vault can be configured to provide the TLS security details the service needs to connect to the Kafka broker. The following properties need to be provided:

vault.enabled=false
health.vault.monitor.enabled=false
health.vault.needs.restart.on.failure=false

MongoDB configuration

To connect to the MongoDB instance, the following properties need to be specified in the properties file:

mongodb.hosts=localhost
mongodb.port=27017
mongodb.username=admin
mongodb.password=****************
mongodb.auth.db=admin
mongodb.name=admin
mongodb.pool.max.size=200
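
As an aside, these settings can be sanity-checked outside the library with the standard MongoDB Java driver. The standalone sketch below assumes the properties above map to host, port, credentials, and the authentication database in the usual connection-string form:

import com.mongodb.client.MongoClient;
import com.mongodb.client.MongoClients;

public class MongoCheck {
    public static void main(String[] args) {
        // Connection string assembled from the properties above; the real
        // password replaces the placeholder.
        String uri = "mongodb://admin:<password>@localhost:27017/?authSource=admin";
        try (MongoClient client = MongoClients.create(uri)) {
            // Listing database names forces a round trip to the server.
            client.listDatabaseNames().forEach(System.out::println);
        }
    }
}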

Redis configuration

To connect to the Redis instance for caching capabilities, the following properties need to be specified in the properties file:

redis.address=127.0.0.1:6379
redis.sentinels=
redis.master.name=
redis.read.mode=SLAVE
redis.subscription.mode=SLAVE
redis.database=0
redis.password=
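
The property names (address, sentinels, master name, read and subscription modes) resemble a Redisson-style configuration. Purely as an illustration, and assuming single-server mode, a standalone Redisson connectivity check looks like the sketch below; this is not the streambase API itself.

import org.redisson.Redisson;
import org.redisson.api.RedissonClient;
import org.redisson.config.Config;

public class RedisCheck {
    public static void main(String[] args) {
        // Single-server mode matching redis.address=127.0.0.1:6379.
        Config config = new Config();
        config.useSingleServer().setAddress("redis://127.0.0.1:6379");
        RedissonClient client = Redisson.create(config);
        // Simple round trip: write a key and read it back.
        client.getBucket("streambase-check").set("ok");
        System.out.println(client.getBucket("streambase-check").get());
        client.shutdown();
    }
}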

MQTT configuration

To connect to the MQTT broker for publishing to the MQTT topics, the following properties need to be specified in the properties file:

mqtt.broker.url=tcp://127.0.0.1:1883
mqtt.topic.separator=/
mqtt.config.qos=1
mqtt.user.name=*******************
mqtt.user.password=dummyPass
mqtt.service.topic.name=test
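
Streambase dispatches device messages through its configured MQTT client internally, but the broker settings above can be verified independently with the Eclipse Paho client (one of the two clients the library integrates with). A standalone sketch:

import org.eclipse.paho.client.mqttv3.MqttClient;
import org.eclipse.paho.client.mqttv3.MqttConnectOptions;
import org.eclipse.paho.client.mqttv3.MqttMessage;

public class MqttCheck {
    public static void main(String[] args) throws Exception {
        MqttClient client = new MqttClient("tcp://127.0.0.1:1883",
                MqttClient.generateClientId());
        MqttConnectOptions opts = new MqttConnectOptions();
        opts.setUserName("<mqtt-user>");
        opts.setPassword("<mqtt-password>".toCharArray());
        client.connect(opts);

        // Publish one message at QoS 1 to the configured service topic.
        MqttMessage message = new MqttMessage("hello".getBytes());
        message.setQos(1);
        client.publish("test", message);

        client.disconnect();
        client.close();
    }
}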

Health configuration

Health monitoring for different components can be enabled by specifying the following properties:

health.mqtt.monitor.enabled=false
health.mongo.monitor.enabled=false
health.kafka.consumer.group.monitor.enabled=false
health.device.status.backdoor.monitor.enabled=false
health.kafka.topics.monitor.enabled=false
health.redis.monitor.enabled=false
health.vault.monitor.enabled=false

Enabling DMA/SCHEDULER Module Configurations For StreamBase

To enable the DFF/DMA and DMA scheduler services, the following properties need to be specified in the properties file:

dma.enabled=false
scheduler.enabled=false

Built With Dependencies

  Dependency                 Purpose
  NoSQL DAO                  NoSQL DAO capabilities
  Cache Enabler              Redis caching capabilities
  Paho Client                MQTT Paho client
  Moquette Broker            Java MQTT broker
  HiveMQ MQTT Client         HiveMQ client for MQTT
  Vehicle Profile Entities   Morphia entities library for Vehicle Profile
  Kafka Streams              For creating Kafka Streams applications
  Kafka Clients              For providing Kafka client capabilities
  Kafka 2.13                 Core Kafka capabilities
  Kryo                       Binary object graph serialization framework for Java
  Zookeeper                  Enables highly reliable distributed coordination in the Kafka cluster
  Embedded Redis             Embedded Redis server for tests
  okhttp                     HTTP client library
  Mock Web Server            A scriptable web server for testing HTTP clients
  Maven                      Dependency management
  Junit                      Testing framework
  Mockito                    Test mocking framework

How to contribute

Please read CONTRIBUTING.md for details on our contribution guidelines, and the process for submitting pull requests to us.

Code of Conduct

Please read CODE_OF_CONDUCT.md for details on our code of conduct.

Authors

Kaushal Arora
Hussain Badshah

See also the list of contributors who participated in this project.

Security Contact Information

Please read SECURITY.md to raise any security related issues.

Support

Please write to us at csp@harman.com

Troubleshooting

Please read CONTRIBUTING.md for details on how to raise an issue and submit a pull request to us.

License

This project is licensed under the Apache-2.0 License - see the LICENSE.md file for details.

Announcements

All updates to this library are documented in our release notes and releases. For the versions available, see the tags on this repository.
