A generic, pluggable data ingress/egress utility built on Apache Flink, supporting ETL and Rules Engine operations.
- Ingests data from diverse sources, protocols and serialization formats
- Supports data deduplication, watermarking, event timers and windowing
- Pluggable sources and sinks for different modes of operation
- Scriptable (JS) data transformations
- Streaming SQL support for Rules Engine-like applications
- Configuration-file-based specification and operation of the entire pipeline
The figure below shows an overview of the framework.
- Based on Apache Flink
- Configuration-file-based specification of the pipeline
- Pluggable for extending capabilities
- Calcite-based SQL engine acting as a rules engine
- RocksDB-based streaming data storage for SQL queries
- Nashorn-based JS scripting for data transformations
- JSONPath-based key-value access
- Jolt-based JSON-to-JSON transformation
- Quartz-based job scheduling
- Vert.x-based API server with config-based pipeline JAR generation, user and adaptor job management, and monitoring
- Docker-based development and deployment
In addition to streaming ETL, the framework can also be used in a rules engine mode: users can submit dynamic runtime SQL queries which are run on buffered streaming data.
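For illustration, such a rule is essentially a streaming SQL query evaluated over the buffered stream. The sketch below shows what a rule object might look like; the field names (ruleId, sqlQuery, sinkExchange), the table name and the columns are assumptions for this example, not the framework's actual schema.

```sh
# Hypothetical rule object: all keys, table and column names are illustrative only.
# The SQL query is evaluated over the adaptor's buffered streaming data.
cat > rule.json <<'EOF'
{
  "ruleId": "high-temperature-alert",
  "sqlQuery": "SELECT deviceId, AVG(temperature) AS avgTemp FROM observations GROUP BY deviceId, TUMBLE(rowtime, INTERVAL '1' MINUTE) HAVING AVG(temperature) > 40",
  "sinkExchange": "alerts"
}
EOF
```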
The framework can be used in two kinds of applications:
- ETL Pipeline - The framework can be used as an ETL (extract, transform, load) pipeline. A consumer writes a specification file for the entire pipeline (source -> transform -> sink) and uses the hosted instance's API to publish the spec and operate the pipeline (a minimal spec sketch follows this list).
- Rules Engine - The framework can also execute rules on streaming data. A consumer submits a rule through the API and receives alerts.
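A minimal ETL spec sketch is shown below, assuming hypothetical key names; the actual schema is defined by the framework's spec format and the example configs shipped with it.

```sh
# Sketch of an ETL pipeline spec (source -> transform -> sink).
# Every key and value here is an illustrative assumption, not the real spec schema.
cat > spec.json <<'EOF'
{
  "name": "sensor-etl",
  "type": "ETL",
  "source": { "type": "rabbitmq", "uri": "amqp://guest:guest@rabbitmq:5672", "queue": "raw-data" },
  "transforms": [
    { "type": "js", "script": "function transform(msg) { msg.ingestedAt = Date.now(); return msg; }" },
    { "type": "jolt", "spec": [] }
  ],
  "sink": { "type": "rabbitmq", "uri": "amqp://guest:guest@rabbitmq:5672", "exchange": "clean-data" }
}
EOF
```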
The framework provides APIs to manage the lifecycle of the adaptor pipeline and to monitor it. Assuming the administrator of the framework has already provided the user with authentication credentials, the APIs of relevance to get started are:
- newAdaptor: Submit the above pipeline spec and create a new adaptor. The adaptor can be ETL or RULE based.
  POST /adaptor
  Header: {"username": "uname", "password": "password"}
  Body: spec file contents
  Content-Type: application/json
  Response: 202 (Accepted, generating JAR), 401 (Unauthorized), 400 (Bad spec file)
- getAdaptors: List all adaptors (including recently submitted ones), their running state and their ids.
  GET /adaptor
  Header: {"username": "uname", "password": "password"}
  Content-Type: application/json
  Response: 200 (List of adaptors and their running state)
- startAdaptor: Start an adaptor given its id. This also starts scheduled adaptors.
  POST /adaptor/{id}/start
  Header: {"username": "uname", "password": "password"}
  Content-Type: application/json
  Response: 200 (Success), 404 (No such adaptor), 401 (Unauthorized)
- stopAdaptor: Stop an adaptor given its id. This also stops scheduled adaptors.
  POST /adaptor/{id}/stop
  Header: {"username": "uname", "password": "password"}
  Content-Type: application/json
  Response: 200 (Success), 404 (No such adaptor), 401 (Unauthorized)
- deleteAdaptors: Delete an adaptor given its id.
  DELETE /adaptor/{id}
  Header: {"username": "uname", "password": "password"}
  Content-Type: application/json
  Response: 200 (Deleted), 404 (No such adaptor), 401 (Unauthorized)
- newRule: Create a new SQL rule for an already running RULE adaptor.
  POST /rule
  Header: {"username": "uname", "password": "password"}
  Body: Rule object
  Content-Type: application/json
  Response: 200 (Success), 404 (No such adaptor), 401 (Unauthorized)
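As a usage sketch, the rule object sketched earlier could be submitted as follows; the host and port are placeholders, and passing the credentials as individual username/password HTTP headers is an assumption about how the header object above is encoded.

```sh
# Submit a rule object (e.g. the rule.json sketched earlier) to a running RULE adaptor.
# http://localhost:8080 and the credential headers are placeholder assumptions.
curl -X POST http://localhost:8080/rule \
  -H 'username: uname' -H 'password: password' \
  -H 'Content-Type: application/json' \
  -d @rule.json
```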
On submitting the adaptor pipeline spec file, the server will generate a JAR with all the dependencies and run the pipeline according to the configurations specified.
The entire API specification can be found here.
- Build all required images: `./setup/build.sh`
- Modify `./configs/config-example.json` to create the server config, and modify `./configs/quartz.properties` to create the Quartz config.
- Modify `./setup/*/docker-compose` to pick up the correct config files.
- Bring up the local environment: `./setup/start_local_dev_env.sh`. This brings up Flink, RabbitMQ, the apiserver and a mockserver.
- Use the APIs to submit the above example config (a sketch of this step follows the list).
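Once the local environment is up, submitting and starting an adaptor might look like the sketch below; the apiserver port and credentials are assumptions that depend on the server config, and spec.json stands in for the example pipeline spec.

```sh
# Submit a pipeline spec to the local apiserver; expect 202 while the JAR is generated.
# Port 8080 and the credential headers are assumptions made for this sketch.
curl -X POST http://localhost:8080/adaptor \
  -H 'username: uname' -H 'password: password' \
  -H 'Content-Type: application/json' \
  -d @spec.json

# List adaptors to find the id and running state of the newly created one.
curl -X GET http://localhost:8080/adaptor \
  -H 'username: uname' -H 'password: password'

# Start the adaptor by id once the JAR is ready.
ADAPTOR_ID=replace-with-id-from-the-previous-call
curl -X POST "http://localhost:8080/adaptor/$ADAPTOR_ID/start" \
  -H 'username: uname' -H 'password: password'
```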