Sorcerer Explained

Table of Contents

Overview
Initialization
Task
Pipeline
Module
Execution
Persistence Layer

Overview

Sorcerer is a workflow scheduler that is designed to be easily extensible and modular.

Initialization

There are 3 steps to the initialization of the Sorcerer instance:

Process configuration files
Process class annotations
Reconciliation and registration

Configuration Files

First, Sorcerer processes all given configuration files and checks for configuration syntax correctness as well as converts them into internal objects.

Annotations

Next, Sorcerer processes relevant annotations in the java classpath (or packages if defined). It will search for annotations and then ensure that the class implements the required interface to be used by Sorcerer (i.e. Task.class or Pipeline.class).

Reconciliation

After configuration files and annotations are processed, Sorcerer eagerly attempts to reconcile all the relevant configured objects with their corresponding classes. For example, a task that is defined in the configuration files should also have a implementation in the Java packages.

For more details go to the Initialization page.

Task

Defining and implementing tasks require two steps:

Defining task in configuration files (See Task Configuration)
Implementing Task.class with @SorcererTask annotation (See Task Implementation)

For more information see the Task page.

Pipeline

Pipelines must be defined but implementing a specific instance is optional.

Defining pipeline in configuration files (See Pipeline Configuration)
[Optional] Implementing Pipeline.class with @SocererPipeline annotation (See Pipeline Implementation)

For more information see the Pipeline page.

Module

A module defines the content and context of a single instance of Sorcerer. Or more specifically, it is the configuration of the instance of Sorcerer running in a single JVM. It has the following configuration fields:

Name
Pipelines
Email
Storage
Packages

For more information see the Module page.

Execution

After Sorcerer is initialized, it schedules all the pipelines in a module.

For more information see the Execution page.

Persistence Layer

Sorcerer relies on a persistent storage layer to maintain pipeline and task states. Out of the box Sorcerer can use HDFS, MySQL or Zookeeper as its persistence layer. Additionally, custom storage layers can be implemented by implementing StatusStorage.class.

For more information see the Persistence Layer page.

API

For more information see the Javadocs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Sorcerer Explained

Overview

Initialization

Configuration Files

Annotations

Reconciliation

Task

Pipeline

Module

Execution

Persistence Layer

API

Files

README.md

Latest commit

History

README.md

File metadata and controls

Sorcerer Explained

Overview

Initialization

Configuration Files

Annotations

Reconciliation

Task

Pipeline

Module

Execution

Persistence Layer

API