This repository is for training purposes. It shows a full Spring Boot / Spring Batch integration using modular configuration to avoid bean name/type conflicts.
Each job can be launched independently using the -Dspring.batch.job.names={jobname} parameter (see the Eclipse launch configurations for other parameters).
It uses a PostgreSQL database, and H2 for tests.
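For example, a job could be launched from the command line like this (the jar and job names here are placeholders):

```
java -Dspring.batch.job.names=table2FileJob -jar spring-batch-patterns.jar
```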
This is the simplest job configuration (no real innovation here). A single step uses the reader/processor/writer pattern to read a database table and write its content "as is" to a comma-separated flat file.
Specificity: the incrementalFilename method builds a unique filename resource from a base file name and the job's unique run identifier (it must be used in conjunction with RunIdIncrementer).
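As an illustration, here is a minimal sketch of such a step in Spring Batch 5 style; the bean, table, and column names (and the Customer record) are assumptions, and the fixed output path stands in for the incrementalFilename resource:

```java
import javax.sql.DataSource;

import org.springframework.batch.core.Step;
import org.springframework.batch.core.repository.JobRepository;
import org.springframework.batch.core.step.builder.StepBuilder;
import org.springframework.batch.item.database.JdbcCursorItemReader;
import org.springframework.batch.item.database.builder.JdbcCursorItemReaderBuilder;
import org.springframework.batch.item.file.FlatFileItemWriter;
import org.springframework.batch.item.file.builder.FlatFileItemWriterBuilder;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.core.io.FileSystemResource;
import org.springframework.jdbc.core.DataClassRowMapper;
import org.springframework.transaction.PlatformTransactionManager;

@Configuration
class Table2FileSketchConfig {

    // Assumed POJO for the example.
    record Customer(long id, String name, double balance) {}

    // Reads the table rows.
    @Bean
    JdbcCursorItemReader<Customer> customerReader(DataSource dataSource) {
        return new JdbcCursorItemReaderBuilder<Customer>()
                .name("customerReader")
                .dataSource(dataSource)
                .sql("SELECT id, name, balance FROM customer")
                .rowMapper(new DataClassRowMapper<>(Customer.class))
                .build();
    }

    // Writes each row "as is" to a comma-separated file.
    @Bean
    FlatFileItemWriter<Customer> customerWriter() {
        return new FlatFileItemWriterBuilder<Customer>()
                .name("customerWriter")
                .resource(new FileSystemResource("target/customers.csv"))
                .lineAggregator(c -> c.id() + "," + c.name() + "," + c.balance())
                .build();
    }

    // One chunk-oriented step wiring reader to writer (no processor needed).
    @Bean
    Step exportStep(JobRepository jobRepository, PlatformTransactionManager txManager,
            JdbcCursorItemReader<Customer> customerReader,
            FlatFileItemWriter<Customer> customerWriter) {
        return new StepBuilder("exportStep", jobRepository)
                .<Customer, Customer>chunk(100, txManager)
                .reader(customerReader)
                .writer(customerWriter)
                .build();
    }
}
```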
Another job configuration that reads a file to fill a table, like an ETL (extract, transform, and load).
The first step (deleteStep) erases the table records before the "load" step. It uses a JdbcTasklet (from Pivotal) to execute an SQL command against the table.
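A minimal sketch of the two-step flow, using an inline tasklet in place of the actual JdbcTasklet (step, job, and table names are assumptions; loadStep is assumed to be defined elsewhere):

```java
import javax.sql.DataSource;

import org.springframework.batch.core.Job;
import org.springframework.batch.core.Step;
import org.springframework.batch.core.job.builder.JobBuilder;
import org.springframework.batch.core.repository.JobRepository;
import org.springframework.batch.core.step.builder.StepBuilder;
import org.springframework.batch.repeat.RepeatStatus;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.jdbc.core.JdbcTemplate;
import org.springframework.transaction.PlatformTransactionManager;

@Configuration
class File2TableSketchConfig {

    // Tasklet step that empties the target table before the load.
    @Bean
    Step deleteStep(JobRepository jobRepository, PlatformTransactionManager txManager,
            DataSource dataSource) {
        return new StepBuilder("deleteStep", jobRepository)
                .tasklet((contribution, chunkContext) -> {
                    new JdbcTemplate(dataSource).update("DELETE FROM transaction");
                    return RepeatStatus.FINISHED;
                }, txManager)
                .build();
    }

    // deleteStep runs first, then the chunk-oriented load step.
    @Bean
    Job file2TableJob(JobRepository jobRepository, Step deleteStep, Step loadStep) {
        return new JobBuilder("file2TableJob", jobRepository)
                .start(deleteStep)
                .next(loadStep)
                .build();
    }
}
```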
File2FileSynchroJobConfig.java
This configuration may not be very common, but it can be interesting when you want to aggregate two files that share the same key, typically a master file and a detail file (e.g., Orders and OrderLines).
This job configuration uses a MasterDetailReader class to drive a master accumulator (CustomerAccumulator) and a detail accumulator (TransactionAccumulator). These classes inherit from ItemAccumulator, a generic class used to define the shared key between master and detail objects.
This way, the complete object is filled entirely by the reader.
MasterDetailReader uses the delegate pattern to hand the actual reading off to a specialized reader (flat file, JDBC, or whatever else).
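To give an idea of the mechanism, here is a hypothetical sketch of the accumulator concept; the repository's actual ItemAccumulator API, and the Transaction type used here, may differ:

```java
// Hypothetical sketch: an accumulator exposes the shared key of an item so
// the MasterDetailReader can tell whether a detail record still belongs to
// the current master record.
abstract class ItemAccumulator<T, K> {

    // Extracts the grouping key (e.g., the customer number) from an item.
    abstract K getKey(T item);

    // Two items belong to the same group when their keys are equal.
    boolean sameGroup(T first, T second) {
        return getKey(first).equals(getKey(second));
    }
}

// Assumed detail type for the example.
record Transaction(String customerNumber, double amount) {}

// Detail-side accumulator: transactions are grouped by customer number.
class TransactionAccumulator extends ItemAccumulator<Transaction, String> {
    @Override
    String getKey(Transaction transaction) {
        return transaction.customerNumber();
    }
}
```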
File2TableSynchroJobConfig.java
This pattern is a little different from the previous one but works the same way. This time, on the reader side, the master CSV file is synchronized with a table that contains the detail data.
The MasterDetailReader, TransactionAccumulator and CustomerAccumulator classes are generic enough to be reused.
Table2FileSynchroJobConfig.java
Another variation on the previous patterns. This time, the "master" data comes from a database table and the "detail" data comes from a CSV file.
This pattern reads packets of items that share the same key and returns lists that can be summed or otherwise aggregated (this is a "read many, write one" example).
With this pattern, a grouping SQL query summarizes the transactions to compute the customer balance.
No processor is used.
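A minimal sketch of such a reader, letting the database do the grouping (table, column, and type names are assumptions):

```java
import javax.sql.DataSource;

import org.springframework.batch.item.database.JdbcCursorItemReader;
import org.springframework.batch.item.database.builder.JdbcCursorItemReaderBuilder;
import org.springframework.jdbc.core.DataClassRowMapper;

// Assumed result type for the example.
record CustomerBalance(String customerNumber, double balance) {}

class GroupingReaderSketch {

    // The database does the grouping: the step reads one pre-aggregated
    // balance row per customer ("read many, write one").
    JdbcCursorItemReader<CustomerBalance> balanceReader(DataSource dataSource) {
        return new JdbcCursorItemReaderBuilder<CustomerBalance>()
                .name("balanceReader")
                .dataSource(dataSource)
                .sql("SELECT customer_number, SUM(amount) AS balance "
                        + "FROM transaction GROUP BY customer_number")
                .rowMapper(new DataClassRowMapper<>(CustomerBalance.class))
                .build();
    }
}
```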
Another way to return a list of Transactions from the reader (similar to groupingRecordJob), but this one uses the ItemListPeekableItemReader, which relies on a strategy pattern (see BreakKeyStrategy.java) to group records sharing the same "group" key (i.e., the customer number) in an easily configurable way (to sum records, in this example).
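For illustration, a hypothetical break-key strategy might look like this; the interface shown is an assumed shape, not necessarily the repository's actual BreakKeyStrategy contract:

```java
// Assumed shape of the strategy interface, for illustration only.
interface BreakKeyStrategy<T> {
    boolean isKeyBreak(T previous, T next);
}

// Assumed record type for the example.
record Transaction(String customerNumber, double amount) {}

// A new group starts whenever the customer number changes between two
// consecutive records.
class CustomerBreakKeyStrategy implements BreakKeyStrategy<Transaction> {
    @Override
    public boolean isKeyBreak(Transaction previous, Transaction next) {
        return !previous.customerNumber().equals(next.customerNumber());
    }
}
```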
MultiFixedRecordJobConfig.java
This pattern shows how to read and write a multi-record fixed-length file (like COBOL files). The job uses a PatternMatchingCompositeLineMapper to map each line to a record type (i.e., 00 for the header, 01 for details, and 99 for the footer).
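A minimal sketch of such a line mapper; the field names and column widths are assumptions:

```java
import java.util.HashMap;
import java.util.Map;

import org.springframework.batch.item.file.mapping.FieldSetMapper;
import org.springframework.batch.item.file.mapping.PatternMatchingCompositeLineMapper;
import org.springframework.batch.item.file.transform.FixedLengthTokenizer;
import org.springframework.batch.item.file.transform.LineTokenizer;
import org.springframework.batch.item.file.transform.Range;

class MultiRecordMapperSketch {

    PatternMatchingCompositeLineMapper<Object> lineMapper(
            FieldSetMapper<Object> headerMapper,
            FieldSetMapper<Object> detailMapper,
            FieldSetMapper<Object> footerMapper) {

        // Each line starts with a 2-character record type.
        FixedLengthTokenizer headerTokenizer = new FixedLengthTokenizer();
        headerTokenizer.setNames("recordType", "fileDate");
        headerTokenizer.setColumns(new Range(1, 2), new Range(3, 10));

        FixedLengthTokenizer detailTokenizer = new FixedLengthTokenizer();
        detailTokenizer.setNames("recordType", "customerNumber", "amount");
        detailTokenizer.setColumns(new Range(1, 2), new Range(3, 12), new Range(13, 22));

        FixedLengthTokenizer footerTokenizer = new FixedLengthTokenizer();
        footerTokenizer.setNames("recordType", "recordCount");
        footerTokenizer.setColumns(new Range(1, 2), new Range(3, 10));

        // Route each line to the right tokenizer/mapper by its prefix.
        Map<String, LineTokenizer> tokenizers = new HashMap<>();
        tokenizers.put("00*", headerTokenizer);
        tokenizers.put("01*", detailTokenizer);
        tokenizers.put("99*", footerTokenizer);

        Map<String, FieldSetMapper<Object>> mappers = new HashMap<>();
        mappers.put("00*", headerMapper);
        mappers.put("01*", detailMapper);
        mappers.put("99*", footerMapper);

        PatternMatchingCompositeLineMapper<Object> mapper = new PatternMatchingCompositeLineMapper<>();
        mapper.setTokenizers(tokenizers);
        mapper.setFieldSetMappers(mappers);
        return mapper;
    }
}
```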
This pattern is a Java-configuration adaptation of the original Spring Batch parallelJob.xml configuration.
The job reads data from the same file as the Import sample, but instead of writing it out directly it goes through a staging table, and the staging table is read in a multi-threaded step (see the sketch after the list below).
This job uses:
- ItemCountListener to log the count of processed items at a specified interval.
- StagingItemWriter, which fills the BATCH_STAGING table with serialized items.
- StagingItemProcessor, which marks the input row as 'processed'.
- StagingItemReader, which reads the records not yet processed from the BATCH_STAGING table.
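As referenced above, a minimal sketch of the multi-threaded step (bean names and item types are assumptions):

```java
import org.springframework.batch.core.Step;
import org.springframework.batch.core.repository.JobRepository;
import org.springframework.batch.core.step.builder.StepBuilder;
import org.springframework.batch.item.ItemReader;
import org.springframework.batch.item.ItemWriter;
import org.springframework.core.task.SimpleAsyncTaskExecutor;
import org.springframework.transaction.PlatformTransactionManager;

class MultiThreadedStepSketch {

    // Several threads drain the staging table concurrently, so the reader
    // has to be thread-safe (which a process-indicator reader can be).
    Step stagingStep(JobRepository jobRepository, PlatformTransactionManager txManager,
            ItemReader<Object> stagingReader, ItemWriter<Object> targetWriter) {
        return new StepBuilder("stagingStep", jobRepository)
                .<Object, Object>chunk(50, txManager)
                .reader(stagingReader)
                .writer(targetWriter)
                .taskExecutor(new SimpleAsyncTaskExecutor("staging-"))
                .build();
    }
}
```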
This pattern shows how to configure a job that runs once per day and prevents it from being launched twice.
This job loads the Transaction CSV files present in a directory sequentially and inserts each line read into a Transaction table. It uses the MultiResourceItemReader class to loop over all the files found and delegate the actual file loading to a reader.
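A minimal sketch of such a reader (the resource pattern and delegate wiring are assumptions):

```java
import java.io.IOException;

import org.springframework.batch.item.file.FlatFileItemReader;
import org.springframework.batch.item.file.MultiResourceItemReader;
import org.springframework.batch.item.file.builder.MultiResourceItemReaderBuilder;
import org.springframework.core.io.Resource;
import org.springframework.core.io.support.PathMatchingResourcePatternResolver;

class MultiFileReaderSketch {

    // Loops over every CSV file found in the input folder and delegates the
    // actual line reading to a single-file reader.
    MultiResourceItemReader<Object> multiReader(FlatFileItemReader<Object> delegate)
            throws IOException {
        Resource[] files = new PathMatchingResourcePatternResolver()
                .getResources("file:input/transaction-*.csv");
        return new MultiResourceItemReaderBuilder<Object>()
                .name("multiReader")
                .resources(files)
                .delegate(delegate)
                .build();
    }
}
```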
ExtractProcessIndicatorJobConfig.java
This pattern uses a process indicator to flag processed records (unlike in the staging job, the 'processed' column is present in the table itself).
This pattern uses a MultiResourcePartitioner to create one partition per file present in a folder.
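A minimal sketch of a partitioned master step built on MultiResourcePartitioner (paths and step names are assumptions):

```java
import java.io.IOException;

import org.springframework.batch.core.Step;
import org.springframework.batch.core.partition.support.MultiResourcePartitioner;
import org.springframework.batch.core.repository.JobRepository;
import org.springframework.batch.core.step.builder.StepBuilder;
import org.springframework.core.io.Resource;
import org.springframework.core.io.support.PathMatchingResourcePatternResolver;
import org.springframework.core.task.SimpleAsyncTaskExecutor;

class PartitionedStepSketch {

    // One partition per file: each worker step execution receives the
    // resource it must process in its step execution context.
    Step masterStep(JobRepository jobRepository, Step workerStep) throws IOException {
        Resource[] files = new PathMatchingResourcePatternResolver()
                .getResources("file:input/*.csv");
        MultiResourcePartitioner partitioner = new MultiResourcePartitioner();
        partitioner.setResources(files);
        return new StepBuilder("masterStep", jobRepository)
                .partitioner("workerStep", partitioner)
                .step(workerStep)
                .taskExecutor(new SimpleAsyncTaskExecutor("partition-"))
                .build();
    }
}
```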
Imagine that you receive a different file every day from your partner, containing all the data, and that you have to update your system with the added or deleted records. That is exactly what this job does: it computes the delta between the file received on day N-1 and the file received on day N (a pattern commonly used in companies that exchange data through files).
MultiDestinationJobConfig.java
This job uses a custom Classifier to distinguish customers and a ClassifierCompositeItemWriter to route each item to the corresponding ItemWriter.
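A minimal sketch of the routing writer; the Customer record and the classification rule are assumptions:

```java
import org.springframework.batch.item.ItemWriter;
import org.springframework.batch.item.support.ClassifierCompositeItemWriter;

class RoutingWriterSketch {

    // Assumed POJO for the example.
    record Customer(String number, double balance) {}

    // Routes each customer to one of two delegate writers depending on the
    // sign of its balance.
    ClassifierCompositeItemWriter<Customer> routingWriter(
            ItemWriter<Customer> creditWriter, ItemWriter<Customer> debitWriter) {
        ClassifierCompositeItemWriter<Customer> writer = new ClassifierCompositeItemWriter<>();
        writer.setClassifier(customer ->
                customer.balance() >= 0 ? creditWriter : debitWriter);
        return writer;
    }
}
```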
MultiLinesExtractJobConfig.java
This job writes records of different types from two tables to a single file. The records from the master table are read by a JdbcCursorItemReader, and an ItemProcessor completes them with the records from the detail table. A custom ItemWriter is responsible for writing the aggregated data.
This job loads records of different types from the same file into their respective tables.
It uses a PatternMatchingCompositeLineMapper to map each line to a record type, and a ClassifierCompositeItemWriter to choose which table each record is inserted into.
This job uses the revisited Spring Batch 5.x RemoveSpringBatchHistoryTasklet to remove old entries from the Spring Batch metadata tables.
RemoveOneJobHistoryJobConfig.java
This job is a variation of the previous one. It uses the RemoveOneJobHistoryTasklet to remove all the metadata table entries for a given job name.
The purpose of this job is to update a table from file data. It involves a RejectFileSkipListener to write rejected data to a file.
This job illustrates the use of the SQL upsert command in a batch.
Note: this command is not available in all database engines, or its syntax differs. For example, it is available in PostgreSQL but not in H2.
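A minimal sketch of a writer using the PostgreSQL upsert syntax (table, column, and type names are assumptions):

```java
import javax.sql.DataSource;

import org.springframework.batch.item.database.JdbcBatchItemWriter;
import org.springframework.batch.item.database.builder.JdbcBatchItemWriterBuilder;
import org.springframework.jdbc.core.namedparam.MapSqlParameterSource;

class UpsertWriterSketch {

    // Assumed item type for the example.
    record CustomerBalance(String id, double balance) {}

    // PostgreSQL upsert: insert the row, or update the balance when the
    // primary key already exists. This syntax is not portable to H2.
    JdbcBatchItemWriter<CustomerBalance> upsertWriter(DataSource dataSource) {
        return new JdbcBatchItemWriterBuilder<CustomerBalance>()
                .dataSource(dataSource)
                .sql("INSERT INTO customer (id, balance) VALUES (:id, :balance) "
                        + "ON CONFLICT (id) DO UPDATE SET balance = EXCLUDED.balance")
                .itemSqlParameterSourceProvider(item -> new MapSqlParameterSource()
                        .addValue("id", item.id())
                        .addValue("balance", item.balance()))
                .build();
    }
}
```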
All the classes below have a JUnit test that shows how they work.
- RemoveSpringBatchHistoryTasklet.java: a revisited Spring Batch 5.x version of Antoine Rey's original.
- AnalyzePGTasklet.java: forces PostgreSQL to analyze the given tables after heavy loading.
- GatherStatisticTasklet.java: forces Oracle to gather statistics for the given tables after heavy loading.
- CompressTasklet.java: zips a file in a step (before sending it, for example).
- DecompressTasklet.java: unzips a file in a step (after receiving it, for example).
- RestServiceTasklet.java: useful if you have to call a REST service from your jobs.
- CopyFileTasklet.java: copies a file to another folder.
- MoveFileTasklet.java: moves a file to another folder.
- MoveFilesTasklet.java: moves multiple files to another folder.
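To show the general shape of these utility tasklets, here is a hypothetical file-copy tasklet; it is a sketch, not the repository's actual CopyFileTasklet implementation:

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

import org.springframework.batch.core.StepContribution;
import org.springframework.batch.core.scope.context.ChunkContext;
import org.springframework.batch.core.step.tasklet.Tasklet;
import org.springframework.batch.repeat.RepeatStatus;

// Hypothetical sketch of a file-copy tasklet.
public class CopyFileTaskletSketch implements Tasklet {

    private final Path source;
    private final Path targetFolder;

    public CopyFileTaskletSketch(Path source, Path targetFolder) {
        this.source = source;
        this.targetFolder = targetFolder;
    }

    @Override
    public RepeatStatus execute(StepContribution contribution, ChunkContext chunkContext)
            throws Exception {
        // Copy the file into the target folder, then end the step.
        Files.copy(source, targetFolder.resolve(source.getFileName()),
                StandardCopyOption.REPLACE_EXISTING);
        return RepeatStatus.FINISHED;
    }
}
```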
These tasklets can be used to simulate failures during batch execution.
- BipolarTasklet.java: throws an exception on odd runs.
- RandomFailTasket.java: used to mimic an unstable process.
- SlowPGQueryTasklet.java: used to mimic slow PostgreSQL queries.
- WaitingTasklet.java: used to mimic a long-running process; displays a countdown.
TO DO