2024-04-12 deprecation

This job submitter was superseded by hattivatti because:

  • We decided to stop running compute jobs with the Kubernetes executor
  • A lot of the complicated work the job submitter did with PVCs was made redundant by adopting the Fusion filesystem 🎉
  • Making Kubernetes objects in Python is quite complex, so that logic was replaced with a Helm chart
  • We needed to support multiple executors (e.g. Google Batch, SLURM, and anything else in the future)

Job submitter

jobsubmitter is part of the INTERVENE platform backend. It is a Python service that:

  • Listens for job requests, sent as Kafka messages with a JSON payload
  • Validates the JSON payload against a bundled JSON schema
  • Creates a pgsc_calc Nextflow job and associated resources to calculate polygenic risk scores
  • Monitors submitted jobs and reports state changes to the backend via Kafka (the sketch after this list illustrates the loop)
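
A minimal sketch of this consume, validate, and report loop, assuming the kafka-python and jsonschema libraries; the topic names, schema, and payload fields below are illustrative placeholders, not the service's real ones:

import json

from jsonschema import ValidationError, validate
from kafka import KafkaConsumer, KafkaProducer

# Hypothetical stand-in for the JSON schema bundled with the service
JOB_REQUEST_SCHEMA = {
    "type": "object",
    "required": ["id"],
    "properties": {"id": {"type": "string"}},
}

consumer = KafkaConsumer(
    "job-requests",  # hypothetical topic name
    bootstrap_servers="localhost:9092",
    client_id="test_id",
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
)
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda obj: json.dumps(obj).encode("utf-8"),
)

for message in consumer:
    try:
        validate(instance=message.value, schema=JOB_REQUEST_SCHEMA)
    except ValidationError as exc:
        print(f"Rejecting malformed job request: {exc.message}")
        continue
    # ... create the pgsc_calc Nextflow job and its resources here ...
    # Report the state change back to the backend (topic name is illustrative)
    producer.send("job-status", {"id": message.value["id"], "state": "SUBMITTED"})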

The Nextflow job uses the Kubernetes executor.

Each job has a sandboxed persistent volume claim (PVC) to handle inter-process (cross-pod) communication.
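
A minimal sketch of creating such a per-job PVC with the official kubernetes Python client; the naming scheme, access mode, and storage size are illustrative assumptions, not the service's actual values:

from kubernetes import client, config

config.load_kube_config()  # use config.load_incluster_config() when running inside the cluster


def create_job_pvc(workflow_id: str, namespace: str = "test") -> None:
    """Create a sandboxed PVC shared by the pods of one workflow."""
    pvc = client.V1PersistentVolumeClaim(
        metadata=client.V1ObjectMeta(name=f"pgsc-calc-{workflow_id}"),
        spec=client.V1PersistentVolumeClaimSpec(
            # ReadWriteMany so the Nextflow head pod and task pods can all mount it
            access_modes=["ReadWriteMany"],
            resources=client.V1ResourceRequirements(
                requests={"storage": "10Gi"}  # illustrative size
            ),
        ),
    )
    client.CoreV1Api().create_namespaced_persistent_volume_claim(
        namespace=namespace, body=pvc
    )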

The job submitter doesn't use Fusion yet.

Usage

To deploy on a Kubernetes cluster quickly:

$ kubectl apply -f jobsubmitter.yaml

To run locally:

$ git clone https://github.com/ebi-gdp/jobsubmitter.git
$ cd jobsubmitter
$ poetry install
$ poetry shell
$ submit_job --kafka_bootstrap_urls localhost:9092 \
    --client_id test_id \
    --namespace test \
    --output_bucket test

  • --kafka_bootstrap_urls: A URL that points to a Kafka service
  • --client_id: A human-readable label for the Kafka consumer
  • --namespace: The Kubernetes namespace to deploy the job submitter and jobs to
  • --output_bucket: The root URL of the bucket you'd like to publish results to, e.g.:
    • s3://results_bucket -> --output_bucket results_bucket
    • Individual workflow results are published to s3://results_bucket/<WORKFLOW_ID_FROM_JSON_MESSAGE>
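
To exercise a local instance, you could publish a test job request to Kafka. A minimal sketch assuming kafka-python, with a hypothetical topic name and payload shape (the real shape is defined by the JSON schema bundled with the service):

import json

from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda obj: json.dumps(obj).encode("utf-8"),
)

# "id" stands in for the workflow ID: results for this job would be
# published under s3://results_bucket/<id>
producer.send("job-requests", {"id": "test-workflow-1"})
producer.flush()  # block until the message is actually sent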