This project aims to simplify the deployment and management of batch workloads across multiple Kubernetes clusters using Kueue for job queueing and KubeStellar for multi-cluster configuration management.
This repository contains two core controllers:
- WorkloadController: watches for Kueue Workload objects and orchestrates the downsync and deployment of corresponding jobs to worker clusters managed by KubeStellar
- QuotaManagerController: monitors ClusterMetrics from each worker cluster and dynamically updates Kueue's global resource quotas as needed
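To make the QuotaManagerController's role more concrete, here is a minimal sketch in plain Go of one plausible aggregation policy: summing the allocatable resources reported by each worker cluster into a single global quota. The `ClusterMetrics` and `Quota` types, their fields, and the `AggregateQuota` function are hypothetical simplifications for illustration; they are not the project's actual types, and the real controller may compute quotas differently.

```go
package main

import "fmt"

// ClusterMetrics is a hypothetical, simplified stand-in for the per-cluster
// metrics the QuotaManagerController monitors. The real type is not shown here.
type ClusterMetrics struct {
	Cluster                string
	AllocatableCPUMilli    int64 // CPU in millicores
	AllocatableMemoryBytes int64 // memory in bytes
}

// Quota is a hypothetical aggregate that would be written back to Kueue.
type Quota struct {
	CPUMilli    int64
	MemoryBytes int64
}

// AggregateQuota sums allocatable resources across all worker clusters.
// Assumption: the global quota is the plain sum of per-cluster capacity.
func AggregateQuota(metrics []ClusterMetrics) Quota {
	var q Quota
	for _, m := range metrics {
		q.CPUMilli += m.AllocatableCPUMilli
		q.MemoryBytes += m.AllocatableMemoryBytes
	}
	return q
}

func main() {
	metrics := []ClusterMetrics{
		{Cluster: "cluster1", AllocatableCPUMilli: 4000, AllocatableMemoryBytes: 8 << 30},
		{Cluster: "cluster2", AllocatableCPUMilli: 2000, AllocatableMemoryBytes: 4 << 30},
	}
	q := AggregateQuota(metrics)
	fmt.Printf("global quota: %dm CPU, %d bytes memory\n", q.CPUMilli, q.MemoryBytes)
}
```

In this sketch, a cluster joining or leaving simply changes the input slice, so the global quota follows the set of reachable worker clusters.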
In multi-cluster Kubernetes environments, managing batch workloads and ensuring efficient resource utilization across clusters can be a complex challenge. Organizations often face issues such as resource contention, over-provisioning, and inefficient workload distribution, leading to suboptimal resource utilization and increased costs.
The kueue-ks project addresses these challenges by combining Kueue's quota management capabilities with KubeStellar's multi-cluster configuration management. Its primary goal is to enable centralized management and intelligent distribution of batch workloads across multiple clusters based on available resource quotas.
You’ll need a Kubernetes cluster to run against. You can use K3D to get a local cluster for testing.
- Check out these instructions
- Run job examples:
kubectl create -f examples/batch-job.yaml
kubectl create -f examples/pytorch-simple-job.yaml
// TODO(user): Add detailed information on how you would like others to contribute to this project
This project aims to follow the Kubernetes Operator pattern.
It uses Controllers, which provide a reconcile function responsible for synchronizing resources until the desired state is reached on the cluster.
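The reconcile idea can be illustrated with a small, dependency-free Go sketch. It is not the project's controller code and does not use the controller-runtime API; the `State` type and `Reconcile` function are hypothetical names chosen for illustration. Each pass compares the desired state with the observed state, applies the differences, and reports whether anything changed, so repeated passes converge.

```go
package main

import "fmt"

// State maps a resource name to its spec. This is a hypothetical
// simplification of cluster state, used only for illustration.
type State map[string]string

// Reconcile performs one pass: it moves observed toward desired and
// reports whether any change was applied. It is idempotent, so calling
// it again once the states match returns false.
func Reconcile(desired, observed State) (changed bool) {
	// Create or update resources that are missing or out of date.
	for name, spec := range desired {
		if cur, ok := observed[name]; !ok || cur != spec {
			observed[name] = spec
			changed = true
		}
	}
	// Remove resources that are no longer desired.
	for name := range observed {
		if _, ok := desired[name]; !ok {
			delete(observed, name)
			changed = true
		}
	}
	return changed
}

func main() {
	desired := State{"batch-job": "v2", "pytorch-simple-job": "v1"}
	observed := State{"batch-job": "v1", "stale-job": "v1"}

	// Keep reconciling until a pass makes no changes.
	for pass := 1; Reconcile(desired, observed); pass++ {
		fmt.Printf("pass %d applied changes\n", pass)
	}
	fmt.Println("converged with", len(observed), "resources")
}
```

A real controller is driven by watch events rather than a loop, but the contract is the same: a reconcile call should be safe to repeat and should leave the cluster closer to the desired state.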
- Install the CRDs into the cluster:
make install
- Run your controller (this will run in the foreground, so switch to a new terminal if you want to leave it running):
make run
NOTE: You can also run this in one step by running: make install run
If you are editing the API definitions, generate the manifests such as CRs or CRDs using:
make manifests
NOTE: Run make --help for more information on all potential make targets
More information can be found via the Kubebuilder Documentation
Copyright 2024 The KubeStellar Authors.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.