Comprehensive example task for running multislice TPU workloads with JobSet (and JobSet + Kueue) #428
Open
Labels
good first issue
Denotes an issue ready for a new contributor, according to the "help wanted" guidelines.
What would you like to be added:
Comprehensive example tasks that run training workloads on TPUs with JobSet. Examples demonstrating JobSet + Kueue integration would also be nice.
Why is this needed:
We need more comprehensive examples to reduce friction for users trying out JobSet for real training workloads. Right now we mostly have toy examples that use "sleep" containers to demonstrate different features.