kubeai 0.11.0
What's Changed
- improve caching docs by @samos123 in #295
- Update kubernetes api reference by @samos123 in #290
- Deep Chat integration by @nstogner in #294
- Add gh200 support and model by @happytreees in #300
- update README by @samos123 in #296
- Update README.md by @samos123 in #305
- add llama 3.1 70b fp8 model on 1 x gh200 by @samos123 in #302
- Llama 3.1 70b with pipeline parallelism by @samos123 in #307
- add k8s device plugin / GPU operator values file by @samos123 in #308
- Add Lambda's tutorial and video to the README's table of adopters by @cbrownstein-lambda in #309
- update vllm image for GPU and TPU to v0.6.4.post1 by @samos123 in #310
- add a generic K8s install guide by @samos123 in #312
- LoRA Adapters for vLLM & support for s3, gs, oss for pulling adapters and models (to cache) from buckets by @nstogner in #304
- Add Configure Text Generation Models guide by @samos123 in #313
New Contributors
- @happytreees made their first contribution in #300
- @cbrownstein-lambda made their first contribution in #309
Full Changelog: v0.10.0...v0.11.0