TL;DR
- Several stabilizations to development flows after the switch to uv
- Migrated CI workflows to a new OSS repo, llama-stack-ops
- Added automated rebuilds for ReadTheDocs
- Llama Stack server supports HTTPS
- Added system prompt overrides support
- Several bug fixes and improvements to documentation (check out the Kubernetes deployment guide by @terrytangyuan)
What's Changed
- Fix UBI9 image build when installing Python packages via uv by @terrytangyuan in #926
- Fix precommit check after moving to ruff by @terrytangyuan in #927
- LocalInferenceImpl update for LS 0.1 by @jeffxtang in #911
- Properly close PGVector DB connection during shutdown() by @terrytangyuan in #931
- Add issue template config with docs and Discord links by @terrytangyuan in #930
- Fix uv pip install timeout issue for PyTorch by @terrytangyuan in #929
- github: ignore non-hidden python virtual environments by @nathan-weinberg in #939
- fix: broken link in Quick Start doc by @nathan-weinberg in #943
- fix: broken "core concepts" link in docs website by @nathan-weinberg in #940
- Misc fixes by @ashwinb in #944
- fix: formatting for ollama note in Quick Start doc by @nathan-weinberg in #945
- [docs] typescript sdk readme by @yanxi0830 in #946
- Support sys_prompt behavior in inference by @ehhuang in #937
- if client.initialize fails, the example should exit by @cdoern in #954
- Add Podman instructions to Quick Start by @jwm4 in #957
- github: issue templates automatically apply relevant label by @nathan-weinberg in #956
- docs: miscellaneous small fixes by @booxter in #961
- Make a couple properties optional by @ashwinb in #963
- [docs] Make RAG example self-contained by @booxter in #962
- docs, tests: replace datasets.rst with memory_optimizations.rst by @booxter in #968
- Fix broken pgvector provider and memory leaks by @terrytangyuan in #947
- [docs] update the zero_to_hero_guide llama stack version to 0.1.0 by @kami619 in #960
- missing T in import by @cooktheryan in #974
- Fix README.md notebook links by @aakankshaduggal in #976
- docs: clarify host.docker.internal works for recent podman by @booxter in #977
- docs: add addn server guidance for Linux users in Quick Start by @nathan-weinberg in #972
- sys_prompt support in Agent by @ehhuang in #938
- chore: update PR template to reinforce changelog by @leseb in #988
- github: update PR template to use correct syntax to auto-close issues by @booxter in #989
- chore: remove unused argument by @cdoern in #987
- test: replace memory with vector_io fixture by @leseb in #984
- docs: use uv in CONTRIBUTING guide by @leseb in #970
- docs: Add license badge to README.md by @terrytangyuan in #994
- Add Kubernetes deployment guide by @terrytangyuan in #899
- Fix incorrect handling of chat completion endpoint in remote::vLLM by @terrytangyuan in #951
- ci: Add semantic PR title check by @terrytangyuan in #979
- feat: Add a new template for dell by @hardikjshah in #978
- docs: Correct typos in Zero to Hero guide by @mlecanu in #997
- fix: Update rag examples to use fresh faiss index every time by @hardikjshah in #998
- doc: getting started notebook by @ehhuang in #996
- test: fix flaky agent test by @ehhuang in #1002
- test: rm unused exception alias in pytest.raises by @leseb in #991
- fix: List providers command prints out non-existing APIs from registry. Fixes #966 by @terrytangyuan in #969
- chore: add missing ToolConfig import in groq.py by @leseb in #983
- test: remove flaky agent test by @ehhuang in #1006
- test: Split inference tests to text and vision by @terrytangyuan in #1008
- feat: Add HTTPS serving option by @ashwinb in #1000
- test: encode image data as base64 by @leseb in #1003
- fix: Ensure a better error stack trace when llama-stack is not built by @cdoern in #950
- refactor(ollama): model availability check by @leseb in #986
New Contributors
- @nathan-weinberg made their first contribution in #939
- @cdoern made their first contribution in #954
- @jwm4 made their first contribution in #957
- @booxter made their first contribution in #961
- @kami619 made their first contribution in #960
- @cooktheryan made their first contribution in #974
- @aakankshaduggal made their first contribution in #976
- @leseb made their first contribution in #988
- @mlecanu made their first contribution in #997
Full Changelog: v0.1.1...v0.1.2