v5.0.3
What's Changed
- Bring OGA under test and fix OGA server. Improve llm-prompt. by @jeremyfowers in #272
- Always move HF tokenizer encodings to the target device by @jeremyfowers in #274
- Release v5.0.3: Lemonade installer and examples, repo reorg, and lots more by @jeremyfowers in #275
- Docs, tests, and examples have been moved into turnkey (CNNs and Transformers) vs. lemonade (LLMs) directories (@jeremyfowers)
  - For example: docs/lemonade/getting_started.md instead of docs/lemonade_getting_started.md
- Track the memory utilization of any lemonade or turnkey command and plot it on a graph by setting the --memory option (@amd-pworfolk).
- Add examples and demo applications for the high-level LEAP APIs in examples/lemonade (@jeremyfowers).
- Add LEAP support for all OGA backends (@jeremyfowers).
- Extend the llm-prompt tool to make it more useful for model and framework validation (@amd-pworfolk).
- Updates and fixes to lemonade test code in llm_api.py (@jeremyfowers).
- Fix not_enough_tokens bug on oga-bench (@danielholanda).
Full Changelog: v5.0.2...v5.0.3