Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lemonade release v6.0.0: new OpenAI server, improvements, fixes #291

Merged
merged 4 commits into from
Feb 27, 2025

Conversation

jeremyfowers
Copy link
Collaborator

@jeremyfowers jeremyfowers commented Feb 27, 2025

Breaking Changes

New OpenAI-Compatible Server

The previous serve Tool has been replaced by a new standalone serving command. This new server has OpenAI API compatibility and will add Ollama compatibility in the near future.

The server can also be installed and used with no-code by running Lemonade_Server_Installer.exe, which is provided as a release asset in this and all future releases.

The server code was also moved out of tools/chat.py into its own file in tools/serve.py. We also renamed chat.py to prompt.py for clarity, since that file now only contains the prompting tool.

The LEAP name has been deprecated

In the interest of reducing naming confusion, the "LEAP API" is now simply the "high-level lemonade API".

  • Old usage: from lemonade.leap import from_pretrained
  • New usage: from lemonade.api import from_pretrained

Summary of Contributions

  • The base checkpoint for models is retrieved from the Hugging Face API at loading time (@ramkrishna2910)
  • The benchmarking tools (huggingface-bench, oga-bench, and llamacpp-bench) have been refactored to reduce code duplication and improve maintainability. They now also support a list of prompts (or prompt lengths) to be benchmarked: --prompts 128 256 512 (@amd-pworfolk)
  • The avg_accuracy stats has been renamed to average_mmlu_accuracy for clarity with respect to non-MMLU accuracy tests (@jeremyfowers), (attn @apsonawane)
  • Introduce Lemonade_Server_Installer.exe (@jeremyfowers)
  • Implement an OpenAI-compatible server and remove the old serve tool (@danielholanda)
  • Rename chat module to prompt (@jeremyfowers)
  • Improved lemonade getting started documentation and remove the "LEAP" branding (@jeremyfowers)
  • OGA 0.6.0 is the default package for CPU, CUDA, and DML (@jeremyfowers)

@jeremyfowers jeremyfowers merged commit f0b55b9 into main Feb 27, 2025
11 checks passed
@jeremyfowers jeremyfowers deleted the release_600 branch February 27, 2025 20:23
jeremyfowers added a commit that referenced this pull request Feb 27, 2025
Co-authored-by: amd-pworfolk <[email protected]>
Co-authored-by: Daniel Holanda <[email protected]>
Co-authored-by: Ramakrishnan Sivakumar <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants