Fix (llm): small fixes to LLM #1035
Conversation
Giuseppe5
commented
Sep 27, 2024
•
edited
- Improve HQO implementation
- Support for MSE with groupwise quantization
- Add the possibility to specify the group dimension (groupdim) for groupwise weight quantization
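The groupdim option described above can be illustrated with a minimal NumPy sketch. This is not Brevitas' actual implementation; the function and parameter names (`groupwise_quantize`, `group_size`, `group_dim`) are hypothetical, and the quantizer shown is a simple symmetric absmax scheme used only to demonstrate per-group scaling along a chosen dimension:

```python
import numpy as np

def groupwise_quantize(weight, group_size, group_dim=-1, bits=8):
    """Fake-quantize a weight tensor in groups along group_dim.

    Each group of `group_size` elements along `group_dim` shares one scale,
    computed from the group's absolute maximum (symmetric quantization).
    """
    qmax = 2 ** (bits - 1) - 1
    # Move the grouping dimension last so the group split is a simple reshape.
    w = np.moveaxis(weight, group_dim, -1)
    assert w.shape[-1] % group_size == 0, "dim must be divisible by group_size"
    # Expanded groupwise shape: (..., num_groups, group_size).
    groups = w.reshape(w.shape[:-1] + (-1, group_size))
    scale = np.abs(groups).max(axis=-1, keepdims=True) / qmax
    scale = np.where(scale == 0, 1.0, scale)  # guard all-zero groups
    q = np.clip(np.round(groups / scale), -qmax - 1, qmax)
    deq = (q * scale).reshape(w.shape)
    return np.moveaxis(deq, -1, group_dim)
```

With `group_dim=0` the groups run along output channels, with `group_dim=1` along input channels; the tensor's original shape is restored after dequantization either way.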
LGTM!
# - If we quantize the zero point, which will already have an expanded shape
#   matching the scale (no padding, but padding is not needed here)
# - Groupwise HQO quantization, where the weight will already have been padded and expanded
if len(x.shape) == len(self.expanded_groupwise_shape):
    return x
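The early-return guard in the diff can be sketched standalone. This is a simplified sketch, not the actual Brevitas code: the helper name `to_expanded_groupwise` and its signature are hypothetical, and it only shows the rank check that lets already-expanded tensors (a zero point derived from the scale, or a weight padded/expanded by groupwise HQO) pass through unchanged:

```python
import numpy as np

def to_expanded_groupwise(x, expanded_groupwise_shape, group_size, group_dim):
    # If x already has the expanded rank (e.g. a zero point computed from the
    # scale, or a weight already padded/expanded by groupwise HQO), pass it
    # through instead of reshaping it a second time.
    if len(x.shape) == len(expanded_groupwise_shape):
        return x
    # Otherwise split group_dim into (num_groups, group_size).
    shape = list(x.shape)
    shape[group_dim:group_dim + 1] = [shape[group_dim] // group_size, group_size]
    return x.reshape(shape)
```

Calling it twice is then idempotent, which is the property the guard is there to provide.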
Weird stuff / comments like this make me wonder if we need to re-think our implementation.
(but let's not block this release)
Agreed