[SW-207299] Recalc scales from user #774

linoybu · 2025-02-03T13:58:14Z

SW-207299
mul scale input in factor = 448/240

…omUser

linoybu · 2025-02-03T14:03:16Z

results after fix: strict-match 0.4860
results before fix: strict-match 0.4716

dudilester · 2025-02-03T15:05:25Z

...model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_fp8.py

@@ -7,7 +7,7 @@
 from vllm.model_executor.layers.quantization.compressed_tensors.schemes import (
    CompressedTensorsScheme)
 from vllm.model_executor.layers.quantization.utils.w8a8_utils import (
-    apply_fp8_linear, cutlass_fp8_supported, normalize_e4m3fn_to_e4m3fnuz,


can we move the logic to vllm_hpu_extension.ops repo? and have minimal logic in this general vllm file?

vllm/model_executor/layers/quantization/utils/w8a8_utils.py

linoybu added 2 commits February 3, 2025 15:35

add factor to input

34fa3fa

Merge remote-tracking branch 'origin/habana_main' into RecalcScalesFr…

4f231a4

…omUser

linoybu requested review from kzawora-intel, madamczykhabana, michalkuligowski, mgawarkiewicz, vivekgoe and afierka-intel as code owners February 3, 2025 13:58

remove import

a2357af

linoybu requested review from nirda7, kiazada and ulivne February 3, 2025 14:00

cr

4b2fd74

Yantom1 approved these changes Feb 3, 2025

View reviewed changes

linoybu added 2 commits February 3, 2025 17:01

cr

49d69ff

cr

69b73b6

dudilester reviewed Feb 4, 2025

View reviewed changes

change names

614c5ac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SW-207299] Recalc scales from user #774

[SW-207299] Recalc scales from user #774

linoybu commented Feb 3, 2025 •

edited by github-actions bot

Loading

linoybu commented Feb 3, 2025

dudilester Feb 3, 2025

[SW-207299] Recalc scales from user #774

Are you sure you want to change the base?

[SW-207299] Recalc scales from user #774

Conversation

linoybu commented Feb 3, 2025 • edited by github-actions bot Loading

linoybu commented Feb 3, 2025

dudilester Feb 3, 2025

Choose a reason for hiding this comment

linoybu commented Feb 3, 2025 •

edited by github-actions bot

Loading