Feat (export): dequantize during export #1083

Giuseppe5 · 2024-10-31T17:25:18Z

Reason for this PR

Considering we cache quant metadata during export, we do not need to propagate QuantTensors during export.
This will come in handy for future features, including lazy dequantization which assumes a different paradigm in inference vs export.

Changes Made in this PR

Return a normal tensor from proxies during export.
Set all return_quant_tensor to False during tracing, and restore their states immediately after.

Testing Summary

NA

Risk Highlight

This PR includes code from another work (please detail).
This PR contains API-breaking changes.
This PR depends on work in another PR (please provide links/details).
This PR introduces new dependencies (please detail).
There are coverage gaps not covered by tests.
Documentation updates required in subsequent PR.

Checklist

Code comments added to any hard-to-understand areas, if applicable.
Changes generate no new warnings.
Updated any relevant tests, if applicable.
No conflicts with destination dev branch.
I reviewed my own code changes.
Initial CI/CD passing.
1+ reviews given, and any review issues addressed and approved.
Post-review full CI/CD passing.

Giuseppe5 added 5 commits October 31, 2024 17:22

Feat (export): dequantize during export

df8c780

runtime proxy

cba513c

fix

9bf71fe

fix name

ffef28d

change detect method

d561e66

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat (export): dequantize during export #1083

Feat (export): dequantize during export #1083

Giuseppe5 commented Oct 31, 2024 •

edited

Loading

Feat (export): dequantize during export #1083

Are you sure you want to change the base?

Feat (export): dequantize during export #1083

Conversation

Giuseppe5 commented Oct 31, 2024 • edited Loading

Reason for this PR

Changes Made in this PR

Testing Summary

Risk Highlight

Checklist

Giuseppe5 commented Oct 31, 2024 •

edited

Loading