Always Cache QuantMetadata #1053

Giuseppe5 · 2024-10-14T09:21:04Z

Is your feature request related to a problem? Please describe.
During the internal call to our export functions for torch/onnx, we have multiple forward passes that are executed to perform caching of quant metadata.

Describe the solution you'd like

We have flags that use to enable/disable caching. The idea would be to always enable caching (in eval mode), and remove the need of extra forward passes.
Check if this has any meaningful impact on execution time.

Giuseppe5 added enhancement New feature or request good first issue Good for newcomers labels Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Always Cache QuantMetadata #1053

Always Cache QuantMetadata #1053

Giuseppe5 commented Oct 14, 2024

Always Cache QuantMetadata #1053

Always Cache QuantMetadata #1053

Comments

Giuseppe5 commented Oct 14, 2024