You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
During the internal call to our export functions for torch/onnx, we have multiple forward passes that are executed to perform caching of quant metadata.
Describe the solution you'd like
We have flags that use to enable/disable caching. The idea would be to always enable caching (in eval mode), and remove the need of extra forward passes.
Check if this has any meaningful impact on execution time.
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem? Please describe.
During the internal call to our export functions for torch/onnx, we have multiple forward passes that are executed to perform caching of quant metadata.
Describe the solution you'd like
We have flags that use to enable/disable caching. The idea would be to always enable caching (in eval mode), and remove the need of extra forward passes.
Check if this has any meaningful impact on execution time.
The text was updated successfully, but these errors were encountered: