[Performance] Resize operation shows huge performance drop on Windows #23430
Comments
Would you be comfortable testing perf in your environment if I shared changes on a branch with you? Is the regression relative to Linux or to a previous version?
@yuslepukhin Yes please, feel free to share these changes. The regression is relative to Linux on the same machine (tested in both WSL Ubuntu and DevContainer Debian).
Most of the CPU time is taken by
@yuslepukhin I don't get it. As you can see from the profile, the operation taking an anomalous amount of time on Windows is "/resize/Round". The execution time of "/resize/Resize" is approximately the same (~0.250 ms) on both operating systems.
I am not seeing the same results as you. This PR may improve things. The statement below appears to contradict the title of the issue.
@yuslepukhin I changed the title of the issue since it could be misleading. What I meant is that the ... Thanks for providing the link to the PR, I will test it out. How should Eigen make Windows and Linux performance comparable?
Describe the issue
Hi!
We observed a significant performance regression in the Resize operation when using CPUExecutionProvider on Windows. On Linux the behavior is as expected. In particular, when opening the profile trace we realized that it is the Round node that shows the biggest performance drop between the two operating systems.

The mean latency of the Round node over 50 runs after warmup is ~6 ms on Windows and ~0.1 ms on Linux.

The same performance drop ratio appears when disabling profiling and enabling all optimizations with GraphOptimizationLevel.ORT_ENABLE_ALL.

Model graph:
Windows profile:
Linux profile:
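The per-node means quoted above can be recomputed from the profile files on each OS. The following is a minimal sketch, assuming the profile is a JSON array of trace events in which per-node entries carry "cat": "Node", a "name", and a "dur" in microseconds; the function name and the `skip_first` parameter are illustrative and not part of ONNX Runtime.

```python
import json
from collections import defaultdict
from statistics import mean


def mean_node_latency_ms(profile_path, skip_first=0):
    """Return {event name: mean duration in ms} for node events in a profile.

    Assumes the file holds a JSON array of events and that node events
    have "cat" == "Node" with "dur" expressed in microseconds.
    skip_first drops that many leading samples per event name (warmup).
    """
    with open(profile_path, encoding="utf-8") as f:
        events = json.load(f)

    # Group durations by event name, preserving the order of appearance.
    durations = defaultdict(list)
    for event in events:
        if event.get("cat") == "Node" and "dur" in event:
            durations[event["name"]].append(event["dur"])

    result = {}
    for name, samples in durations.items():
        kept = samples[skip_first:]
        if kept:
            result[name] = mean(kept) / 1000.0  # microseconds to milliseconds
    return result
```

Running it on the Windows and Linux traces and comparing the entries whose names contain "/resize/Round" and "/resize/Resize" gives a side-by-side view of which node accounts for the gap.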
To reproduce
Here is the code used to reproduce the issue:
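The original script is not included in this copy of the issue. As a stand-in, here is a minimal sketch of the measurement described above (warmup followed by 50 timed runs); it is not the reporter's code. The helper name `mean_latency_ms`, the warmup count of 10, the model path, the input name, and the input shape in the commented usage are all assumptions.

```python
import time
from statistics import mean


def mean_latency_ms(run_once, warmup=10, runs=50):
    """Call run_once() `warmup` times untimed, then return the mean
    wall-clock latency in milliseconds over `runs` timed calls."""
    for _ in range(warmup):
        run_once()

    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        run_once()
        samples.append((time.perf_counter() - start) * 1000.0)
    return mean(samples)


# Hypothetical usage with ONNX Runtime (requires onnxruntime and numpy;
# "model.onnx", the input name "input", and the shape are placeholders):
#
# import numpy as np
# import onnxruntime as ort
#
# opts = ort.SessionOptions()
# opts.graph_optimization_level = ort.GraphOptimizationLevel.ORT_ENABLE_ALL
# sess = ort.InferenceSession(
#     "model.onnx", sess_options=opts, providers=["CPUExecutionProvider"]
# )
# x = np.random.rand(1, 3, 224, 224).astype(np.float32)
# print(mean_latency_ms(lambda: sess.run(None, {"input": x})))
```

Timing the whole session run this way only shows the end-to-end gap; attributing it to the Round node still requires the profile trace.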
Urgency
This slowdown severely impacts workloads that rely heavily on the Resize operator, particularly in image processing tasks.
Platform
Windows
OS Version
10
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.20.1
ONNX Runtime API
Python
Architecture
X64
Execution Provider
Default CPU
Execution Provider Library Version
No response
Model File
No response
Is this a quantized model?
No