big performance difference on tensorRT #85
Also, I got this error while running the code above
Hi @HireezShanPeng, the big performance difference comes from the RTX 3090 card used in the README. The M60 doesn't support 16-bit precision, so there is no performance advantage to running a 16-bit model on it. Logs on my 1080Ti, which also doesn't support 16-bit precision: More information:
Thank you @kamalkraj for your answer. Just to complete:
Regarding your bug, I am not sure I understand when it happens. Can you please provide more context?
Hi, I just tried the demo code below. In your results, [TensorRT (FP16)] is much faster than the other backends. However, the results I got are quite different: there is no such big difference between [TensorRT (FP16)] and the others (the output is attached). Do you know what happened, or how I can figure out the reason for this? Thank you.
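One way to check whether a given card is likely to benefit from FP16 at all is to look at its CUDA compute capability. The snippet below is only a rough sketch of that check, not part of this project: the helper name `has_fast_fp16` is hypothetical, and the list of pre-Volta capabilities with fast FP16 is an assumption based on NVIDIA's published hardware support matrix for TensorRT, so please verify it against the current documentation.

```python
# Rough heuristic: does this compute capability have fast native FP16?
# ASSUMPTION: the pre-Volta exceptions below follow NVIDIA's TensorRT
# hardware support matrix; check the current docs before relying on it.
FAST_FP16_PRE_VOLTA = {(5, 3), (6, 0), (6, 2)}


def has_fast_fp16(capability):
    """Return True if a (major, minor) compute capability is expected to
    run FP16 faster than FP32 (hypothetical helper, heuristic only)."""
    major, minor = capability
    # Volta (7.0) and newer architectures have fast FP16 paths.
    return major >= 7 or (major, minor) in FAST_FP16_PRE_VOLTA


# Compute capabilities of the cards mentioned in this thread.
GPUS = {
    "Tesla M60": (5, 2),    # Maxwell
    "GTX 1080 Ti": (6, 1),  # Pascal
    "RTX 3090": (8, 6),     # Ampere
}

for name, cc in GPUS.items():
    verdict = "FP16 speedup expected" if has_fast_fp16(cc) else "no FP16 speedup expected"
    print(f"{name} (sm_{cc[0]}{cc[1]}): {verdict}")
```

On a machine with PyTorch installed, the tuple for the local card can be obtained with `torch.cuda.get_device_capability()` and passed to the helper instead of the hard-coded table.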