-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'Speed' measurement possibly misleading for image classification #15
Comments
In my opinion this is a serious enough bug (if I am correct in my assumptions) to warrant putting a note on the leaderboard saying that the speed values should not be trusted until this is resolved. |
and to add one more point to this: The intention of the 'speed' column is to indicate the intended throughput, but even when measured by factoring out the disk read speeds, throughput has a non trivial dependence on batch size. |
Thank you for, as always, very thorough and thoughtful comments Dushyant! To address your individual points:
To add a few more caveats here:
|
Hi Robert Thanks for the detailed response. I can work on a patch to also report the loading + inference breakdown, and perhaps code to try different batch sizes and report the maximum speed obtained. Also, it may be helpful to list the caveats you mention here, on the website as well, along with information of what GPU the inference would run on. Best, |
If I am correct in assuming that the time calculated for evaluating the whole dataset (https://github.com/paperswithcode/torchbench/blob/master/torchbench/image_classification/utils.py#L75) is used for calculating the speed on the leaderboard, then I would like to point out several issues with this
I believe that the speed measurement should be done on a chunk of preloaded dummy data, with a note on the leaderboard saying that the actual speeds people can get in practice would depend on the rate at which the model can be fed.
The text was updated successfully, but these errors were encountered: