Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
digital-nomad-cheng authored Feb 27, 2023
1 parent fbcad34 commit 2bce4fe
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,8 +7,13 @@ that previously deployed using other framework. After several days trials and er

First, I compared the performance between the default schedule and the schedule generated by ANSOR.
You can just check `schedule/run.py` for exporting the two versions of runtime and `cpp` folder for
all the C++ inference benchmark details. In a nutshell, on my machine(), the default schedule runtime
is 27.56ms and the optimized schedule runtime is 23.97ms - almost 13% speedup.
all the C++ inference benchmark details. In a nutshell, on my machine - a 11th Gen Intel(R) Core(TM) i7-11850H @ 2.50GHz CPU,
the default schedule runtime for 1000 loops is **5324ms** and the optimized runtime for 1000 loops is **4853ms** - almost 9% speedup.

While the network forward time for 1000 loops using the [MNN](https://github.com/alibaba/MNN) inference engine is **4485ms** -
7.6% better than the optmized schedule. I would say the performance of TVM is amazing. It is on pair with high manually optimized
neural network engine while saveing AI engineers lots of time on operator tuning. How this speedup can be applied to other new hardware
still remains to be seen.

![result.jpg](https://github.com/digital-nomad-cheng/RetinaFace_TVM/blob/main/result.jpg)
## Main Contributions
Expand Down

0 comments on commit 2bce4fe

Please sign in to comment.