Multi-threading #43
Hi! In the VVenC whitepaper it is mentioned that for the multi-threading tests up to 16 threads are used for FHD sequences. Thanks in advance,
Replies: 1 comment 3 replies
Dear Nicolas,

Thanks for the inquiry. There are a few points to discuss.

First, I think you have a typo in your description. There needs to be a blank space between the argument short name and its value, i.e. "-t N".

Now to the technical aspects. Which preset did you use? Because of the different CTU size, the "faster" and "fast" presets scale much better than the three slower presets ("medium", "slow" and "slower"). We didn't test 720p, so I don't know how the scaling is there, but 8 threads should give around a 5-6x speedup for HD compared to single-threaded execution.

Further, you have an 8-core/16-thread CPU. We usually test the scaling up to the number of physical cores. It's only very recently that we can even utilize that on modern CPUs, so we don't know how well the encoder scales across virtual cores. Still, we tested the impact of overallocating threads and found it to be small (e.g. 8 threads on a process bound to 1 or 2 cores). What kind of speed reduction are you seeing? What does "slightly worse" mean?

You have a Core processor. The Core processors decrease their frequency when utilizing more threads. Combined with the speedup saturation in VVenC, that effect might play a minor role. This behavior is less pronounced on Xeon or comparable workstation/server processors. That this plays a role in your problem is only speculation, though.

If you want to try to increase the threading performance, you can always set --WaveFrontSynchro=1 (around 0.5-1% bitrate increase) or --CTUSize=64/32 (3% and more bitrate increase, depending on preset), or both, in the full-featured app. These options should only be used when just increasing the number of threads doesn't improve performance.

Best regards,
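To put a number on "slightly worse", it may help to turn measured wall-clock times into speedup and per-thread efficiency. The following is only a minimal sketch: the helper names and the timings are made up for illustration and are not VVenC measurements. With the 5-6x speedup quoted above for 8 threads, efficiency would be roughly 0.63 to 0.75.

```python
def speedup(t_single: float, t_multi: float) -> float:
    """Speedup of a multi-threaded run relative to the single-threaded run."""
    if t_single <= 0 or t_multi <= 0:
        raise ValueError("runtimes must be positive")
    return t_single / t_multi


def efficiency(t_single: float, t_multi: float, threads: int) -> float:
    """Speedup divided by thread count (1.0 would be perfect scaling)."""
    if threads < 1:
        raise ValueError("threads must be at least 1")
    return speedup(t_single, t_multi) / threads


# Hypothetical wall-clock times in seconds, keyed by thread count.
# Replace these with your own measurements for the same sequence and preset.
timings = {1: 600.0, 4: 180.0, 8: 110.0, 16: 105.0}

for threads, seconds in timings.items():
    s = speedup(timings[1], seconds)
    e = efficiency(timings[1], seconds, threads)
    print(f"{threads:2d} threads: speedup {s:.2f}x, efficiency {e:.2f}")
```

If efficiency drops sharply once the thread count exceeds the number of physical cores, that would be consistent with the saturation described above, though it would not by itself identify the cause.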