optimize(Displayserver): performance up #636

apocelipes · 2023-11-28T03:19:51Z

Replace stdio with the POSIX API, and drop fscanf.

This shows a large performance improvement in my local benchmarks.

It is about 35-40% faster per modefile than the old code.

Here are the highlights of the benchmarks:

Running ./bench_linux_amd64
Run on (8 X 1800.01 MHz CPU s)
CPU Caches:
  L1 Data 32 KiB (x4)
  L1 Instruction 32 KiB (x4)
  L2 Unified 256 KiB (x4)
  L3 Unified 6144 KiB (x1)
Load Average: 0.12, 0.19, 0.16
-------------------------------------------------------
Benchmark             Time             CPU   Iterations
-------------------------------------------------------
BenchStdIO         1712 ns         1712 ns       341076
BenchPOSIXIO        998 ns          998 ns       655856

updates #634
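For readers who want to see the shape of the change described above, here is a minimal sketch, not the PR's actual code. It assumes the file's first line has the form `<width>x<height>` (for example `1024x768`). The helper names `parse_mode` and `read_first_mode`, the 64-byte buffer, and the path passed by the caller are all hypothetical.

```c
#include <assert.h>
#include <fcntl.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <unistd.h>

/* Hypothetical helper: parse "<width>x<height>" by hand instead of with fscanf.
 * Parsing stops at the first character after the height (e.g. '\n').
 * Overflow of very long digit runs is not checked in this sketch. */
static bool parse_mode(const char *s, uint32_t *width, uint32_t *height)
{
    assert(s != NULL && width != NULL && height != NULL);

    uint32_t w = 0, h = 0;
    const char *p = s;

    /* Width: at least one decimal digit. */
    if (*p < '0' || *p > '9') return false;
    while (*p >= '0' && *p <= '9') w = w * 10 + (uint32_t)(*p++ - '0');

    /* Separator. */
    if (*p++ != 'x') return false;

    /* Height: at least one decimal digit. */
    if (*p < '0' || *p > '9') return false;
    while (*p >= '0' && *p <= '9') h = h * 10 + (uint32_t)(*p++ - '0');

    *width = w;
    *height = h;
    return true;
}

/* Hypothetical helper: read the start of a file with open/read/close
 * (no FILE* buffering layer) and parse the first mode from it. */
static bool read_first_mode(const char *path, uint32_t *width, uint32_t *height)
{
    char buf[64]; /* assumed to be large enough for the first line */

    int fd = open(path, O_RDONLY);
    if (fd < 0) return false;

    ssize_t n = read(fd, buf, sizeof(buf) - 1);
    close(fd);
    if (n <= 0) return false;

    buf[n] = '\0';
    return parse_mode(buf, width, height);
}
```

The trade-off under discussion in this thread is visible here: the hand-written parser avoids format-string interpretation and stdio buffering, at the cost of more lines and a fixed buffer size compared with a single `fscanf(file, "%ux%u", &width, &height)` call.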


CarterLi · 2023-11-28T05:59:53Z

Do you really think this PR can address any performance issues?

You are right that parsing a string manually is always faster than fscanf, but then why does fscanf exist?

Come on! 1ms = 1000000ns. Several nanoseconds are not a bottleneck, because this code is executed only once. In addition, as for #634, this code is NOT executed unless --ds-force-drm is used.

Since @flipflop133 was using X11, the real bottleneck was the xrandr stuff. If you really want to improve fastfetch, please check that.

abbycin · 2023-11-28T06:03:58Z

This PR is what happens when someone has way too much time on their hands.

CarterLi · 2023-11-28T06:15:21Z

Don't get me wrong: I like performance improvements, and I do accept changes that improve performance at the cost of code complexity, but ONLY in common code.

Closing.

apocelipes · 2023-11-28T06:19:48Z

> Do you really think this PR can address any performance issues?
>
> You are right that parsing a string manually is always faster than fscanf, but then why does fscanf exist?
>
> Come on! 1ms = 1000000ns. Several nanoseconds are not a bottleneck, because this code is executed only once. In addition, as for #634, this code is NOT executed unless --ds-force-drm is used.
>
> Since @flipflop133 was using X11, the real bottleneck was the xrandr stuff. If you really want to improve fastfetch, please check that.

Why do we use fscanf? Because it's a much simpler way to handle COMPLEX parsing. The tradeoff is performance.

Since something like "1024x768" is simple and intuitive to parse, there is no readability advantage to using fscanf, and it even costs twice as much in performance.

CarterLi · 2023-11-28T06:48:54Z

5x more lines of code and one more magic number count as complexity.

> it even costs twice as much in performance

1712ns -> 998ns can be called "twice as fast".

500 * 1000 * 1000ns + 1712ns -> 500 * 1000 * 1000ns + 998ns can't.
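The arithmetic behind this comparison can be worked through as follows. This is a sketch that takes the 500 ms total runtime as the illustrative figure used in the comment above, not as a measured value.

```latex
% Speedup of the parsing step in isolation (benchmark numbers from the PR description):
\frac{1712\,\text{ns}}{998\,\text{ns}} \approx 1.72

% Speedup of a whole run, assuming 500 ms = 5 \times 10^{8} ns of other work:
\frac{5 \times 10^{8}\,\text{ns} + 1712\,\text{ns}}{5 \times 10^{8}\,\text{ns} + 998\,\text{ns}}
  = 1 + \frac{714}{500\,000\,998}
  \approx 1.0000014
```

Under that assumption, the 714 ns saved amounts to roughly 0.00014% of the total runtime.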

CarterLi closed this Nov 28, 2023

apocelipes deleted the feat-optimize-displayserver branch November 30, 2023 04:57

CarterLi mentioned this pull request Nov 11, 2024

IO: fix an off-by-one bug #1384

Closed
