Add support for profiling benchmarks #134

mdboom · 2022-06-03T16:50:00Z

This should make it much easier to collect profiles for benchmarks in the pyperformance suite.

Implements #133.

TODO:

Tests

vstinner · 2022-06-06T08:10:44Z

Tests don't pass.

How does merge_profile_stats() work? Does it compute the average of N processes timings? Or does it accumulate time?

Is it more reliable than running a single process?

mdboom · 2022-06-06T13:14:28Z

Tests don't pass.

Noted. Wanted to get some feedback about the general concept here first.

How does merge_profile_stats() work? Does it compute the average of N processes timings? Or does it accumulate time?

Is it more reliable than running a single process?

Yes, it just accumulates timings from multiple profiling runs. It produces more accurate results, in the same way that running the benchmarks multiple times does.

vstinner · 2022-06-16T13:57:58Z

pyperf/_process_time.py

    max_rss = 0
    range_it = range(loops)
    start_time = time.perf_counter()

+    if profile_filename:
+        with tempfile.NamedTemporaryFile(suffix=".profile", delete=False) as fh:
+            temp_profile_filename = fh.name


IMO tempfile.mktemp() would be better than using NamedTemporaryFile() here.

vstinner

Would it be possible to create the temporary file in pyperf/_manager.py and pass it to worker processes? A worker is more likely to crash, and so it's harder to make sure that the temporary file is removed in case of a crash.

Maybe the profiling data could be written into a pipe by the worker, and the manager would be responsible to merge data? I don't really if pyperf uses pipes or not on Windows.

mdboom · 2022-06-16T19:17:45Z

I'm not sure I understand the request.

The temporary file only comes into play when using bench_process. In that case, the temporary filename is generated in parent process and passed to the child process being benchmarked specifically to make it easier to clean up because it could crash. (The temporary file is deleted whether the worker process succeeds or fails).

For other kinds of benchmarks, there is no temporary file involved -- each child worker process merges their results directly into the output file. That has a separate problem in that there is a race condition if multiple workers update that file at the same time, but the whole design here is to not benchmark things in parallel, so that should be ok.

This certainly could use a pipe to communicate all the profiling data to the parent process -- all platforms already use that to communicate benchmark results from the worker processes. But it would add complexity to a bunch of places since the "protocol", which right now dumps things directly into the master benchmarking results, would have to split things out into separate files for the profiling results.

vstinner · 2022-06-17T13:39:29Z

cc @corona10

vstinner · 2022-06-24T09:11:52Z

cc @pablogsal

corona10 · 2022-07-06T02:03:10Z

I will left review by this weekend cc @vstinner

corona10

Please update the following documentation.

https://github.com/psf/pyperf/blob/6eb6de7427c5ac8939a21c0c68014be1167847de/doc/cli.rst#pyperf-timeit

pyperf/tests/test_runner.py

corona10 · 2022-07-10T13:13:16Z

I am going to release the new version of pyperf in 7days after this PR is merged.
cc @vstinner

Co-authored-by: Dong-hee Na <[email protected]>

mdboom · 2022-07-14T12:22:25Z

Please update the following documentation.

https://github.com/psf/pyperf/blob/6eb6de7427c5ac8939a21c0c68014be1167847de/doc/cli.rst#pyperf-timeit

@corona10: I already did that in this PR. Is there something specific missing there that you'd like to see?

corona10

@mdboom

@corona10: I already did that in this PR. Is there something specific missing there that you'd like to see?

Oops sorry, I may miss something at that time.
LGTM, I will release the next version by this weekend.

Add support for profiling benchmarks

b53733e

mdboom closed this Jun 3, 2022

mdboom reopened this Jun 3, 2022

mdboom marked this pull request as draft June 3, 2022 16:52

Fix unit tests

d566d52

mdboom marked this pull request as ready for review June 13, 2022 20:19

Add unit tests

a960e67

mdboom force-pushed the collect-profile branch from da24d5b to a960e67 Compare June 13, 2022 20:23

vstinner reviewed Jun 16, 2022

View reviewed changes

Use mktemp instead

e2890f9

Fix tests on < py3.9

14a6e19

mdboom requested a review from vstinner June 23, 2022 13:47

corona10 self-requested a review July 4, 2022 03:06

corona10 requested changes Jul 10, 2022

View reviewed changes

pyperf/tests/test_runner.py Outdated Show resolved Hide resolved

Update pyperf/tests/test_runner.py

f27c2e9

Co-authored-by: Dong-hee Na <[email protected]>

corona10 approved these changes Jul 14, 2022

View reviewed changes

corona10 merged commit f06d3bf into psf:main Jul 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for profiling benchmarks #134

Add support for profiling benchmarks #134

mdboom commented Jun 3, 2022 •

edited

Loading

vstinner commented Jun 6, 2022

mdboom commented Jun 6, 2022

vstinner Jun 16, 2022

vstinner left a comment

mdboom commented Jun 16, 2022 •

edited

Loading

vstinner commented Jun 17, 2022

vstinner commented Jun 24, 2022

corona10 commented Jul 6, 2022

corona10 left a comment

corona10 commented Jul 10, 2022

mdboom commented Jul 14, 2022

corona10 left a comment

Add support for profiling benchmarks #134

Add support for profiling benchmarks #134

Conversation

mdboom commented Jun 3, 2022 • edited Loading

vstinner commented Jun 6, 2022

mdboom commented Jun 6, 2022

vstinner Jun 16, 2022

Choose a reason for hiding this comment

vstinner left a comment

Choose a reason for hiding this comment

mdboom commented Jun 16, 2022 • edited Loading

vstinner commented Jun 17, 2022

vstinner commented Jun 24, 2022

corona10 commented Jul 6, 2022

corona10 left a comment

Choose a reason for hiding this comment

corona10 commented Jul 10, 2022

mdboom commented Jul 14, 2022

corona10 left a comment

Choose a reason for hiding this comment

mdboom commented Jun 3, 2022 •

edited

Loading

mdboom commented Jun 16, 2022 •

edited

Loading