Fix long run time and reintroduce timing benchmark test #258

brynpickering · 2023-09-25T15:29:05Z

Fixes long memory profiling CI runtimes.

It seems that the timeout that I "fixed" by removing in #254 was actually trying to tell us something. A change in how pandas groupby works means that the current implementation on reading data became unbelievably slow. To be honest, I'm surprised that pivoting a dataframe into a nested dict was applied for "performance" reasons to begin with...

Anyway, I've got rid of the df->dict conversion. Locally, this not only makes it possible to run the timing/memory benchmark (CI was taking hours for that test!) but actually halves the time it takes to run! Will be interesting to see if this leads to real speed-ups in PAM in more practical applications.

EDIT: this update leads to slower runtimes than the previous method after @Theodore-Chatziioannou tested it with a real dataset (10-20% slower). We should fix this and the profiling dataset so that it is catching runtimes that align better with real PAM usecases.

UPDATE: memory requirements also half when making this change. The benchmark is now ~1.4GB instead of ~2.8GB.

Checklist

Any checks which are not relevant to the PR can be pre-checked by the PR creator.
All others should be checked by the reviewer(s).
You can add extra checklist items here if required by the PR.

CHANGELOG updated
Tests added to cover contribution
Documentation updated

memray has a time overhead that is difficult to gauge on CI runners

codecov-commenter · 2023-09-26T08:54:23Z

Codecov Report

Merging #258 (231274e) into main (07813c8) will increase coverage by 0.69%.
Report is 135 commits behind head on main.
The diff coverage is 86.59%.

@@            Coverage Diff             @@
##             main     #258      +/-   ##
==========================================
+ Coverage   86.84%   87.53%   +0.69%     
==========================================
  Files          49       48       -1     
  Lines        5496     5746     +250     
  Branches     1372     1436      +64     
==========================================
+ Hits         4773     5030     +257     
+ Misses        462      446      -16     
- Partials      261      270       +9

Files	Coverage Δ
pam/__init__.py	`100.00% <100.00%> (ø)`
pam/planner/od.py	`97.33% <ø> (-0.04%)`	⬇️
pam/read/__init__.py	`100.00% <ø> (ø)`
pam/report/summary.py	`93.12% <100.00%> (ø)`
pam/samplers/tour.py	`98.00% <100.00%> (+2.25%)`	⬆️
pam/scoring.py	`89.94% <100.00%> (+0.30%)`	⬆️
pam/variables.py	`100.00% <100.00%> (ø)`
pam/write/__init__.py	`100.00% <ø> (ø)`
pam/activity.py	`90.90% <50.00%> (+0.14%)`	⬆️
pam/planner/ipf.py	`98.78% <98.78%> (ø)`
... and 10 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

Locally it takes ~75 seconds. On a CI runner it seems to take just over 200 seconds. To be safe, we go for 250 seconds.

fredshone · 2023-09-26T10:15:42Z

I think Theo added this df -> dict step as a speed up N years ago. Curious to know what has happened to groupby to spoil this.

Theodore-Chatziioannou · 2023-09-26T10:22:28Z

yes, let me check with a large population... Our finding back then was the opposite: that .loc was extremely slow, and using a dictionary instead gave us a performance improvement.. But this may not be the case anymore..

brynpickering · 2023-09-26T10:35:04Z

.loc being slow seems to not be the case as of pandas v1.5.0 (see pandas-dev/pandas#23735). So it may well have been true for an old version of PAM (although the memory overhead introduced by df -> dict probably existed either way). Anyway, with new pandas the pendulum has swung the other way, so seems worth taking advantage of the benefits.

brynpickering · 2023-09-26T10:39:26Z

RE why groupby is now so slow, it could be a regression introduced by the latest version of pandas (e.g., pandas-dev/pandas#52070), but I kinda doubt it since we're not actually operating on the data at all, just adding grouped arrays to a dictionary...

brynpickering · 2023-09-26T11:21:00Z

Looking at where it gets stuck on df -> dict with pandas v2.1.1, this seems to be the associated issue: pandas-dev/pandas#55256

Theodore-Chatziioannou

tested against a large pop: loading was slightly slower to before, but not a large difference. Suggest merging and revising the indexing approach in the future.

brynpickering added 2 commits September 25, 2023 16:06

Remove hh_person_df_to_dict

8174b89

Assign seq when not present

f45da39

brynpickering requested a review from fredshone September 25, 2023 15:29

brynpickering mentioned this pull request Sep 25, 2023

v0.3.0 release #257

Merged

11 tasks

Try with more lenient timeout benchmark.

1830e45

memray has a time overhead that is difficult to gauge on CI runners

brynpickering requested a review from Theodore-Chatziioannou September 26, 2023 08:25

Split out time and memory profiling

6e009e1

brynpickering added 3 commits September 26, 2023 10:04

Fix fixture decorator

6d2c670

Loosen timing benchmark

b4866b8

Further loosen timing benchmark

61ef095

Locally it takes ~75 seconds. On a CI runner it seems to take just over 200 seconds. To be safe, we go for 250 seconds.

Theodore-Chatziioannou approved these changes Sep 27, 2023

View reviewed changes

Update changelog

231274e

brynpickering merged commit 73b9d92 into main Sep 27, 2023
12 checks passed

brynpickering mentioned this pull request Sep 28, 2023

Airspeed Velocity benchmarking #260

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix long run time and reintroduce timing benchmark test #258

Fix long run time and reintroduce timing benchmark test #258

brynpickering commented Sep 25, 2023 •

edited

Loading

codecov-commenter commented Sep 26, 2023 •

edited

Loading

fredshone commented Sep 26, 2023

Theodore-Chatziioannou commented Sep 26, 2023

brynpickering commented Sep 26, 2023

brynpickering commented Sep 26, 2023

brynpickering commented Sep 26, 2023

Theodore-Chatziioannou left a comment

Fix long run time and reintroduce timing benchmark test #258

Fix long run time and reintroduce timing benchmark test #258

Conversation

brynpickering commented Sep 25, 2023 • edited Loading

Checklist

codecov-commenter commented Sep 26, 2023 • edited Loading

Codecov Report

fredshone commented Sep 26, 2023

Theodore-Chatziioannou commented Sep 26, 2023

brynpickering commented Sep 26, 2023

brynpickering commented Sep 26, 2023

brynpickering commented Sep 26, 2023

Theodore-Chatziioannou left a comment

Choose a reason for hiding this comment

brynpickering commented Sep 25, 2023 •

edited

Loading

codecov-commenter commented Sep 26, 2023 •

edited

Loading