[ENH] sklearn 1.6.dev0 adjustments. #335

goraj · 2024-11-11T18:25:01Z

Hi all,

I'm new to the project, but I've been loosely following @adam2392 and the project for a while now.
I set up a dev environment according to DEVELOPMENT.md and ran into a few issues because sklearn 1.6.dev0 was installed. Namely, the introduction of check_sample_weight_equivalence in scikit-learn/scikit-learn@364cafe leads to test-case failures that are expected but not skipped.

What does this implement/fix? Explain your changes.

The changes skip check_sample_weight_equivalence testing for the forest implementations. They also address some unpickling issues with joblib/loky in treeple.stats.utils that came up during testing.
The changes should be backward compatible.
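
One way to keep a test suite working across scikit-learn versions is to detect whether a keyword such as `expected_failed_checks` is accepted before passing it. The sketch below is only an illustration of that idea, not the approach taken in this PR; `accepts_kwarg` and `build_check_kwargs` are hypothetical helper names.

```python
import inspect


def accepts_kwarg(func, name):
    """Return True if `func` declares a parameter called `name`."""
    return name in inspect.signature(func).parameters


def build_check_kwargs(parametrize, expected_failed_checks):
    """Hypothetical helper: forward `expected_failed_checks` only if supported.

    `parametrize` stands in for a decorator factory such as
    sklearn's parametrize_with_checks; older versions may not accept the kwarg.
    """
    if accepts_kwarg(parametrize, "expected_failed_checks"):
        return {"expected_failed_checks": expected_failed_checks}
    return {}
```

The returned dict can be unpacked into the decorator call, so the same test module runs whether or not the installed version knows the keyword.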


codecov · 2024-11-12T19:18:50Z

Codecov Report

Attention: Patch coverage is 80.00000% with 2 lines in your changes missing coverage. Please review.

Project coverage is 80.33%. Comparing base (e1c38ad) to head (f0f2a9e).

Files with missing lines	Patch %	Lines
treeple/ensemble/_honest_forest.py	50.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #335      +/-   ##
==========================================
- Coverage   80.50%   80.33%   -0.18%     
==========================================
  Files          24       24              
  Lines        2334     2339       +5     
  Branches      339      339              
==========================================
  Hits         1879     1879              
- Misses        318      322       +4     
- Partials      137      138       +1


goraj · 2024-12-09T14:27:03Z

Nothing?

adam2392

Sorry for the delay @goraj and thanks for the PR!

I left a review. I was thinking: should we revamp how we use the parametrize_with_checks function? Instead of marking sample_weight_equivalence as an ignored test in each area, let's introduce a test_common.py file under treeple/tests/ and run the "sklearn compatible" check there. Then we can consolidate the tests we want to ignore in that one file.

Kind of like here:

https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/utils/_test_common/instance_generator.py#L771

and how the expected_failed_checks kwarg of parametrize_with_checks is used to ignore tests in https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/tests/test_common.py#L118

WDYT?
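
A minimal sketch of the consolidation idea, under stated assumptions: the registry layout, the helper name `get_expected_failed_checks`, the estimator class name, and the reason string are all illustrative and not taken from this PR. It only shows the shape of a callable that maps an estimator to a `{check_name: reason}` dict.

```python
from typing import Dict

# Hypothetical registry: estimator class name -> {check name: reason}.
# The class name and the reason text are placeholders for illustration.
EXPECTED_FAILED_CHECKS: Dict[str, Dict[str, str]] = {
    "HonestForestClassifier": {
        "check_sample_weight_equivalence": (
            "placeholder reason: sample_weight equivalence not supported yet"
        ),
    },
}


def get_expected_failed_checks(estimator) -> Dict[str, str]:
    """Return the checks expected to fail for this estimator (empty if none)."""
    # Copy so callers cannot mutate the shared registry by accident.
    return dict(EXPECTED_FAILED_CHECKS.get(type(estimator).__name__, {}))
```

With scikit-learn 1.6 or newer, such a callable could be passed once in a shared test module as `@parametrize_with_checks(estimators, expected_failed_checks=get_expected_failed_checks)`, where `estimators` is a list of estimator instances.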

adam2392 · 2024-12-09T14:32:52Z

treeple/stats/utils.py

+    with parallel_config("multiprocessing"):
+        out = Parallel(n_jobs=n_jobs)(
+            delayed(_parallel_build_null_forests)(
+                y_pred_ind_arr,
+                n_estimators,
+                all_y_pred,
+                y_test,
+                seed,
+                metric,
+                **metric_kwargs,
+            )
+            for i, seed in zip(range(n_repeats), ss.spawn(n_repeats))


Why was this change made?

goraj (Dec 17, 2024): If I remember correctly, the default loky backend would segfault while unit testing the *Oblique trees.
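
For readers unfamiliar with the construct in the diff, here is a small standalone sketch of how joblib's `parallel_config` context manager selects a backend for the `Parallel` calls inside it. The helper `parallel_sqrt` is hypothetical and unrelated to treeple's code; the serial fallback for environments without joblib 1.3+ is likewise an assumption made for illustration.

```python
from math import sqrt

try:
    # parallel_config was added in joblib 1.3.
    from joblib import Parallel, delayed, parallel_config
except ImportError:
    Parallel = delayed = parallel_config = None


def parallel_sqrt(values, n_jobs=1, backend="multiprocessing"):
    """Hypothetical helper: map sqrt over values under an explicit backend."""
    if parallel_config is None:
        # Fallback when joblib >= 1.3 is unavailable: plain serial loop.
        return [sqrt(v) for v in values]
    # Parallel calls inside this block use `backend` instead of the
    # default loky backend.
    with parallel_config(backend=backend):
        return Parallel(n_jobs=n_jobs)(delayed(sqrt)(v) for v in values)
```

When calling this with `n_jobs` greater than 1 from a script, it is safest to do so under an `if __name__ == "__main__":` guard, since process-based backends may re-import the main module on some platforms.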

adam2392 · 2024-12-09T14:33:01Z

treeple/stats/utils.py

+    with parallel_config("multiprocessing"):
+        out = Parallel(n_jobs=n_jobs)(


Same as above.

goraj · 2024-12-17T14:58:54Z

Thank you.
I agree that would help quite a bit. I will update the PR accordingly, just a bit busy right now.

Jacob Gora added 2 commits November 11, 2024 18:56

sklearn 1.6.dev0 adjustments.

01e55d4

Adds sklearn<1.6 compatibility.

9686597

goraj changed the title ~~[FIX] sklearn 1.6.dev0 adjustments.~~ [WIP] sklearn 1.6.dev0 adjustments. Nov 12, 2024

Jacob Gora and others added 5 commits November 12, 2024 09:25

Additional tests covered.

0fef8d7

[pre-commit.ci] auto fixes from pre-commit.com hooks

29a34f4

for more information, see https://pre-commit.ci

Fixes OSX unpickling issue with loky/joblib.

732ca4b

Merge remote-tracking branch 'goraj/sklearn_1.6dev' into sklearn_1.6dev

8ce748a

[pre-commit.ci] auto fixes from pre-commit.com hooks

f0f2a9e

for more information, see https://pre-commit.ci

goraj changed the title ~~[WIP] sklearn 1.6.dev0 adjustments.~~ [ENH] sklearn 1.6.dev0 adjustments. Nov 12, 2024

adam2392 requested changes Dec 9, 2024
