Prefer upper bounds when resolving/backtracking #13017

notatallshaw · 2024-10-14T05:41:20Z

Fixes: #12993
Fixes: #12990
Fixes: #12430
Fixes: #13030

This PR is built on top of #12982 so that the unit tests can be expanded, either that PR can be reviewed first, or this PR can supplant that PR.

I have developed some benchmark scripts to ensure that changes to pip's resolution algorithm don't regress common real world requirements: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks.

I plan to keep building out more scenarios, you can see the current ones so far here: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/tree/main/scenarios

Upon testing this PR compared to pip 24.2 I see one small regressions and two big improvements:

Difference for scenario scenarios/problematic.toml - autogluon:
    	Success: False -> True.
    	Failure Reason: Build Failure -> None.

Difference for scenario scenarios/problematic.toml - boto3-urllib3-transient:
    	Number of packages processed: 869 -> 871

Difference for scenario scenarios/big-packages.toml - apache-airflow-all:
    	Number of requirements processed: 593 -> 592
    	Number of packages processed: 681 -> 661

The fact that autogluon can resolve is a big improvement, apache-airflow[all] gets a noticeable improvement in how many packages it has to process (and this has real time improvement, as the number of packages processed can have O(n^2) complexity) , and a scenario involving boto3 and urllib3 as transient requirements gets a small regression in having to process 2 more packages.

I am hoping to find more real world scenarios where this has a noticeable difference, but I think these results are sufficient to show this approach is a net positive.

notatallshaw · 2024-10-14T06:12:14Z

Very tentatively adding this to the 24.3 milestone on the basis of:

If a maintainer with resolver experience can look at Simplify, fix, and add unit tests for PipProvider.get_preference #12982 then this PR only adds a small amount of functional code on top: 70f4d92
This expands the unit tests in that PR to the functional code in this PR
This is backed up as not regressing against a number of scenarios
It has a real world issue it fixes

But I understand if no maintainer is available to review.

notatallshaw · 2024-10-15T00:53:51Z

Added more problematic scenarios in: https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/blob/main/scenarios/problematic.toml

And found this also fixes #12430 (which was merged into another issue, but the specific resolution the user had is now solved by this).

potiuk · 2024-10-15T01:32:56Z

I do not know pip resiolution internals - but the rules explained make sense and might improve a number of cases indeed.

notatallshaw · 2024-10-18T14:34:09Z

I took a look to see whether it made any difference to put upper bound preference above or below backtracking cause preference, and at least in the scenarios I currently have in https://github.com/notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks/blob/main/scenarios it didn't make any significant difference (there was a very slight regression of apache-airflow-beam putting it below, as it visited 1 extra package).

So I consider this good in its current position, and if I find a scenario in the future, or a user reports one, where it does make a significant difference, then it can be changed.

notatallshaw · 2024-10-20T15:46:41Z

Found a minor improvement, in acryl-datahub[all] which has over 300 total dependencies, it visited 1 less requirement, 6 less packages, and produced a slightly better solution: notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks#2 (comment)

notatallshaw added 7 commits October 13, 2024 21:52

Simplify, fix, and add unit tests for PipProvider.get_preference

75ca682

Linting fix

cfeb2c9

Simplify test setup

15d3833

Prefer upper bounded requirements

70f4d92

Update tests for get_preference

2f0a369

Update docs

f1b86b2

NEWS ENTRY

2fe0b77

psf-chronographer bot added the bot:chronographer:provided label Oct 14, 2024

notatallshaw added this to the 24.3 milestone Oct 14, 2024

notatallshaw mentioned this pull request Oct 14, 2024

Request: New Release sarugaku/resolvelib#159

Closed

notatallshaw mentioned this pull request Oct 18, 2024

Vendor resolvelib 1.1.0 #13001

Draft

notatallshaw mentioned this pull request Oct 18, 2024

Bug in pip's pinned preference on packages that have a requirement ==N.* #13030

Open

1 task

Add test for "==1.*"

a8926df

notatallshaw mentioned this pull request Oct 20, 2024

add acryl-datahub[all] to big packages list notatallshaw/Pip-Resolution-Scenarios-and-Benchmarks#2

Merged

notatallshaw added 2 commits October 21, 2024 20:16

Update docstring for get_preference

35ed6c9

Update docs for resolution

7cb360b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prefer upper bounds when resolving/backtracking #13017

Prefer upper bounds when resolving/backtracking #13017

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 15, 2024 •

edited

Loading

potiuk commented Oct 15, 2024

notatallshaw commented Oct 18, 2024 •

edited

Loading

notatallshaw commented Oct 20, 2024

Prefer upper bounds when resolving/backtracking #13017

Are you sure you want to change the base?

Prefer upper bounds when resolving/backtracking #13017

Conversation

notatallshaw commented Oct 14, 2024 • edited Loading

notatallshaw commented Oct 14, 2024 • edited Loading

notatallshaw commented Oct 15, 2024 • edited Loading

potiuk commented Oct 15, 2024

notatallshaw commented Oct 18, 2024 • edited Loading

notatallshaw commented Oct 20, 2024

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 14, 2024 •

edited

Loading

notatallshaw commented Oct 15, 2024 •

edited

Loading

notatallshaw commented Oct 18, 2024 •

edited

Loading