Upgrade typeguard, and add test that would have caught typeguard-related regression #2241

mmacpherson · 2025-01-31T22:23:40Z

This refers to issue #2235.

The proximal issue is that you can't use a JSONType parameter via the Runner API with recent python versions.

That raises an error from the vendored-in typeguard library, which performs runtime type checks. The error arises in turn from an update to CPython that requires a previously positional argument to be passed as a keyword. Typeguard has since been patched to address this.
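The failure mode can be illustrated with a small stand-alone sketch. The two functions below are hypothetical stand-ins, not the actual CPython or typeguard code; only the `recursive_guard` name is taken from the error under discussion.

```python
def evaluate_old(globalns, localns, recursive_guard):
    """Hypothetical older signature: recursive_guard may be passed positionally."""
    return ("evaluated", recursive_guard)


def evaluate_new(globalns, localns, type_params=None, *, recursive_guard):
    """Hypothetical newer signature: recursive_guard is keyword-only."""
    return ("evaluated", recursive_guard)


def call_positionally(func):
    """Call func the way an unpatched caller would; return the TypeError text, if any."""
    try:
        func({}, {}, frozenset())
        return None
    except TypeError as exc:
        return str(exc)


# The old signature accepts the positional call.
old_error = call_positionally(evaluate_old)
# With the new signature, the third positional value binds to type_params,
# so the required keyword-only argument is reported as missing.
new_error = call_positionally(evaluate_new)
# A patched caller passes the argument by keyword instead.
patched_result = evaluate_new({}, {}, recursive_guard=frozenset())

print(old_error)
print(new_error)
print(patched_result)
```

A caller that passes the argument by keyword works against both signatures, which is presumably the shape of the upstream fix.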

This PR aims to update the typeguard dependency, and to add a test that would have caught the issue when it first arose.

Adding the test is the more challenging problem. The current test runner invokes the API tests like so:

top_level_dict, run_level_dict = construct_arg_dicts_from_click_api()
runner = Runner(
    "test_flow.py", show_output=False, env=env, **top_level_dict
)
result = runner.run(**run_level_dict)

To test an additional parameter via the path that has failed here, one would need somehow to get it into the run_level_dict.

I'm proposing here to augment the run_level_dict parameters via a RUNTIME_PARAMETERS environment variable that may be set in a test.
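A minimal sketch of what such an augmentation could look like. The helper name, the JSON encoding of the environment variable, and the `max_workers` key are assumptions made for illustration; the encoding actually used in this PR is not shown here.

```python
import json
import os


def augment_run_level_dict(run_level_dict, environ=None):
    """Return a copy of run_level_dict extended with RUNTIME_PARAMETERS.

    Assumption: the variable holds a JSON object mapping parameter names
    to values.
    """
    environ = os.environ if environ is None else environ
    merged = dict(run_level_dict)
    raw = environ.get("RUNTIME_PARAMETERS")
    if raw:
        extra = json.loads(raw)
        if not isinstance(extra, dict):
            raise ValueError("RUNTIME_PARAMETERS must encode a JSON object")
        # Values from the environment take precedence over existing keys.
        merged.update(extra)
    return merged


# "max_workers" is a hypothetical pre-existing run-level option.
example = augment_run_level_dict(
    {"max_workers": 4},
    environ={"RUNTIME_PARAMETERS": '{"json_param": {"a": [1, 2]}}'},
)
print(example)
```

The merged dict could then be passed on as `runner.run(**merged)`, so the extra values reach the Runner as Python objects rather than strings.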

I ran the test via:

 PYTHONPATH=`pwd`/../../ python run_tests.py --debug --contexts dev-local --graphs single-linear-step --tests BasicParameterTest

With RUNTIME_PARAMETERS set, this now yields a test failure: the same "recursive_guard" issue under discussion.

With that in place, I patched typeguard, and that resolves the test failure.

I also ran PYTHONPATH=`pwd`/../../ python run_tests.py --debug --contexts dev-local --tests BasicParameterTest, and those passed as well. I might be wrong, but I expect GH Actions will be relied upon to test other configurations.

There may well be a cleaner way to achieve test coverage of this code path, and I'm happy to modify/iterate if so. Hopefully this at least helps identify the key issues.


mmacpherson · 2025-02-01T18:36:22Z

Refactored this, adding a RuntimeParameters test rather than modifying BasicParameters, and testing more parameter types there.

One weakness here is that these tests would all pass if the runtime parameters weren't applied. I thought a little about how to prove that they are being applied. For example, if you could detect from the test context whether we were running via the cli executor vs the api executor, then you could set a parameter differently in PARAMETERS vs RUNTIME_PARAMETERS and test that you've gotten the expected one. But I didn't find a good solution.
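The check described above might look roughly like this, assuming the test could somehow learn which executor is in use. The `executor` argument and both sentinel values are hypothetical; no such hook was identified in this PR.

```python
def expected_value(executor, cli_value, api_value):
    """Pick the value a parameter should have for the given executor."""
    if executor == "cli":
        return cli_value
    if executor == "api":
        return api_value
    raise ValueError(f"unknown executor: {executor!r}")


def runtime_parameter_applied(
    executor,
    observed,
    cli_value="from-parameters",
    api_value="from-runtime-parameters",
):
    """Return True if the observed value matches the one expected for this executor."""
    return observed == expected_value(executor, cli_value, api_value)


# Under the API executor, seeing the PARAMETERS value would mean that the
# runtime parameters were silently ignored.
print(runtime_parameter_applied("api", "from-runtime-parameters"))
print(runtime_parameter_applied("api", "from-parameters"))
```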

romain-intel · 2025-02-01T19:21:36Z

Thanks for the contribution!

I haven't looked in detail yet but a few high level comments:

- To update any vendored software (including typeguard), please update using the included vendor.py script (and change the file here: https://github.com/Netflix/metaflow/blob/master/metaflow/_vendor/vendor_any.txt to point to the version you are vendoring).
- For injecting the parameter into run_level_dict, I think a better approach may be to try json.loads on the incoming parameters and pass them as Python objects. Currently we do test for JSONType (ish), but they are all passed in as strings (iirc).
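One way to read the json.loads suggestion, as a rough sketch: decode the string values of JSON-typed parameters before they are passed on. The helper name and the way JSON-typed parameters are identified (an explicit set of names) are assumptions for illustration, not existing Metaflow test-harness code.

```python
import json


def decode_json_params(params, json_param_names):
    """Return a copy of params with JSON-typed string values decoded.

    params: mapping of parameter name to value (strings by default).
    json_param_names: names of parameters to treat as JSON (assumed known).
    """
    decoded = {}
    for name, value in params.items():
        if name in json_param_names and isinstance(value, str):
            # Pass a Python object instead of its string encoding.
            decoded[name] = json.loads(value)
        else:
            decoded[name] = value
    return decoded


example = decode_json_params(
    {"cfg": '{"a": [1, 2]}', "name": "plain-string"},
    json_param_names={"cfg"},
)
print(example)
```

Only the named parameters are decoded, so ordinary string parameters that happen to look like JSON are left untouched.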

mmacpherson · 2025-02-01T20:15:28Z

Thanks kindly @romain-intel.

Pushed a couple commits:

- Uses the standard vendor path to upgrade typeguard (and typing_extensions, on which it depends).
- Undid the RUNTIME_PARAMETERS path, and tried what I think you meant.
The way that I've done it only applies to JSONType parameters atm. If you want to test that other parameter types also work as intended when passed in to Runner.run() as python objects, we might refactor? Perhaps there's already a metaflow function that would unpack the default (string) values as encoded in MetaflowTest.PARAMETERS into the corresponding python type, that could be used?

mmacpherson added 3 commits January 31, 2025 13:10

add mechanism to test runtime arguments

faaa17e

bump typeguard dependency

c72c320

would only be used in API context, so drop cli/api switch

ed0efe2

mmacpherson force-pushed the upgrade-typeguard branch from 97c5c14 to ed0efe2 Compare January 31, 2025 22:56

refactor: leave basic params test be, and check runtime params more thoroughly

863e9b1

mmacpherson force-pushed the upgrade-typeguard branch from 8284d20 to 863e9b1 Compare February 1, 2025 18:27

spelling tweak

52528b7

update using vendor.py mechanism

57bf1fd

mmacpherson force-pushed the upgrade-typeguard branch from 3f82469 to 57bf1fd Compare February 1, 2025 19:46

respond to review comments

ffe4fdd
