Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Don't require recommissioning the DUT for every test #456

Open
cecille opened this issue Oct 3, 2024 · 5 comments
Open

[Feature] Don't require recommissioning the DUT for every test #456

cecille opened this issue Oct 3, 2024 · 5 comments

Comments

@cecille
Copy link

cecille commented Oct 3, 2024

Feature description

With the time required to factory reset the device, delay starting the commissioning, delay starting the test, and re-running the failed test, this add hours to the testing process even for small devices.

Use Cases

Commission device once, run all tests.

Test Cases

just run through multiple tests in a row

Additional Information

No response

@fabiowmm
Copy link
Contributor

fabiowmm commented Oct 14, 2024

@cecille The current logic in TH for running python tests is as follows:

  • Python Testing Suite: these are test scripts for which the script author wants TH to perform commissioning before launching the first script in the run (and only that)
  • Python Testing Suite - No commissioning: These are the ones for which TH will not try to commission (if commissioning is needed it will either be done manually by the user or done by the script itself).
  • Python Testing Suite - Old script format: For these ones TH will ask the user, before launching EVERY script in the run, whether it should try to commission the device before launching the script.

The inclusion of test scripts in each category above is determined by the value of the is_commissioning variable defined in the script code (True means type 1, False means type 2 and undefined/none means type 3).

So, except for the Old script format category, TH does not try to commission before each test.

@cecille
Copy link
Author

cecille commented Oct 29, 2024

This is a slightly different problem - I think I mean re-commissioning at each run. The TH often fails to start tests, resulting in test runs with partial failures. To re-run these tests, you need to create a new test run with only the failing tests etc. until all the tests are run. The back and forth of creating new test runs with only the failing tests is challenging, as is identifying which run holds the pass result for a given test. As a result the ATLs seem to have developed some workarounds. One such workaround is their recommendation that if you're going to use the UI that test be run one at a time, one per run, so you can just re-run the entire run and see the passing result.

This makes it easier to determine when the correct set of tests has been run, but it also means that the device needs to be recommissioned before every time. The first thing the TH seems to do before each test run is to ask you to put the device in commissioning mode.

Storing the admin kvs between runs would allow folks to run multiple test runs without re-commissioning.

@rquidute
Copy link
Contributor

rquidute commented Dec 27, 2024

@cecille , we are planning implement this feature following this way:
In the 1st test execution run, the commissioning information will get store in the Host.
From the 2nd execution on, a prompt will be displayed to the user asking if he wants to re-use the previous commissioning information: If the user chooses YES, the test will start with no need of commissioning, in case of selecting NO, the commissioning will be performed same way as it is now.

This is the new prompt to be displayed to the user:
Screenshot 2024-12-27 at 12 54 30

@cecille
Copy link
Author

cecille commented Jan 6, 2025

seem reasonable to me.

@rquidute
Copy link
Contributor

PR Merged. Waiting for feedback using real devices.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants