-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Testing farm job canceled before configured timeout #209
Comments
This is very weird. @mcattamoredhat, could you please reproduce the issue with debug logging enabled? And I agree the current log message could be better. I'll try to extend it with more information. |
So, this is a limitation of GitHub-hosted runners. From GitHub doc:
Also, see this Discussion: https://github.com/orgs/community/discussions/25700#discussioncomment-3248791 |
We can check if the execution time is greater than the timeout input and only then cancel the TF request. |
Hi @jamacku, although I've changed to sclorg/testing-farm-as-github-action v3.1.0, I still have this issue in a few tests such as https://github.com/virt-s1/rhel-edge/actions/runs/10553424096 (iot-f39-x86) |
@mcattamoredhat, I may have missed something. I'll have a look. It should work without any additional configuration from your side. |
The problem might be that the Job run for 5h 59min 56s and then it was killed by runner. But we are expecting 6h. I'll adjust the value. |
Hi @jamacku, our CI has detected some PRs failing due to this issue, despite we already updated our workflows to use sclorg/[email protected] |
Hmm, there might still be some bug on our side. |
Type of issue
Bug Report
Description
We have seen in our downstream CI several testing-farm jobs canceled after 6h 0m . Although configured
timeout
default value is 480m in action inputs.The error log message doesn't not provide any details, just the message
Request was canceled on user request.
This is an example of the issue https://github.com/virt-s1/rhel-edge/actions/runs/9963311207/job/27529080681 edge-rhel-94-x86 job is using default timeout value of 480m
API request output is https://api.testing-farm.io/v0.1/requests/ee761663-f05f-43c2-84d9-673545b0f037
pipeline.log shows some tests failing:
Nevertheless guest pre/post installation logs don't have any failing playbook tasks.
May you please provide some help?
Reproducer
No response
The text was updated successfully, but these errors were encountered: