long running (or stuck) SUT shutdown can deadlock systest_watchdog #60

Closed
hyperthunk opened this issue Sep 17, 2012 · 2 comments

@hyperthunk

This is probably a duplicate of issue #57. The root cause appears to be a SUT shutdown that hangs indefinitely. The sequence is roughly as follows:

  1. a ct_hook calls systest_watchdog:force_stop/1
  2. the watchdog calls systest_sut:stop/1
  3. the sut calls systest_cleaner:kill_wait/2
  4. the cleaner blocks

This leaves the watchdog permanently blocked, which in turn prevents any further SUTs from starting, so we end up with gen_server:call/2 timeouts everywhere (or time traps firing in subsequent test cases).

None of this is at all clear to the test author or operator, so we need to tidy it up and make the failure obvious.
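To illustrate the failure mode, here is a minimal sketch of a gen_server that blocks inside one call while a second caller queues behind it and times out. It is not systest code: the module name `blocked_server_sketch` and the `force_stop` / `start_sut` request atoms are stand-ins chosen for this example.

```erlang
-module(blocked_server_sketch).
-behaviour(gen_server).

-export([demo/0]).
-export([init/1, handle_call/3, handle_cast/2]).

init([]) ->
    {ok, nostate}.

%% Stand-in for the watchdog waiting on a shutdown that never returns:
%% the server process blocks here and cannot serve any other request.
handle_call(force_stop, _From, State) ->
    receive never_sent -> ok end,
    {reply, ok, State};
handle_call(start_sut, _From, State) ->
    {reply, ok, State}.

handle_cast(_Msg, State) ->
    {noreply, State}.

demo() ->
    %% Started without a link so that killing it later does not affect us.
    {ok, Pid} = gen_server:start(?MODULE, [], []),
    %% The first caller triggers the blocking stop and waits forever.
    spawn(fun() -> catch gen_server:call(Pid, force_stop, infinity) end),
    timer:sleep(100),
    %% The second caller queues behind the blocked request and times out.
    Result = (catch gen_server:call(Pid, start_sut, 500)),
    exit(Pid, kill),
    Result.
```

Calling `blocked_server_sketch:demo()` should return an `{'EXIT', {timeout, _}}` tuple, which is the same kind of timeout a later test would see when it asks a blocked watchdog to start a new SUT.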

@ghost ghost assigned hyperthunk Sep 17, 2012
@hyperthunk

The fix for this is going to be a configurable timeout for shutdowns. If this timeout is triggered, we will skip all further testing and require the operator to fix things before continuing.
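One possible shape for such a timeout, as a rough sketch rather than the actual systest implementation: run the stop operation in a separate monitored process and give up after a configurable number of milliseconds. The module name `shutdown_timeout_sketch`, the function `stop_with_timeout/2`, and the returned error terms are all hypothetical.

```erlang
-module(shutdown_timeout_sketch).
-export([stop_with_timeout/2]).

%% Runs StopFun in a separate, monitored process so that the caller never
%% blocks for longer than TimeoutMs milliseconds.
%% Returns ok | {error, shutdown_timeout} | {error, {shutdown_failed, Reason}}.
stop_with_timeout(StopFun, TimeoutMs) when is_function(StopFun, 0) ->
    Parent = self(),
    Ref = make_ref(),
    {Pid, MRef} = spawn_monitor(fun() ->
        StopFun(),
        Parent ! {Ref, stopped}
    end),
    receive
        {Ref, stopped} ->
            %% Worker finished; drop its pending 'DOWN' message.
            erlang:demonitor(MRef, [flush]),
            ok;
        {'DOWN', MRef, process, Pid, Reason} ->
            %% Worker crashed before it could report success.
            {error, {shutdown_failed, Reason}}
    after TimeoutMs ->
        %% Shutdown is stuck: kill the worker and report the timeout.
        exit(Pid, kill),
        erlang:demonitor(MRef, [flush]),
        %% Discard a success message that may have raced with the timeout.
        receive {Ref, stopped} -> ok after 0 -> ok end,
        {error, shutdown_timeout}
    end.
```

A caller might wrap the SUT stop call in a fun, for example `stop_with_timeout(fun() -> systest_sut:stop(Sut) end, 5000)`, where `Sut` and the 5000 ms value are placeholders, and treat `{error, shutdown_timeout}` as the signal to skip the remaining tests.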

@hyperthunk

Marking as WONTFIX, as we are going to address this via issue #75 and related issues.
