Pilz integration tests have started failing sporadically on Rolling due to exit code -11 #3188

Open
sea-bass opened this issue Jan 2, 2025 · 4 comments
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@sea-bass
Contributor

sea-bass commented Jan 2, 2025

Description

Lately, the Pilz integration tests have been flaking pretty frequently (~50% of the time) with unexpected exit codes:
https://github.com/moveit/moveit2/actions/runs/12589290424/job/35088819135?pr=3187#step:11:8621

    ======================================================================
    FAIL: test_exit_codes (pilz_industrial_motion_planner.TestOutcome.test_exit_codes)
    ----------------------------------------------------------------------
    Traceback (most recent call last):
      File "/home/runner/work/moveit2/moveit2/.work/target_ws/src/moveit2/moveit_planners/pilz_industrial_motion_planner/test/unit_tests/launch/unittest_planning_context_loaders.test.py", line 53, in test_exit_codes
        launch_testing.asserts.assertExitCodes(
      File "/opt/ros/rolling/lib/python3.12/site-packages/launch_testing/asserts/assert_exit_codes.py", line 62, in assertExitCodes
        assert info.returncode in allowable_exit_codes, 'Proc {} exited with code {}'.format(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    AssertionError: Proc unittest_planning_context_loaders-1 exited with code -11

This seems to happen only in the Rolling CI jobs, and it usually clears up after 1 or 2 retries.

Creating this issue to keep track.
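
For readers decoding the failure: a negative return code in Python's subprocess convention means the process was terminated by a signal, so -11 corresponds to signal 11, which is SIGSEGV on Linux. The sketch below assumes launch_testing reports return codes using that same convention; `describe_exit_code` is a hypothetical helper, not part of launch_testing.

```python
import signal


def describe_exit_code(returncode: int) -> str:
    """Describe a subprocess-style return code in human-readable form."""
    if returncode < 0:
        # Negative values follow the Python subprocess convention:
        # the process was terminated by signal number -returncode.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = "unknown signal"
        return f"killed by signal {-returncode} ({name})"
    return f"exited normally with status {returncode}"


print(describe_exit_code(-11))
```

If that reading is right, the flake is a crash in the test process, not an assertion failure inside the test.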

ROS Distro

Rolling

OS and version

Ubuntu 24.04

Source or binary build?

Source

If binary, which release version?

No response

If source, which branch?

No response

Which RMW are you using?

None

Steps to Reproduce

Run CI

Expected behavior

CI is green

Actual behavior

CI is not always green

Backtrace or Console output

No response

sea-bass added the bug and help wanted labels Jan 2, 2025
@GayarUsmanov
Contributor

GayarUsmanov commented Jan 8, 2025

Is there a way to reproduce the issue locally on my machine?

@sea-bass
Contributor Author

> Is there a way to reproduce the issue locally on my machine?

For these integration tests, I often rely on the stress command to reproduce failures locally: https://www.geeksforgeeks.org/linux-stress-command-with-examples/

However, issues pertaining to exit codes can be tricky and sometimes can't be reproduced. Worth a try, though.
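
One way to chase an intermittent failure like this is to rerun the test command in a loop while `stress` loads the machine in another terminal. The sketch below is a hypothetical helper, not part of MoveIt or launch_testing. The demo command is a stand-in that always fails, and it would need to be replaced with the real test invocation.

```python
import subprocess
import sys


def run_until_failure(cmd, max_runs):
    """Run cmd up to max_runs times.

    Returns (attempt, returncode) for the first failing run, or None if
    every run succeeded.
    """
    for attempt in range(1, max_runs + 1):
        result = subprocess.run(
            cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
        )
        if result.returncode != 0:
            return attempt, result.returncode
    return None


if __name__ == "__main__":
    # Stand-in command that always exits with status 3 (an assumption for
    # the demo); swap in the actual test command to hunt for the flake.
    demo_cmd = [sys.executable, "-c", "import sys; sys.exit(3)"]
    print(run_until_failure(demo_cmd, max_runs=5))
```

A negative return code in the result would indicate that the process was killed by a signal, which matches the -11 seen in CI.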

@GayarUsmanov
Contributor

Sorry, I should've been clearer. I meant to ask: how do you run the integration test on a local machine?
Once I know how to do that, I'll run it under a memory checker, e.g. Valgrind.

@sea-bass
Contributor Author

sea-bass commented Jan 12, 2025

Aha. You can do something like this with the colcon test invocation:

https://docs.ros.org/en/humble/How-To-Guides/Getting-Backtraces-in-ROS-2.html#debugging-tests-with-gdb

Edited for a better doc link. That doc also has a section on how to prefix a node in a launch file with gdb (or other tools).
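
As a rough illustration of the prefix idea, the sketch below builds a command line that runs a test binary under gdb or Valgrind. `TOOL_PREFIXES` and `wrap_command` are hypothetical names, and the binary path is an assumption, since the real location depends on the workspace layout. The gdb and Valgrind flags themselves are standard options for those tools.

```python
import shlex

# Prefix strings for two common tools. gdb runs the program in batch mode
# and prints a backtrace when it stops; Valgrind reports leaks and returns
# a non-zero exit code when it detects errors.
TOOL_PREFIXES = {
    "gdb": "gdb -batch -ex run -ex bt --args",
    "valgrind": "valgrind --leak-check=full --error-exitcode=1",
}


def wrap_command(tool, cmd):
    """Return cmd as a list with the chosen tool's prefix prepended."""
    return shlex.split(TOOL_PREFIXES[tool]) + list(cmd)


# Hypothetical path to the test executable named in the CI log.
test_binary = ["./unittest_planning_context_loaders"]
print(shlex.join(wrap_command("valgrind", test_binary)))
```

The same prefix strings could presumably be used wherever a launch file accepts a command prefix, as the linked doc describes.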
