Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ansible request for Linux core settings on docker test hosts #1817

Open
2 of 3 tasks
sxa opened this issue Jan 7, 2021 · 2 comments
Open
2 of 3 tasks

Ansible request for Linux core settings on docker test hosts #1817

sxa opened this issue Jan 7, 2021 · 2 comments

Comments

@sxa
Copy link
Member

sxa commented Jan 7, 2021

Please put the name of the software product (and affected platforms if relevant) in the title of this issue

  • Missing install
  • Bug in ansible playbook
  • Request for new playbook addition

Details: The default setting in /proc/sys/kernel/core_pattern on Ubuntu is to have it set to |/usr/share/apport/apport %p %s %c %d %P %E. This prevents processes dumping core in docker by default. A fix is to set it to core which appears to cause the behavior we want. We should probably understand the implications and any side effects of doing this before implementing it, but it gives us a viable solution that I've implemented on three machines I've been experimenting with as part of #1809 (test-ibmcloud-ubuntu1604-x64-1, test-packet-ubuntu1804-armv8-1, and the new 20.04 host I'm working with)

If added to the playbooks it will need to be done in a way that persists across reboots (what I've done on the two referenced machines will not at the moment, so it will get lost if they are restarted)

@andrew-m-leonard
Copy link
Contributor

andrew-m-leonard commented Apr 13, 2021

Looks like this may have fixed an issue I was seeing on node test-docker-fedora33-ppc64le-1 with: https://ci.adoptopenjdk.net/job/Test_openjdk11_j9_sanity.functional_ppc64le_linux/99/consoleFull
failing to produce dumps,

11:59:48  Testing: Create core file
11:59:48  Test start time: 2021/04/13 10:59:48 Coordinated Universal Time
11:59:49  Running command: "/home/jenkins/workspace/Test_openjdk11_j9_sanity.functional_ppc64le_linux/openjdkbinary/j2sdk-image/bin/java"  -Xjit -Xgcpolicy:gencon -Xnocompressedrefs  -Xmx256m -cp /home/jenkins/workspace/Test_openjdk11_j9_sanity.functional_ppc64le_linux/openjdk-tests/TKG/../../jvmtest/functional/cmdline_options_testresources/cmdlinetestresources.jar:/home/jenkins/workspace/Test_openjdk11_j9_sanity.functional_ppc64le_linux/openjdk-tests/TKG/../../jvmtest/functional/cmdLineTests/gcCheck/gcCheck.jar -Xdump:system:file=j9core.dmp,events=vmstop,request=exclusive+compact org.openj9.test.nogc.Main
11:59:49  Time spent starting: 53 milliseconds
11:59:56  Time spent executing: 6279 milliseconds
11:59:56  Test result: FAILED
11:59:56  Output from test:
11:59:56   [ERR] Error: Could not find or load main class org.openj9.test.nogc.Main
11:59:56   [ERR] Caused by: java.lang.ClassNotFoundException: org.openj9.test.nogc.Main
11:59:56   [ERR] JVMDUMP039I Processing dump event "vmstop", detail "#0000000000000001" at 2021/04/13 10:59:49 - please wait.
11:59:56   [ERR] JVMDUMP032I JVM requested System dump using '/home/jenkins/workspace/Test_openjdk11_j9_sanity.functional_ppc64le_linux/openjdk-tests/TKG/output_16183033589351/cmdLineTester_GCCheck_0/j9core.dmp' in response to an event
11:59:56   [ERR] JVMPORT030W /proc/sys/kernel/core_pattern setting "|/usr/share/apport/apport %p %s %c %d %P %E" specifies that the core dump is to be piped to an external program.  Attempting to rename either core or core.3103966.
11:59:56   [ERR] 
11:59:56   [ERR] JVMDUMP012E Error in System dump: The core file created by child process with pid = 3103966 was not found. Expected to find core file with name "/home/jenkins/workspace/Test_openjdk11_j9_sanity.functional_ppc64le_linux/openjdk-tests/TKG/output_16183033589351/cmdLineTester_GCCheck_0/core"
11:59:56   [ERR] JVMDUMP013I Processed dump event "vmstop", detail "#0000000000000001".
11:59:56  >> Success condition was not found: [Output match: System dump written]

seems to work now re-running via a Grinder: https://ci.adoptopenjdk.net/job/Grinder/48/

@sxa
Copy link
Member Author

sxa commented Apr 14, 2021

Resolved on that host (Until next reboot ...)

@sxa sxa added this to the October 2021 milestone Oct 1, 2021
@sxa sxa modified the milestones: October 2021, December 2021 Dec 1, 2021
@sxa sxa modified the milestones: December 2021, 2022-01 (January) Jan 6, 2022
@sxa sxa modified the milestones: 2022-03 (March), 2022-04 (April) Apr 11, 2022
@sxa sxa modified the milestones: 2022-04 (April), 2022-06 (June) May 24, 2022
@sxa sxa modified the milestones: 2022-06 (June), 2022-07 (July) Jun 30, 2022
@sxa sxa modified the milestones: 2023-04 (April), 2023-05 (May) May 16, 2023
@sxa sxa modified the milestones: 2023-05 (May), Backlog Jun 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Todo
Development

No branches or pull requests

3 participants