Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] CrossClustersCancellationIT testCloseSkipUnavailable failing #121627

Open
elasticsearchmachine opened this issue Feb 4, 2025 · 6 comments
Open
Assignees
Labels
:Analytics/ES|QL AKA ESQL needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

elasticsearchmachine commented Feb 4, 2025

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:esql:internalClusterTest" --tests "org.elasticsearch.xpack.esql.action.CrossClustersCancellationIT.testCloseSkipUnavailable" -Dtests.seed=6E2E64F7EB9BA2B -Dtests.locale=sk-SK -Dtests.timezone=America/Managua -Druntime.java=23

Applicable branches:
main

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

org.elasticsearch.node.NodeClosedException: node closed {cluster-a-1}{YrDL1SRPQrmnFgyk96RW3g}{n7IXpIKIRVSWTpgNHt3H6g}{cluster-a-1}{127.0.0.1}{127.0.0.1:19303}{cdfhilmrstw}{9.1.0}{8000099-9010000}

Issue Reasons:

  • [main] 7 failures in test testCloseSkipUnavailable (3.3% fail rate in 213 executions)
  • [main] 3 failures in step part3 (12.5% fail rate in 24 executions)
  • [main] 2 failures in pipeline elasticsearch-periodic-platform-support (28.6% fail rate in 7 executions)
  • [main] 3 failures in pipeline elasticsearch-intake (12.5% fail rate in 24 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Analytics/ES|QL AKA ESQL >test-failure Triaged test failures from CI labels Feb 4, 2025
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch main

Mute Reasons:

  • [main] 2 failures in test testCloseSkipUnavailable (13.3% fail rate in 15 executions)

Build Scans:

@elasticsearchmachine elasticsearchmachine added Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) needs:risk Requires assignment of a risk label (low, medium, blocker) labels Feb 4, 2025
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-analytical-engine (Team:Analytics)

fzowl pushed a commit to voyage-ai/elasticsearch that referenced this issue Feb 4, 2025
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 2 consecutive failures in step rocky-9_platform-support-unix
  • [8.x] 35 failures in test testCloseSkipUnavailable (24.8% fail rate in 141 executions)
  • [8.x] 2 failures in step part3 (13.3% fail rate in 15 executions)
  • [8.x] 5 failures in step part-3 (25.0% fail rate in 20 executions)
  • [8.x] 2 failures in step oraclelinux-8_platform-support-unix (66.7% fail rate in 3 executions)
  • [8.x] 2 failures in step rhel-7_platform-support-unix (66.7% fail rate in 3 executions)
  • [8.x] 2 failures in step rhel-9_platform-support-unix (66.7% fail rate in 3 executions)
  • [8.x] 2 failures in step oraclelinux-7_platform-support-unix (66.7% fail rate in 3 executions)
  • [8.x] 2 failures in step rocky-9_platform-support-unix (66.7% fail rate in 3 executions)
  • [8.x] 4 failures in pipeline elasticsearch-periodic-platform-support (100.0% fail rate in 4 executions)
  • [8.x] 2 failures in pipeline elasticsearch-periodic (50.0% fail rate in 4 executions)
  • [8.x] 2 failures in pipeline elasticsearch-intake (13.3% fail rate in 15 executions)
  • [8.x] 4 failures in pipeline elasticsearch-pull-request (22.2% fail rate in 18 executions)

Build Scans:

@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch 8.x

Mute Reasons:

  • [8.x] 40 failures in test testCloseSkipUnavailable (23.3% fail rate in 172 executions)
  • [8.x] 3 failures in step part3 (17.6% fail rate in 17 executions)
  • [8.x] 5 failures in step part-3 (22.7% fail rate in 22 executions)
  • [8.x] 2 failures in step rhel-7_platform-support-unix (50.0% fail rate in 4 executions)
  • [8.x] 2 failures in step oraclelinux-8_platform-support-unix (50.0% fail rate in 4 executions)
  • [8.x] 2 failures in step rhel-9_platform-support-unix (50.0% fail rate in 4 executions)
  • [8.x] 2 failures in step rocky-9_platform-support-unix (50.0% fail rate in 4 executions)
  • [8.x] 3 failures in step oraclelinux-7_platform-support-unix (75.0% fail rate in 4 executions)
  • [8.x] 2 failures in step debian-12_platform-support-unix (50.0% fail rate in 4 executions)
  • [8.x] 3 failures in pipeline elasticsearch-intake (17.6% fail rate in 17 executions)
  • [8.x] 4 failures in pipeline elasticsearch-pull-request (21.1% fail rate in 19 executions)
  • [8.x] 3 failures in pipeline elasticsearch-periodic (75.0% fail rate in 4 executions)
  • [8.x] 4 failures in pipeline elasticsearch-periodic-platform-support (100.0% fail rate in 4 executions)

Build Scans:

@smalyshev
Copy link
Contributor

The latest error is:


WARNING: Uncaught exception in thread: Thread[#693,elasticsearch[main-cluster-0][scheduler][T#1],5,TGRP-CrossClustersCancellationIT] |  
-- | --
  | java.lang.OutOfMemoryError: Java heap space

which I am not entirely sure that it belongs to this test...

@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch main

Mute Reasons:

  • [main] 7 failures in test testCloseSkipUnavailable (3.3% fail rate in 213 executions)
  • [main] 3 failures in step part3 (12.5% fail rate in 24 executions)
  • [main] 2 failures in pipeline elasticsearch-periodic-platform-support (28.6% fail rate in 7 executions)
  • [main] 3 failures in pipeline elasticsearch-intake (12.5% fail rate in 24 executions)

Build Scans:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Analytics/ES|QL AKA ESQL needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

2 participants