Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v1.22.x]prov/efa: differentiate unresponsive receiver errors following rdma-core #10497

Open
wants to merge 1 commit into
base: v1.22.x
Choose a base branch
from

Conversation

jiaxiyan
Copy link
Contributor

Add a new vendor error code EFA_IO_COMP_STATUS_LOCAL_ERROR_UNREACH_REMOTE from rdma core to indicate the remote is unreachable. Add a new EFA provider error code UNESTABLISHED_RECV_UNRESP to distinguish unresponsive receiver error when the peer is reachable by the EFA device but libfabric failed to complete a handshake.
Add unit test for EFA_IO_COMP_STATUS_LOCAL_ERROR_UNREACH_REMOTE.

Signed-off-by: Jessie Yang [email protected]
(cherry picked from commit 5573b3f)

Add a new vendor error code EFA_IO_COMP_STATUS_LOCAL_ERROR_UNREACH_REMOTE
from rdma core to indicate the remote is unreachable.
Add a new EFA provider error code UNESTABLISHED_RECV_UNRESP to distinguish
unresponsive receiver error when the peer is reachable by the EFA device
but libfabric failed to complete a handshake.
Add unit test for EFA_IO_COMP_STATUS_LOCAL_ERROR_UNREACH_REMOTE.

Signed-off-by: Jessie Yang <[email protected]>
(cherry picked from commit 5573b3f)
@jiaxiyan jiaxiyan requested a review from a team October 28, 2024 18:27
@shijin-aws
Copy link
Contributor

bot:aws:retest

1 similar comment
@shijin-aws
Copy link
Contributor

bot:aws:retest

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants