feat(relay): don't close connections upon errors in relay server #4718

thomaseizinger · 2023-10-24T04:09:03Z

Description

To remove the usages of ConnectionHandlerEvent::Close from the relay-server, we unify what used to be called CircuitFailedReason and FatalUpgradeError. Whilst the errors may be fatal for the particular circuit, they are not necessarily fatal for the entire connection.

Related: #3591.
Resolves: #4716.

Notes & open questions

Should we do some kind of "smart" connection management upon failures on the streams further up? At the moment, we don't expose the details of which connection a stream failed on. I am leaning towards saying "no" here and instead relying more on fix(swarm): keep connections alive while active streams exist #4595. Once we do more automated keep-alive tracking, bad connections will close automatically much more aggressively. That is because any error on a stream will lead to the user dropping the stream which means we will automatically return KeepAlive::No.

Change checklist

I have performed a self-review of my own code
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
A changelog entry has been made in the appropriate crates

protocols/relay/CHANGELOG.md

protocols/relay/src/behaviour.rs

mxinden

Thank you for the work.

Can you add a test that ensures #4752 is fixed with this pull request, i.e. that a connection is not closed even though the remote does not support the relay protocol as one expected?

protocols/relay/src/behaviour.rs

protocols/relay/src/behaviour/handler.rs

thomaseizinger · 2023-10-29T22:36:15Z

Can you add a test that ensures #4752 is fixed with this pull request, i.e. that a connection is not closed even though the remote does not support the relay protocol as one expected?

To properly test this, I think we need to also merge the fix for the client side (#4745) otherwise, the client will simply close the connection. Happy to add a test once both PRs are landed.

mergify · 2023-10-30T10:17:20Z

This pull request has merge conflicts. Could you please resolve them @thomaseizinger? 🙏

To make a reservation with a relay, a user calls `Swarm::listen_on` with an address of the relay, suffixed with a `/p2pcircuit` protocol. Similarly, to establish a circuit to another peer, a user needs to call `Swarm::dial` with such an address. Upon success, the `Swarm` then issues a `SwarmEvent::NewListenAddr` event in case of a successful reservation or a `SwarmEvent::ConnectionEstablished` in case of a successful connect. The story is different for errors. Somewhat counterintuitively, the actual reason of an error during these operations are only reported as `relay::Event`s without a direct correlation to the user's `Swarm::listen_on` or `Swarm::dial` calls. With this PR, we send these errors back "into" the `Transport` and report them as `SwarmEvent::ListenerClosed` or `SwarmEvent::OutgoingConnectionError`. This is conceptually more correct. Additionally, by sending these errors back to the transport, we no longer use `ConnectionHandlerEvent::Close` which entirely closes the underlying relay connection. In case the connection is not used for something else, it will be closed by the keep-alive algorithm. Resolves: #4717. Related: #3591. Related: #4718. Pull-Request: #4745.

This PR implements the long-awaited design of disallowing `ConnectionHandler`s to close entire connections. Instead, users should close connections via `ToSwarm::CloseConnection` from a `NetworkBehaviour` or - even better - from the `Swarm` via `close_connection`. A `NetworkBehaviour` also does not have a "full" view onto how a connection is used but at least it can correlate whether it created the connection via the `ConnectionId`. In general, the more modular and friendly approach is to stop "using" a connection if a particular protocol no longer needs it. As a result of the keep-alive algorithm, such a connection is then closed automatically. Depends-on: #4745. Depends-on: #4718. Depends-on: #4749. Related: #3353. Related: #4714. Resolves: #3591. Pull-Request: #4755.

thomaseizinger added 9 commits October 24, 2023 13:56

Flatten error hierarchy

22fbc01

Split inbound and outbound protocol workers

5da5d94

Only pass into connect what is necessary

ef39e44

Don't close relayed connections

483cfb6

Remove unused UpgradeError

bfc3544

Restructure error handling of outbound_stop

68f55af

Add changelog entry

76ad993

Add deprecation type alias to make migration easier

0d20720

Insert into active_connect_requests

bfdae50

thomaseizinger requested a review from mxinden October 24, 2023 04:09

thomaseizinger mentioned this pull request Oct 24, 2023

swarm: Remove ConnectionHandler::Error #3591

Closed

This comment was marked as resolved.

Sign in to view

Merge branch 'master' into feat/remove-connection-close-relayed

502564d

thomaseizinger commented Oct 24, 2023

View reviewed changes

protocols/relay/CHANGELOG.md Outdated Show resolved Hide resolved

Update protocols/relay/CHANGELOG.md

842bf89

thomaseizinger commented Oct 24, 2023

View reviewed changes

protocols/relay/src/behaviour.rs Show resolved Hide resolved

thomaseizinger mentioned this pull request Oct 27, 2023

feat(relay): propagate errors to Transport::{listen_on,dial} #4745

Merged

4 tasks

thomaseizinger added this to the v0.53.0 milestone Oct 27, 2023

This comment was marked as resolved.

Sign in to view

Merge branch 'master' into feat/remove-connection-close-relayed

bf29c29

thomaseizinger mentioned this pull request Oct 29, 2023

feat(swarm): don't have ConnectionHandlers close connections #4755

Merged

4 tasks

mxinden reviewed Oct 29, 2023

View reviewed changes

protocols/relay/src/behaviour.rs Show resolved Hide resolved

protocols/relay/src/behaviour/handler.rs Show resolved Hide resolved

thomaseizinger mentioned this pull request Oct 29, 2023

relay: don't report errors for failed requests as events but log them instead as warn! #4757

Open

thomaseizinger added the internal-change Pull requests that make internal changes to crates and thus don't need to include a changelog entry. label Oct 29, 2023

Deprecate event variants

34c8f71

thomaseizinger force-pushed the feat/remove-connection-close-relayed branch from 14ee70d to 34c8f71 Compare October 29, 2023 23:12

mxinden approved these changes Oct 31, 2023

View reviewed changes

mxinden added the send-it label Oct 31, 2023

thomaseizinger and others added 4 commits November 1, 2023 12:24

Merge branch 'master' into feat/remove-connection-close-relayed

a07b7fc

Set Error to Void

09f1be3

Remove unused error variants

b618982

Merge branch 'master' into feat/remove-connection-close-relayed

5173804

mergify bot merged commit 823d0b2 into master Nov 1, 2023
71 checks passed

mergify bot deleted the feat/remove-connection-close-relayed branch November 1, 2023 01:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(relay): don't close connections upon errors in relay server #4718

feat(relay): don't close connections upon errors in relay server #4718

thomaseizinger commented Oct 24, 2023 •

edited

Loading

This comment was marked as resolved.

This comment was marked as resolved.

mxinden left a comment

thomaseizinger commented Oct 29, 2023

mergify bot commented Oct 30, 2023

feat(relay): don't close connections upon errors in relay server #4718

feat(relay): don't close connections upon errors in relay server #4718

Conversation

thomaseizinger commented Oct 24, 2023 • edited Loading

Description

Notes & open questions

Change checklist

This comment was marked as resolved.

This comment was marked as resolved.

mxinden left a comment

Choose a reason for hiding this comment

thomaseizinger commented Oct 29, 2023

mergify bot commented Oct 30, 2023

thomaseizinger commented Oct 24, 2023 •

edited

Loading