feat(parachain): send requests from subsystems #4475

haikoschol · 2025-01-16T11:38:29Z

Changes

This PR adds support for sending requests of request/response network protocols via overseer/network bridge from parachain subsystems.

Tests

go test -run TestSendRequests ./dot/parachain/network-bridge

Issues

#4448

haikoschol · 2025-01-16T11:39:57Z

As we agreed on a recent call, this approach is using SendRequests messages passed through overseer to the network bridge. Although the plan was to propose types/structure before implementing, I had to code it to figure whether it would actually work. One issue I ran into repeatedly was cyclic imports. To get unstuck, I moved a few things into dot/parachain/network-bridge/messages that don't really belong there. That's a minor thing that can be changed later of course.

I've used the following existing types from the network package for the implementation:

Message for requests
ResponseMessage for responses
the RequestMaker interface
RequestResponseProtocol returned by network.Service.GetRequestResponseProtocol

I've added the following types:

// ReqProtocolMessage is a network message that can be sent over a request response protocol.
type ReqProtocolMessage interface {
	network.Message
	// Response returns an instance of the response type for this message, for the purpose of decoding into it.
	Response() network.ResponseMessage
	Protocol() ReqProtocolName
}

// ReqRespResult is the result of sending a request over a request response protocol. It contains either a response
// message or an error.
type ReqRespResult struct {
	Response network.ResponseMessage
	Error    error
}

// OutgoingRequest contains all data required to send a request over a request response protocol and receive the result.
type OutgoingRequest struct {
	Recipient peer.ID // TODO use a type that can contain either a peer ID or an authority ID
	Payload   ReqProtocolMessage
	Result    chan ReqRespResult
}

type IfDisconnectedBehavior int

const (
	TryConnect     IfDisconnectedBehavior = iota
	ImmediateError                        // TODO not implemented
)

// SendRequests is a subsystem message for sending requests over a request response protocol.
type SendRequests struct {
	Requests       []*OutgoingRequest
	IfDisconnected IfDisconnectedBehavior
}

Usage from a subsystem looks roughly as follows:

request := networkbridgemessages.NewOutgoingRequest(
	"recipient",
	networkbridgemessages.ChunkFetchingRequest{
		CandidateHash: parachaintypes.CandidateHash{Value: common.Hash{1}},
		Index:         42,
	})

sendRequests := networkbridgemessages.SendRequests{
	Requests:       []*networkbridgemessages.OutgoingRequest{request},
	IfDisconnected: networkbridgemessages.TryConnect,
}

ss.subsystemToOverseer <- sendRequests

result := <- request.Result

There are a few downsides/issues to discuss in this approach:

Concurrency

The goroutine in NetworkBridgeSender that handles overseer messages currently handles requests sequentially and blocks on each request until error/timeout/response. It's easy enough to start a new goroutine for each request, but unclear how cancellation/shutdown can be handled properly. RequestResponseProtocol itself does not have a mechanism for aborting requests.

Parameterization

RequestResponseProtocol requires setting a value for timeouts and the maximum response size. Right now, this is set to hard-coded values in NetworkBridgeSender. Ideally, subsystems would set these values according to their requirements. This could be accomplished by adding these parameters to the OutgoingRequest struct

Efficiency

With the current implementation, an instance of RequestResponseProtocol is created for each request. This causes a large buffer to be allocated every time, which was supposed to be avoided by keeping this buffer in the objects state.

dot/network/service.go

EclesioMeloJunior

LGTM, just a small observation on context creation

dot/parachain/network-bridge/messages/request_response_protocols.go

EclesioMeloJunior

LGTM, just few comments

dot/parachain/network-bridge/sender.go

EclesioMeloJunior · 2025-01-27T12:25:51Z

dot/parachain/network-bridge/sender.go

+
+// PoV is probably the largest message and is currently set at 5MB, but will likely be increased to 10MB in the future.
+// see: https://github.com/paritytech/polkadot-sdk/issues/5334
+// Maybe message types should have a MaxSize() method instead of using the same value for all messages.


yeah, I would say you can attach to the protocol string the max response size

The question then becomes what the values for requests other than PoVFetchingV1 should be. Would you be ok with deferring this change to a later PR, given that using smaller values here amounts to a potential performance optimization?

EclesioMeloJunior · 2025-01-27T12:36:57Z

dot/parachain/network-bridge/sender_test.go

+
+func requireClosed(t *testing.T, ch chan networkbridgemessages.ReqRespResult) {
+	select {
+	case <-ch:


just to add an extra layer, when the channel is closed ok is false. Actually, you don't need the select when you retrieve a tuple from the channel (even if the channel is not closed)

Suggested change

case <-ch:

_, ok := <-ch

require.False(t, ok)

If the channel is empty but not closed, the receive will block though. The ok merely indicates whether the received value is the zero value for the type of the channel, i.e. whether the channel was empty.

feat(parachain): send requests from subsystems

f8ec2bd

haikoschol self-assigned this Jan 16, 2025

EclesioMeloJunior reviewed Jan 17, 2025

View reviewed changes

dot/network/service.go Show resolved Hide resolved

haikoschol added 4 commits January 20, 2025 22:15

Merge branch 'feat/parachain' into haiko/req-resp-4448

f0e6368

use gomock

93ca5dd

implement cancellation in OutgoingRequest

15c7252

fix unrelated flaky tests

364de93

haikoschol marked this pull request as ready for review January 23, 2025 09:03

haikoschol requested review from jimjbrettj and timwu20 as code owners January 23, 2025 09:03

fix doc comment

ba731e3

haikoschol requested review from dimartiro and axaysagathiya January 23, 2025 09:10

haikoschol added 2 commits January 23, 2025 16:42

run SendRequests tests in parallel

6923ef1

add/fix license headers

9c326ca

EclesioMeloJunior reviewed Jan 23, 2025

View reviewed changes

dot/parachain/network-bridge/messages/request_response_protocols.go Show resolved Hide resolved

ensure context inside OutgoingRequest is always cancelled

6b7321c

EclesioMeloJunior approved these changes Jan 27, 2025

View reviewed changes

haikoschol added 2 commits February 3, 2025 15:30

increase request timeout

9c6e113

require response channel to be empty and closed

54efaf6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(parachain): send requests from subsystems #4475

feat(parachain): send requests from subsystems #4475

haikoschol commented Jan 16, 2025

haikoschol commented Jan 16, 2025

EclesioMeloJunior left a comment

EclesioMeloJunior left a comment

EclesioMeloJunior Jan 27, 2025

haikoschol Feb 3, 2025

EclesioMeloJunior Jan 27, 2025

haikoschol Feb 3, 2025

feat(parachain): send requests from subsystems #4475

Are you sure you want to change the base?

feat(parachain): send requests from subsystems #4475

Conversation

haikoschol commented Jan 16, 2025

Changes

Tests

Issues

haikoschol commented Jan 16, 2025

EclesioMeloJunior left a comment

Choose a reason for hiding this comment

EclesioMeloJunior left a comment

Choose a reason for hiding this comment

EclesioMeloJunior Jan 27, 2025

Choose a reason for hiding this comment

haikoschol Feb 3, 2025

Choose a reason for hiding this comment

EclesioMeloJunior Jan 27, 2025

Choose a reason for hiding this comment

haikoschol Feb 3, 2025

Choose a reason for hiding this comment