[Merged by Bors] - p2p: server: adjust deadline during long reads and writes #5463

ivan4th · 2024-01-18T03:12:21Z

Motivation

Currently, if a request takes too long to complete due to the data size and/or the speed of network connection, the requests often time out before the completion. This happens because stream deadline is set just once at the start of the request and not updated till it is completed. The timeouts cause a lot of i/o deadline exceeded errors, and also cause the nodes with high peer counts to be hammered by repeated retries by peers that fail to get ATX ID list for epoch, ActiveSets etc. Moreover, nodes may prevent themselves from retrieving the full response to request they make because of the timeouts.

Changes

In p2p/server, split large reads / writes into 4 KiB chunks. Update stream deadline after each chunk is read/written.

Test Plan

Verified the approach to work using emulated low-bandwidth node-to-node sync env: https://gist.github.com/ivan4th/c8add0b4d6a6fffb3da248b6b6bac346
Verified the approach to work on a node with high peer count
Verified normal mainnet syncing

codecov · 2024-01-18T03:22:52Z

Codecov Report

Attention: 17 lines in your changes are missing coverage. Please review.

Comparison is base (dd1fa87) 77.6% compared to head (aee761f) 77.6%.
Report is 18 commits behind head on develop.

Files	Patch %	Lines
p2p/server/server.go	57.5%	14 Missing ⚠️
p2p/server/deadline_adjuster.go	96.7%	2 Missing and 1 partial ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           develop   #5463    +/-   ##
========================================
  Coverage     77.6%   77.6%            
========================================
  Files          267     268     +1     
  Lines        30956   31184   +228     
========================================
+ Hits         24045   24224   +179     
- Misses        5396    5431    +35     
- Partials      1515    1529    +14

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ivan4th · 2024-01-18T04:18:24Z

bors try

spacemesh-bors · 2024-01-18T05:10:21Z

try

Build succeeded:

p2p/server/deadline_adjuster.go

p2p/server/deadline_adjuster_test.go

p2p/server/server.go

Co-authored-by: Bartosz Różański <[email protected]>

ivan4th · 2024-01-25T19:08:53Z

bors merge

## Motivation Currently, if a request takes too long to complete due to the data size and/or the speed of network connection, the requests often time out before the completion. This happens because stream deadline is set just once at the start of the request and not updated till it is completed. The timeouts cause a lot of `i/o deadline exceeded` errors, and also cause the nodes with high peer counts to be hammered by repeated retries by peers that fail to get ATX ID list for epoch, ActiveSets etc. Moreover, nodes may prevent themselves from retrieving the full response to request they make because of the timeouts. ## Changes In `p2p/server`, split large reads / writes into 4 KiB chunks. Update stream deadline after each chunk is read/written. ## Test Plan Verified the approach to work using emulated low-bandwidth node-to-node sync env: https://gist.github.com/ivan4th/c8add0b4d6a6fffb3da248b6b6bac346 Verified the approach to work on a node with high peer count Verified normal mainnet syncing Co-authored-by: Ivan Shvedunov <[email protected]>

spacemesh-bors · 2024-01-25T21:42:23Z

Pull request successfully merged into develop.

Build succeeded:

## Motivation Currently, if a request takes too long to complete due to the data size and/or the speed of network connection, the requests often time out before the completion. This happens because stream deadline is set just once at the start of the request and not updated till it is completed. The timeouts cause a lot of `i/o deadline exceeded` errors, and also cause the nodes with high peer counts to be hammered by repeated retries by peers that fail to get ATX ID list for epoch, ActiveSets etc. Moreover, nodes may prevent themselves from retrieving the full response to request they make because of the timeouts. ## Changes In `p2p/server`, split large reads / writes into 4 KiB chunks. Update stream deadline after each chunk is read/written. ## Test Plan Verified the approach to work using emulated low-bandwidth node-to-node sync env: https://gist.github.com/ivan4th/c8add0b4d6a6fffb3da248b6b6bac346 Verified the approach to work on a node with high peer count Verified normal mainnet syncing Co-authored-by: Ivan Shvedunov <[email protected]>

ivan4th requested review from dshulyak, fasmat and poszu as code owners January 18, 2024 03:12

ivan4th added 2 commits January 18, 2024 07:25

p2p: server: adjust deadline during long reads and writes

4f4efb1

Update CHANGELOG.md

d63ca14

ivan4th force-pushed the feature/adjust-deadline branch from 869fc1a to d63ca14 Compare January 18, 2024 03:26

spacemesh-bors bot added a commit that referenced this pull request Jan 18, 2024

Try #5463:

e36c815

ivan4th added 5 commits January 19, 2024 16:07

p2p: server: introduce hard deadline

f29dce4

Merge remote-tracking branch 'origin/develop'

bf1339d

fetch: make hard timeout configurable for requests

a0bc458

fetch: adjust tests and defaults

6ecb03e

Merge remote-tracking branch 'origin/develop'

9c50490

poszu reviewed Jan 25, 2024

View reviewed changes

ivan4th and others added 2 commits January 25, 2024 19:59

Update p2p/server/deadline_adjuster.go

cfea2ce

Co-authored-by: Bartosz Różański <[email protected]>

p2p: server: improve error handling

aee761f

poszu approved these changes Jan 25, 2024

View reviewed changes

spacemesh-bors bot changed the title ~~p2p: server: adjust deadline during long reads and writes~~ [Merged by Bors] - p2p: server: adjust deadline during long reads and writes Jan 25, 2024

spacemesh-bors bot closed this Jan 25, 2024

spacemesh-bors bot deleted the feature/adjust-deadline branch January 25, 2024 21:42

ivan4th mentioned this pull request Jan 25, 2024

Backport p2p fixes to v1.3 #5500

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - p2p: server: adjust deadline during long reads and writes #5463

[Merged by Bors] - p2p: server: adjust deadline during long reads and writes #5463

ivan4th commented Jan 18, 2024

codecov bot commented Jan 18, 2024 •

edited

Loading

ivan4th commented Jan 18, 2024

spacemesh-bors bot commented Jan 18, 2024

ivan4th commented Jan 25, 2024

spacemesh-bors bot commented Jan 25, 2024

[Merged by Bors] - p2p: server: adjust deadline during long reads and writes #5463

[Merged by Bors] - p2p: server: adjust deadline during long reads and writes #5463

Conversation

ivan4th commented Jan 18, 2024

Motivation

Changes

Test Plan

codecov bot commented Jan 18, 2024 • edited Loading

Codecov Report

ivan4th commented Jan 18, 2024

spacemesh-bors bot commented Jan 18, 2024

try

ivan4th commented Jan 25, 2024

spacemesh-bors bot commented Jan 25, 2024

codecov bot commented Jan 18, 2024 •

edited

Loading