Limit data sent after dht differences #94

ThetaSinner · 2025-02-03T20:09:31Z

Applies the same logic that we have for "new ops" to the op ids returned by the DHT diff process. I believe this is necessary regardless of whether it's a complete solution to the "new agent joining an existing network" problem.

If we left this unlimited, we'd return the entire data set in one batch during the first gossip round with new peers. That's too much load on the peer you happen to hit. As I'm writing, I'm wondering if we should enforce a maximum value for this that peers can request...

Limiting how much data we get in one go means we don't have to do anything complicated, like trying to decide which sectors or time slices we want data from. Both are sparse, so we'd get unpredictable results that way. Instead, we do our best to sync, and hope that between one gossip round and the next, the other peer will fetch enough to reduce the number of sectors/slices that produce a diff on the next round. If it doesn't happen that quickly, that is still the effect over time.

As soon as we stop having a diff for a time slice or a sector, we'll be able to progress to new areas of the DHT in a natural way. There will be some duplicate requests for op ids, but I think that's a reasonable thing to have happen. It's part of learning about the network.
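To make the "do our best, then continue next round" behaviour concrete, here is a minimal sketch, not the code from this PR. `OpMeta` and `take_within_budget` are hypothetical names, and representing an op id as a `u64` is an assumption made purely for illustration.

```rust
/// Hypothetical pairing of an op id with the size of its op data in bytes.
/// Not a kitsune2 type; the `u64` id is a stand-in for illustration.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct OpMeta {
    pub op_id: u64,
    pub size_bytes: u32,
}

/// Take ops in order until the next op would exceed the byte budget.
/// Returns the selected op ids and the number of bytes they use.
pub fn take_within_budget(ops: &[OpMeta], budget_bytes: u32) -> (Vec<u64>, u32) {
    let mut used = 0u32;
    let mut selected = Vec::new();
    for op in ops {
        // Stop at the first op that does not fit so the order is preserved.
        match used.checked_add(op.size_bytes) {
            Some(total) if total <= budget_bytes => {
                used = total;
                selected.push(op.op_id);
            }
            _ => break,
        }
    }
    (selected, used)
}
```

With three 40-byte ops and a 100-byte budget, the first call selects two ops and a later round picks up the third. Note that in this sketch an op larger than the whole budget is never selected, so a real implementation would need a policy for that case.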

This change really needs tests that exercise the 3 different paths:

Limit on "new ops"
Limit on "disc diff" when part of the limit was consumed by "new ops"
Limit on "ring diff" when part of the limit was consumed by "new ops"

But adding those tests requires modifying the testing in the same way as #85.
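The three paths above all come down to one limit being shared between the "new ops" stage and whichever diff stage follows it. A minimal sketch of that sharing, assuming a hypothetical `ByteBudget` type that is not part of kitsune2:

```rust
/// Hypothetical byte budget shared by every stage of one gossip response.
#[derive(Debug)]
pub struct ByteBudget {
    remaining: u32,
}

impl ByteBudget {
    pub fn new(limit_bytes: u32) -> Self {
        Self { remaining: limit_bytes }
    }

    /// Bytes still available to later stages.
    pub fn remaining(&self) -> u32 {
        self.remaining
    }

    /// Count how many leading ops fit, deducting their sizes from the budget.
    /// Stops at the first op that is larger than what remains.
    pub fn take_prefix(&mut self, op_sizes: &[u32]) -> usize {
        let mut taken = 0;
        for &size in op_sizes {
            if size > self.remaining {
                break;
            }
            self.remaining -= size;
            taken += 1;
        }
        taken
    }
}
```

A test for the "disc diff" or "ring diff" path would first call `take_prefix` with the new op sizes, then call it again with the diff op sizes, and assert that the second call only gets what the first one left over.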

crates/api/src/op_store.rs

crates/gossip/proto/gen/kitsune2.gossip.rs

crates/gossip/src/initiate.rs

neonphog

woot, looks good.

> As I'm writing, I'm wondering if we should enforce a maximum value for this that peers can request

Agree, just a local config, and we can std::cmp::min() the requested size and our max config.
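A minimal sketch of that clamp, assuming a hypothetical local config struct. `GossipLimits`, `max_response_bytes`, and `effective_limit` are illustrative names, not existing kitsune2 items; only `std::cmp::min` comes from the comment above.

```rust
/// Hypothetical local config holding the most we are willing to send.
pub struct GossipLimits {
    pub max_response_bytes: u32,
}

/// Honour the peer's requested size, but never exceed our own maximum.
pub fn effective_limit(requested_bytes: u32, limits: &GossipLimits) -> u32 {
    std::cmp::min(requested_bytes, limits.max_response_bytes)
}
```

A peer asking for more than the local maximum gets the local maximum, while a smaller request is honoured as-is.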

ThetaSinner added 4 commits February 3, 2025 20:03

chore(kitsune2_api): Add optional byte limiting to the time slice ope…

0b761f6

…ration of the op store

chore(kitsune2_core): Implement optional byte limiting to the time sl…

ebd677a

…ice operation of the op store

chore(kitsune2_dht): Limit how much data is returned by a DHT diff

3ca77ef

chore(kitsune2_gossip): Integrate op data limiting for DHT differences

2dc239c

ThetaSinner marked this pull request as ready for review February 3, 2025 20:17

ThetaSinner commented Feb 3, 2025

crates/api/src/op_store.rs (resolved)

ThetaSinner commented Feb 3, 2025

crates/api/src/op_store.rs (resolved)

ThetaSinner commented Feb 3, 2025

crates/gossip/proto/gen/kitsune2.gossip.rs (resolved)

ThetaSinner commented Feb 3, 2025

crates/gossip/src/initiate.rs (resolved)

ThetaSinner requested a review from a team February 3, 2025 20:36

neonphog approved these changes Feb 3, 2025


ThetaSinner merged commit 2a9f534 into main Feb 3, 2025
5 checks passed

ThetaSinner deleted the limit-data-sent-after-dht-differences branch February 3, 2025 23:05

ThetaSinner mentioned this pull request Feb 4, 2025

Op data limit tests #96

Merged
