
feat(tiering): Faster small bins serialization #3340

Closed
wants to merge 1 commit

Conversation

dranikpg (Contributor)

No description provided.

@dranikpg dranikpg force-pushed the tiering-offload-opt branch 2 times, most recently from 549f0a0 to 150256b Compare July 20, 2024 11:54
Comment on lines +73 to +76
void FlushDelayed(bool force) {
// Flush pages with most records accumulated first, or all, if forced.
// It's enough just to issue reads, because they are collapsed by the tiered storage internally
while ((force && !delayed_.empty()) || delayed_.size() > kMaxPageAccum) {
dranikpg (Contributor Author)

We need to choose some kind of limit here... by the number of pages? By the total sum of entries? 🤷🏻‍♂️ We have to take into account that once we flush all of those, we can get a memory spike.
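The loop in the diff above flushes the fullest pages first, bounded by `kMaxPageAccum`. A minimal standalone sketch of that policy follows; the types are simplified (a page id mapping to its accumulated entries), and the limit-by-page-count choice is just one of the options debated here, not the PR's final answer:

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Illustrative sketch of the "flush fullest pages first" policy.
// kMaxPageAccum caps how many pages may accumulate entries (an assumed
// policy choice; limiting by total entry count is the alternative).
struct DelayedPages {
  static constexpr size_t kMaxPageAccum = 2;

  std::map<int, std::vector<std::string>> pages;  // page id -> entries

  // Returns ids of flushed pages, fullest first. In the real code each
  // flushed page's entries would turn into issued reads.
  std::vector<int> FlushDelayed(bool force) {
    std::vector<int> flushed;
    while ((force && !pages.empty()) || pages.size() > kMaxPageAccum) {
      // Pick the page with the most accumulated records.
      auto best = pages.begin();
      for (auto it = pages.begin(); it != pages.end(); ++it)
        if (it->second.size() > best->second.size()) best = it;
      flushed.push_back(best->first);
      pages.erase(best);
    }
    return flushed;
  }
};
```

Flushing fullest-first maximizes how many entries each collapsed read covers, at the cost of the memory spike mentioned above when `force` drains everything at once.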

Comment on lines 437 to 441
}

// Flush tiered delayed entries to avoid reordering with journal
tiered_serializer_->FlushDelayed(true);
}
dranikpg (Contributor Author)

Ideally we pass the value with ChangeReq& and find out whether it was tiered; only if it's tiered and part of a relevant page should we flush it.

int64_t memory_margin_ = 0;
std::vector<tiering::DiskSegment> delayed_deletes_;
dranikpg (Contributor Author)

todo: better naming

@dranikpg dranikpg requested a review from romange July 20, 2024 11:57
@dranikpg dranikpg changed the title WIP feat(tiering): Faster small bins serialization feat(tiering): Faster small bins serialization Jul 20, 2024
@dranikpg (Contributor Author)

@romange Please look only at the idea; I left some open questions.

@romange (Collaborator) left a comment

Nice that you suggest doing it without changing the RDB format.
I am not sure about the effectiveness of the flush, though: the probability of keys from the same page being in the same serialization batch is low.

: db_slice_(slice), dest_(dest), compression_mode_(compression_mode) {
: db_slice_(slice),
dest_(dest),
tiered_serializer_(new TieredSerializer{}),
romange (Collaborator)

Do you want to initialize it unconditionally?

dranikpg (Contributor Author)

🤷🏻‍♂️ I don't think it's expensive

auto entries = delayed_.extract(page);
for (auto& [base, value] : entries.mapped()) {
DCHECK(value.IsExternal());
pending_.push_back(Read(std::move(base), value));
romange (Collaborator)

What guarantees that by the time you flush the delayed entries, their segments are still correct?
Maybe the pages were freed and repurposed for other values?

dranikpg (Contributor Author)

Inside tiered storage, I don't delete small bin pages while we're serializing.
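The invariant described here, that small-bin pages stay pinned for the duration of serialization so previously recorded segments remain valid, can be modeled roughly as below. This is an illustrative sketch, not the actual Dragonfly code; all names are hypothetical:

```cpp
#include <cassert>
#include <set>

// Sketch of deferring small-bin page frees while a serialization pass runs,
// so disk segments recorded earlier cannot be repurposed mid-snapshot.
class SmallBinPages {
 public:
  void Add(int page) { pages_.insert(page); }

  void StartSerialization() { serializing_ = true; }

  void EndSerialization() {
    serializing_ = false;
    for (int page : deferred_frees_) pages_.erase(page);  // apply deferred frees
    deferred_frees_.clear();
  }

  void Free(int page) {
    if (serializing_)
      deferred_frees_.insert(page);  // pin until serialization finishes
    else
      pages_.erase(page);
  }

  bool Alive(int page) const { return pages_.count(page) > 0; }

 private:
  bool serializing_ = false;
  std::set<int> pages_;
  std::set<int> deferred_frees_;
};
```

Under this model, a segment captured before `StartSerialization()` stays readable until `EndSerialization()`, which answers the reviewer's repurposing concern at the cost of delayed reclamation.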

delayed_sizes_.insert({entries.size(), page});
}
} else {
pending_.push_back(Read(std::move(base), pv));
romange (Collaborator)

nit: pending_reads_


romange commented Jul 21, 2024

I thought about this more. The master is the more fragile piece compared to a slave or an instance loading the snapshot.
That means we want to move the complexity to the load phase, and I think it is possible if we just send the segment support at the end. This suggestion tracks pending items on the master side, i.e. maintains per-object granularity. I think it's worth extending RDB support to maintain segment granularity (i.e. send 4K pages with their offsets).

@dranikpg dranikpg closed this Jul 25, 2024
@dranikpg dranikpg deleted the tiering-offload-opt branch July 25, 2024 12:59
@dranikpg dranikpg restored the tiering-offload-opt branch July 25, 2024 14:54
@dranikpg dranikpg reopened this Jul 25, 2024
@dranikpg (Contributor Author)

Closed in favour of #3396

@dranikpg dranikpg closed this Sep 28, 2024