DRAFT: Consume fieldcaps node response while deserializing it #120010

original-brownbear · 2025-01-11T00:00:53Z

This is just to illustrate where we can save memory here. The concrete API could be made nicer, but functionally this seems fine, and some quick and dirty benchmarking shows a measurable performance boost.

The API implemented here could definitely use some extra love. But the idea seems to work quite well according to some quick testing, and this approach could save a massive amount of memory, not just in field caps.

Instead of deserializing a full list of per-index responses before processing them, we should consume each response right after deserializing it, without any intermediary list. This should be safe to do since we already deserialize responses on the same thread that consumes the full response today.
Not only does this save a lot of heap by merging responses directly, it might also save quite a bit of runtime through better cache locality. Also, having a stateful deserializer allows for a couple of helpful tricks around deduplication across multiple responses for fan-out style APIs.
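To make the idea concrete, here is a minimal, self-contained sketch of consuming entries while reading them. It is not the code in this PR and uses none of the Elasticsearch transport classes: `StreamingMergeSketch`, `MergingReader`, `consume`, `encode`, and the toy wire format (index count, then per index a name, a field count, and field/type pairs) are all hypothetical names and assumptions made for illustration. The reader merges each per-index entry into the running result as soon as it is decoded, and keeps a small string table across calls to show the kind of cross-response deduplication a stateful deserializer allows.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch: none of these names are Elasticsearch APIs.
final class StreamingMergeSketch {

    /** Stateful reader that merges every per-index entry right after decoding it. */
    static final class MergingReader {
        // Merged view: field name -> all types seen so far across indices and nodes.
        private final Map<String, Set<String>> merged = new TreeMap<>();
        // Dedup state that survives across responses: equal strings share one instance.
        private final Map<String, String> interned = new HashMap<>();
        private int entriesConsumed;

        private String dedup(String s) {
            String previous = interned.putIfAbsent(s, s);
            return previous == null ? s : previous;
        }

        /** Decodes one node response and merges it; no list of per-index entries is built. */
        void consume(byte[] nodeResponse) {
            try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(nodeResponse))) {
                int indexCount = in.readInt();
                for (int i = 0; i < indexCount; i++) {
                    in.readUTF(); // index name, unused by the merged view in this sketch
                    int fieldCount = in.readInt();
                    for (int f = 0; f < fieldCount; f++) {
                        String field = dedup(in.readUTF());
                        String type = dedup(in.readUTF());
                        merged.computeIfAbsent(field, k -> new TreeSet<>()).add(type);
                    }
                    entriesConsumed++;
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }

        Map<String, Set<String>> merged() { return merged; }
        int entriesConsumed() { return entriesConsumed; }
        int distinctStrings() { return interned.size(); }
    }

    /** Helper that writes the assumed toy wire format: index name -> (field -> type). */
    static byte[] encode(Map<String, Map<String, String>> perIndex) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeInt(perIndex.size());
            for (Map.Entry<String, Map<String, String>> index : perIndex.entrySet()) {
                out.writeUTF(index.getKey());
                out.writeInt(index.getValue().size());
                for (Map.Entry<String, String> field : index.getValue().entrySet()) {
                    out.writeUTF(field.getKey());
                    out.writeUTF(field.getValue());
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }
}
```

In this sketch a single `MergingReader` would be reused for every node response in a fan-out, so peak memory is bounded by the merged result plus one in-flight entry rather than by the full list of decoded responses. This only holds under the assumption stated above, that decoding and consumption happen on the same thread.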


The remoteAddress field is only used (by security) for requests; having it in responses is redundant. Also, we have a couple of responses that are singletons/quasi-enums, where setting the value needlessly might introduce some strange contention even though it's a plain store. This isn't just a cosmetic change: it makes it clear at compile time that each response instance is defined exclusively by the bytes it is read from. This makes it easier to reason about the validity of suggested optimizations like elastic#120010
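As a rough illustration of the compile-time argument, the sketch below shows a response type whose state can only come from the stream it is read from. It is not the actual TransportResponse code: `BytesDefinedResponseSketch`, `NodeResponse`, `EmptyResponse`, `toBytes`, and `fromBytes` are hypothetical names, and the two-field wire format is an assumption. Every field is final and there is no setter, so nothing can mutate an instance after it is read, and a shared singleton never receives a per-request write.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical sketch: none of these names are Elasticsearch APIs.
final class BytesDefinedResponseSketch {

    /** All state is final and assigned only while reading from the stream. */
    static final class NodeResponse {
        private final String nodeId;
        private final int fieldCount;

        NodeResponse(DataInputStream in) throws IOException {
            this.nodeId = in.readUTF();
            this.fieldCount = in.readInt();
        }

        String nodeId() { return nodeId; }
        int fieldCount() { return fieldCount; }
    }

    /** Quasi-enum response: one shared instance with no mutable field to write to. */
    static final class EmptyResponse {
        static final EmptyResponse INSTANCE = new EmptyResponse();
        private EmptyResponse() {}
    }

    /** Helper that writes the assumed wire format: node id, then field count. */
    static byte[] toBytes(String nodeId, int fieldCount) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(nodeId);
            out.writeInt(fieldCount);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    /** Helper that decodes a response from the bytes alone. */
    static NodeResponse fromBytes(byte[] bytes) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes))) {
            return new NodeResponse(in);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With this shape, two instances read from the same bytes are indistinguishable, which is the property that makes inline consumption during deserialization easier to reason about.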

…line-consume-response

original-brownbear added WIP :Search Foundations/Search Catch all for Search Foundations labels Jan 11, 2025

elasticsearchmachine added the v9.0.0 label Jan 11, 2025

even faster

6e86b1d

original-brownbear mentioned this pull request Jan 11, 2025

Remove remoteAddress field from TransportResponse #120016

Open

original-brownbear and others added 4 commits January 11, 2025 17:30

fix

73be1bd

[CI] Auto commit changes from spotless

b399259

fix

8a5d3c9

Merge remote-tracking branch 'origin/inline-consume-response' into in…

2a71fcc

…line-consume-response
