Increased memory usage with many short-lived producers #39
This revision contains the configurable events channel size which we need in order to address #39.
golang/go#16930 is now closed. We should verify that rafka built with Go 1.13 no longer suffers from this issue, and close this. Also relevant: golang/go#30333.
After testing with go tip (65ef999), the situation is pretty much the same as before, so I'm leaving this open as a known issue.
We have a pathological case where RSS can skyrocket: a large number of short-lived producers.
Such a case is typical with forking clients. resque, for example, spawns a process per job and kills it when the job is done. Assuming a short-lived job that also produces a single message to Rafka, we may end up with hundreds or even thousands of producers that are spawned only to produce one message and die afterwards.
Meanwhile, confluent-kafka-go producers are costly, since each of them pre-allocates two channels with 1M-element buffers:

- the `p.events` channel, accessible via `p.Events()`
- the `p.produceChannel` channel, accessible via `p.ProduceChannel()`
The situation gets even worse because of golang/go#16930.
Proposal
This could be fixed by re-architecting Rafka to use an N:M model (N client producers, M librdkafka producers), but that would require significant changes and would make Rafka usage more complex. We want to keep the 1:1 model if possible, because it is simple.
However, we can remedy the issue in a few ways:
- Set the `produceChannel` size to 0: this channel is completely unnecessary since we use the function-based producer (6ef4bf2)
- Reduce the size of the `events` channel
- Allow clients to control their producer configuration (#40, deferred)

We should also state in the README that Rafka, like librdkafka itself, is optimized for a few long-lived producers rather than bursty usage patterns (i.e. many short-lived producers).
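A producer configured along these lines might look like the sketch below. It relies on confluent-kafka-go's `go.produce.channel.size` and `go.events.channel.size` configuration properties (both default to 1000000); the broker address and the chosen events-channel size are illustrative assumptions, and running it requires the confluent-kafka-go module and a reachable broker:

```go
package main

import (
	"log"

	"github.com/confluentinc/confluent-kafka-go/kafka"
)

func main() {
	p, err := kafka.NewProducer(&kafka.ConfigMap{
		"bootstrap.servers":       "localhost:9092", // assumption: local broker
		"go.produce.channel.size": 0,                // unused with the function-based Produce()
		"go.events.channel.size":  10000,            // illustrative, much smaller than the 1M default
	})
	if err != nil {
		log.Fatal(err)
	}
	defer p.Close()
}
```

With both buffers shrunk, the per-producer fixed cost drops from tens of MiB to something negligible, which is what matters when thousands of producers each send a single message.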