kafka-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jay Kreps <...@confluent.io>
Subject Re: [DISCUSS] KIP-63: Unify store and downstream caching in streams
Date Fri, 03 Jun 2016 22:15:38 GMT
Hey Eno,

Should the config be the global memory use rather than the per-processor?
That is, let’s say I know I have fixed a 1GB heap because that is what I
set for Java, and want to use 100MB for caching, it seems like right now
I’d have to do some math that depends on my knowing a bit about how caching
works to figure out how to set that parameter so I don't run out of memory.
Does it also depend on the number of partitions assigned (and hence the
number of task), if so that makes it even harder to set since each time
rebalancing happens that changes so it is then pretty hard to set safely.

You could theoretically argue for either bottom up (you know how much cache
you need per processor as you have it and you want to get exactly that) or
top down (you know how much memory you have to spare but can't be bothered
to work out what that amounts to per-processor). I think our experience has
been that 99% of people never change the default and if it runs out of
memory they really struggle to fix it and kind of blame us, so I think top
down and a global config might be better. :-)

Example: https://issues.apache.org/jira/browse/KAFKA-3775


On Fri, Jun 3, 2016 at 2:39 PM, Eno Thereska <eno.thereska@gmail.com> wrote:

> Hi Gwen,
> Yes. As an example, if cache.max.bytes.buffering set to X, and if users
> have A aggregation operators and T KTable.to() operators, then X*(A + T)
> total bytes will be allocated for caching.
> Eno
> > On 3 Jun 2016, at 21:37, Gwen Shapira <gwen@confluent.io> wrote:
> >
> > Just to clarify: "cache.max.bytes.buffering" is per processor?
> >
> >
> > On Thu, Jun 2, 2016 at 11:30 AM, Eno Thereska <eno.thereska@gmail.com>
> wrote:
> >> Hi there,
> >>
> >> I have created KIP-63: Unify store and downstream caching in streams
> >>
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-63%3A+Unify+store+and+downstream+caching+in+streams
> <
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-63:+Unify+store+and+downstream+caching+in+streams
> >
> >>
> >>
> >> Feedback is appreciated.
> >>
> >> Thank you
> >> Eno

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message