cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gary Dusbabek <gdusba...@gmail.com>
Subject Re: What is the optimal size of batch mutate batches?
Date Tue, 11 May 2010 13:14:23 GMT
On Tue, May 11, 2010 at 06:54, David Boxenhorn <david@lookin2.com> wrote:
> My problem is that my rows are of very varying degrees of bushiness (i.e.
> number of supercolums and columns per row). I inserted 592,500 rows
> successfully, in a few minutes, and then I hit a batch of exceptionally
> bushy rows and ran out of memory.
>
> Does anyone have any suggestions about how to deal with this problem?

I think this is a matter of the size of data and not the "bushiness"
rows (fabulous term, btw).  For example, you could probably get away
with bushy rows to a certain degree if the columns themselves were
very small.

So you should probably approach this problem from a "how many MB do I
want to send in a single message" approach instead.  When the rows get
bushy, you should send less of them.

Gary.

Mime
View raw message