Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <y2je06563881004230639i7af7217eoc70f5ba7f0d9e892@mail.gmail.com>
References: <g2z775e31411004221927w774b958bta8432c889a932f42@mail.gmail.com>
	 <u2we06563881004222009u995a3d8az649d75088e2a2f8b@mail.gmail.com>
	 <20DF20F5-F894-40B3-AFB8-70D18F61E2A9@oskarsson.nu>
	 <y2je06563881004230639i7af7217eoc70f5ba7f0d9e892@mail.gmail.com>
Date: Fri, 23 Apr 2010 10:39:45 -0400
Message-ID: <x2y775e31411004230739m6f87e1e7vdadfe95603265a14@mail.gmail.com>
Subject: Re: MapReduce, Timeouts and Range Batch Size
From: Joost Ouwerkerk <joost@openplaces.org>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=0016364ee0ecfabf5b0484e86542

--0016364ee0ecfabf5b0484e86542
Content-Type: text/plain; charset=ISO-8859-1

Awesome.  In the meantime, I hacked something similar myself.  The
performance difference does not appear to be material.  I think the real
killer is the get_range_slices call.  Relative to that, the cost of getting
the connection appears to be more or less trivial.  What can I do to
alleviate that cost?  CASSANDRA-821 looks interesting -- can I apply that to
0.6.1 ?
joost.

On Fri, Apr 23, 2010 at 9:39 AM, Jonathan Ellis <jbellis@gmail.com> wrote:

> Great!  Created https://issues.apache.org/jira/browse/CASSANDRA-1017
> to track this.
>
> On Fri, Apr 23, 2010 at 4:12 AM, Johan Oskarsson <johan@oskarsson.nu>
> wrote:
> > I have written some code to avoid thrift reconnection, it just keeps the
> connection open between get_range_slices calls.
> > I can extract that and put it up but not until early next week.
> >
> > /Johan
> >
> > On 23 apr 2010, at 05.09, Jonathan Ellis wrote:
> >
> >> That would be an easy win, sure.
> >>
> >> On Thu, Apr 22, 2010 at 9:27 PM, Joost Ouwerkerk <joost@openplaces.org>
> wrote:
> >>> I was getting client timeouts in ColumnFamilyRecordReader.maybeInit()
> when
> >>> MapReducing.  So I've reduced the Range Batch Size to 256 (from 4096)
> and
> >>> this seems to have fixed my problem, although it has slowed things down
> a
> >>> bit -- presumably because there are 16x more calls to get_range_slices.
> >>> While I was in that code I noticed that a new client was being created
> for
> >>> each batch get.  By decreasing the batch size, I've increased this
> >>> overhead.  I'm thinking of re-writing ColumnFamilyRecordReader to do
> some
> >>> connection pooling.  Anyone have any thoughts on that?
> >>> joost.
> >>>
> >
> >
>

--0016364ee0ecfabf5b0484e86542
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Awesome. =A0In the meantime, I hacked something similar myself. =A0The perf=
ormance difference does not appear to be material. =A0I think the real kill=
er is the get_range_slices call. =A0Relative to that, the cost of getting t=
he connection appears to be more or less trivial. =A0What can I do to allev=
iate that cost? =A0CASSANDRA-821 looks interesting -- can I apply that to 0=
.6.1 ?<div>
joost.</div><div><br><div class=3D"gmail_quote">On Fri, Apr 23, 2010 at 9:3=
9 AM, Jonathan Ellis <span dir=3D"ltr">&lt;<a href=3D"mailto:jbellis@gmail.=
com">jbellis@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_=
quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1=
ex;">
Great! =A0Created <a href=3D"https://issues.apache.org/jira/browse/CASSANDR=
A-1017" target=3D"_blank">https://issues.apache.org/jira/browse/CASSANDRA-1=
017</a><br>
to track this.<br>
<div><div></div><div class=3D"h5"><br>
On Fri, Apr 23, 2010 at 4:12 AM, Johan Oskarsson &lt;<a href=3D"mailto:joha=
n@oskarsson.nu">johan@oskarsson.nu</a>&gt; wrote:<br>
&gt; I have written some code to avoid thrift reconnection, it just keeps t=
he connection open between get_range_slices calls.<br>
&gt; I can extract that and put it up but not until early next week.<br>
&gt;<br>
&gt; /Johan<br>
&gt;<br>
&gt; On 23 apr 2010, at 05.09, Jonathan Ellis wrote:<br>
&gt;<br>
&gt;&gt; That would be an easy win, sure.<br>
&gt;&gt;<br>
&gt;&gt; On Thu, Apr 22, 2010 at 9:27 PM, Joost Ouwerkerk &lt;<a href=3D"ma=
ilto:joost@openplaces.org">joost@openplaces.org</a>&gt; wrote:<br>
&gt;&gt;&gt; I was getting client timeouts in ColumnFamilyRecordReader.mayb=
eInit() when<br>
&gt;&gt;&gt; MapReducing. =A0So I&#39;ve reduced the Range Batch Size to 25=
6 (from 4096) and<br>
&gt;&gt;&gt; this seems to have fixed my problem, although it has slowed th=
ings down a<br>
&gt;&gt;&gt; bit -- presumably because there are 16x more calls to get_rang=
e_slices.<br>
&gt;&gt;&gt; While I was in that code I noticed that a new client was being=
 created for<br>
&gt;&gt;&gt; each batch get. =A0By decreasing the batch size, I&#39;ve incr=
eased this<br>
&gt;&gt;&gt; overhead. =A0I&#39;m thinking of re-writing ColumnFamilyRecord=
Reader to do some<br>
&gt;&gt;&gt; connection pooling. =A0Anyone have any thoughts on that?<br>
&gt;&gt;&gt; joost.<br>
&gt;&gt;&gt;<br>
&gt;<br>
&gt;<br>
</div></div></blockquote></div><br></div>

--0016364ee0ecfabf5b0484e86542--