Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of apoorva.gaurav@myntra.com
 designates 209.85.223.170 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CANF7QJSKm1twZby-JeKgegwvxCaj_HCGToKBpqv=8Ar6BwjKUg@mail.gmail.com>
References: 
 <CAJRvvD_bvyMpqbAdFRB_Eabi8NUf=5pPk5v0mfim7H=x4gJHyA@mail.gmail.com>
 <CAF4_GfiPUuFeWukJGne8kEX8fFdxT_+6kKywvYkjTYtByTiM8g@mail.gmail.com>
 <CAJRvvD9kCiFeH-d9NDi35ZAuHMUGrd--F50OfY5xkkTt8LJS0g@mail.gmail.com>
 <CAEDUwd0Vokx61rcoTDNAmfE_skfzO0Mks7fcxQKC5wj+mQHirg@mail.gmail.com>
 <CAJRvvD9CpyTRyuagj629T3hyC50yMgG7J-oQ85TicHx=DrQSCQ@mail.gmail.com>
 <CAEDUwd3oR2E93t3M03LmpvXqK+HrKso1u_m1LabtDuukY1k1YA@mail.gmail.com>
 <CAJRvvD8GZNY06nbpeKpdknE=Vn8RDqtobK6NO2BYbyzBBcpT7g@mail.gmail.com>
 <CANF7QJSKm1twZby-JeKgegwvxCaj_HCGToKBpqv=8Ar6BwjKUg@mail.gmail.com>
From: Apoorva Gaurav <apoorva.gaurav@myntra.com>
Date: Wed, 2 Apr 2014 11:42:11 +0530
Message-ID: 
 <CAJRvvD8qiOYGSB5oQ1+UoS1okOurOFYjU-yUTQCogKgVau=mSg@mail.gmail.com>
Subject: Re: Read performance in map data type
To: user <user@cassandra.apache.org>
Content-Type: multipart/alternative; boundary=20cf301b67e97dc9d904f6092b81

--20cf301b67e97dc9d904f6092b81
Content-Type: text/plain; charset=ISO-8859-1

I've observed that reducing the fetch size results in better latency (isn't
that obvious :-)). I tried fetch sizes from 100 to 10000 and am seeing a lot
of errors at 10000. I haven't tried modifying the number of columns.
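
As a rough, driver-independent sketch of the trade-off: the fetch size is
counted in rows, so a smaller value means more pages (round trips) but less
data per page, and the bytes per page still grow with the number of columns.
The class and method names below are hypothetical (not driver APIs), the row
count is made up, and the byte estimate assumes fixed-width 4-byte ints and
ignores protocol overhead. It models sizes only, not measured latency.

```java
final class FetchSizeSketch {
    // Idealized number of pages needed to read totalRows when each page
    // holds at most fetchSize rows (ceiling division).
    static long roundTrips(long totalRows, int fetchSize) {
        if (fetchSize <= 0) {
            throw new IllegalArgumentException("fetchSize must be positive");
        }
        return (totalRows + fetchSize - 1) / fetchSize;
    }

    // Rough payload of one full page: rows x columns x bytes per value.
    // Assumption: fixed-width values, no framing or metadata overhead.
    static long approxPageBytes(int fetchSize, int columns, int bytesPerValue) {
        return (long) fetchSize * columns * bytesPerValue;
    }

    public static void main(String[] args) {
        long totalRows = 1_000_000; // hypothetical result size
        int columns = 3;            // studentID, subjectID, marks
        int bytesPerInt = 4;        // a CQL int is 32 bits
        for (int fetchSize : new int[] {100, 1000, 5000, 10000}) {
            System.out.printf("fetchSize=%d roundTrips=%d approxPageBytes=%d%n",
                    fetchSize,
                    roundTrips(totalRows, fetchSize),
                    approxPageBytes(fetchSize, columns, bytesPerInt));
        }
    }
}
```

With the same fetch size, a table with N marks columns would produce pages
roughly (N + 2) / 3 times larger than the three-column table in this estimate.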

Let me start a new thread focused on fetch size.


On Wed, Apr 2, 2014 at 9:53 AM, Sourabh Agrawal <iitr.sourabh@gmail.com> wrote:

> From the doc: The fetch size controls how much resulting rows will be
> retrieved simultaneously.
> So, I guess it does not depend on the number of columns as such. As all
> the columns for a key reside on the same node, I think the number of
> columns wouldn't matter much as long as we have enough memory in the app.
>
> The default value is 5000 (com.datastax.driver.core.QueryOptions).
>
> We use it with the default value. I have never profiled Cassandra for read
> load. If you profile it for different fetch sizes, please share the results
> :)
>
>
> On Wed, Apr 2, 2014 at 8:45 AM, Apoorva Gaurav <apoorva.gaurav@myntra.com> wrote:
>
>> Thanks Sourabh,
>>
>> I've modelled my table as "studentID int, subjectID int, marks int,
>> PRIMARY KEY(studentID, subjectID)" as I'll primarily be querying by
>> studentID and sometimes by studentID and subjectID.
>>
>> I've tried driver 2.0.0 and it's giving good results. I'm also using its
>> auto paging feature. Any idea what a typical value for fetch size should
>> be? And does the fetch size depend on how many columns there are in the
>> CQL table? E.g., should the fetch size for a table like "studentID int,
>> subjectID int, marks1 int, marks2 int, marks3 int.... marksN int, PRIMARY
>> KEY(studentID, subjectID)" be less than the fetch size for "studentID int,
>> subjectID int, marks int, PRIMARY KEY(studentID, subjectID)"?
>>
>>
>> On Wed, Apr 2, 2014 at 2:20 AM, Robert Coli <rcoli@eventbrite.com> wrote:
>>
>>>  On Mon, Mar 31, 2014 at 9:13 PM, Apoorva Gaurav <
>>> apoorva.gaurav@myntra.com> wrote:
>>>
>>>> Thanks Robert. Is there a workaround? In our test setups we keep
>>>> dropping and recreating tables.
>>>>
>>>
>>> Use unique keyspace (or table) names for each test? That's the approach
>>> they're taking in 5202...
>>>
>>> =Rob
>>>
>>>
>>
>>
>> --
>> Thanks & Regards,
>> Apoorva
>>
>
>
>
> --
> Sourabh Agrawal
> Bangalore
> +91 9945657973
>


-- 
Thanks & Regards,
Apoorva

--20cf301b67e97dc9d904f6092b81--