From: Fabien Rousseau <fabien@yakaz.com>
Date: Wed, 24 Jul 2013 15:42:15 +0200
Subject: Re: disappointed
To: user@cassandra.apache.org

Hi Paul,

Concerning large rows which are not compacting, I've probably managed to
reproduce your problem. I suppose you're using collections, but also TTLs?

Anyway, I opened an issue here:
https://issues.apache.org/jira/browse/CASSANDRA-5799
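In case you want to check whether it's the same thing, the write pattern I
used is roughly the following. This is a sketch only, assuming the DataStax
Python driver against a throwaway local node; the keyspace, table, TTL and
sizes are made up for illustration:

    # Sketch: grow a single partition past the large-row threshold while
    # every column in it carries a TTL. Assumes the DataStax Python driver
    # and a local test node; names and numbers are illustrative only.
    from cassandra.cluster import Cluster

    session = Cluster(["127.0.0.1"]).connect()
    session.execute("CREATE KEYSPACE repro WITH replication = "
                    "{'class': 'SimpleStrategy', 'replication_factor': 1}")
    session.execute("CREATE TABLE repro.big_row "
                    "(id text PRIMARY KEY, tags map<text, text>)")

    # ~60k entries of ~2KB is ~120MB in one row, which is above the default
    # in_memory_compaction_limit_in_mb, so compaction has to take the
    # large-row path.
    for i in range(60000):
        session.execute(
            "UPDATE repro.big_row USING TTL 864000 "
            "SET tags[%s] = %s WHERE id = 'one-big-row'",
            (str(i), "x" * 2000))

After the loop, a nodetool flush followed by nodetool compact on the
keyspace should exercise the large-row compaction path.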
Hope this helps

2013/7/24 Christopher Wirt <chris.wirt@struq.com>:

Hi Paul,

Sorry to hear you're having a low point.

We ended up not using the collection features of 1.2. Instead we store a
compressed string containing the map and handle it client side.
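Roughly like this, if it helps. A minimal sketch of the idea using only the
standard library, not our production code; the packed result just goes into
an ordinary text column instead of a CQL3 map:

    # Sketch of the "compressed map in one column" workaround: serialise
    # the map client-side, compress it, and store the result as plain text.
    import base64
    import json
    import zlib

    def pack_map(d):
        """dict -> compact text, safe to store in a text column."""
        raw = json.dumps(d, separators=(",", ":")).encode("utf-8")
        return base64.b64encode(zlib.compress(raw)).decode("ascii")

    def unpack_map(s):
        """Inverse of pack_map."""
        return json.loads(zlib.decompress(base64.b64decode(s)).decode("utf-8"))

    tags = {"colour": "red", "size": "large"}
    packed = pack_map(tags)
    assert unpack_map(packed) == tags

The trade-off is that the whole map becomes one column, so reads and writes
are all-or-nothing, but you avoid the per-entry overhead of CQL3 collections.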
We only have fixed schema short rows, so no experience with large row
compaction.

File descriptors have never got that high for us. But if you only have a
couple of physical nodes with loads of data and small sstables, maybe they
could get that high?

The only time I've had file descriptors get out of hand was when compaction
got slightly confused with a new schema, when I dropped and recreated
instead of truncating: https://issues.apache.org/jira/browse/CASSANDRA-4857.
Restarting the node fixed the issue.
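If you want to see what is actually holding the descriptors, a quick look at
/proc is usually enough. Something like this rough sketch (Linux-only; you
pass the Cassandra JVM's pid yourself):

    # Count what a process is holding open, grouped by file suffix, to see
    # whether sstable components (Data.db, Index.db, ...) dominate.
    # Usage: python fdcount.py <cassandra-pid>
    import os
    import sys
    from collections import Counter

    pid = sys.argv[1]
    fd_dir = "/proc/%s/fd" % pid

    counts = Counter()
    for fd in os.listdir(fd_dir):
        try:
            target = os.readlink(os.path.join(fd_dir, fd))
        except OSError:
            continue  # raced with a close(); skip it
        name = os.path.basename(target)
        # sstable files end in e.g. "...-Data.db"; group them by suffix
        suffix = name.rsplit("-", 1)[-1] if name.endswith(".db") else name
        counts[suffix] += 1

    print("total open fds: %d" % sum(counts.values()))
    for suffix, n in counts.most_common(10):
        print("%8d  %s" % (n, suffix))

If Data.db and Index.db dominate with a huge sstable count, it points at
compaction falling behind rather than a descriptor leak.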
From my limited experience I think Cassandra is a dangerous choice for a
young start-up with limited funding and experience that expects to scale
fast. We are a fairly mature start-up with funding. We've just spent 3-5
months moving from Mongo to Cassandra. It's been expensive and painful
getting Cassandra to read like Mongo, but we've made it :)

From: Paul Ingalls [mailto:paulingalls@gmail.com]
Sent: 24 July 2013 06:01
To: user@cassandra.apache.org
Subject: disappointed

I want to check in. I'm sad, mad and afraid. I've been trying to get a 1.2
cluster up and working with my data set for three weeks with no success.
I've been running a 1.1 cluster for 8 months now with no hiccups, but for
me at least 1.2 has been a disaster. I had high hopes for leveraging the
new features of 1.2, specifically vnodes and collections. But at this point
I can't release my system into production, and will probably need to find a
new back end. As a small startup, this could be catastrophic. I'm mostly
mad at myself. I took a risk moving to the new tech. I forgot that
sometimes when you gamble, you lose.

First, the performance of 1.2.6 was horrible when using collections. I
wasn't able to push through 500k rows before the cluster became unusable.
With a lot of digging, and way too much time, I discovered I was hitting a
bug that had just been fixed, but was unreleased. This scared me, because
the release was already at 1.2.6 and I would have expected something like
https://issues.apache.org/jira/browse/CASSANDRA-5677 to have been addressed
long before. But gamely I grabbed the latest code from the 1.2 branch,
built it, and was finally able to get past half a million rows.

But then I hit ~4 million rows, and a multitude of problems. Even with the
fix above, I was still seeing a ton of compactions failing, specifically
the ones for large rows. Not a single large row will compact; they all
assert with the wrong size. Worse, and this is what kills the whole thing,
I keep hitting a wall with open files, even after dumping the whole DB,
dropping vnodes and trying again. Seriously, 650k open file descriptors?
When it hits this limit, the whole DB craps out and is basically unusable.
This isn't that many rows. I have close to half a billion in 1.1…

I'm now at a standstill. I figure I have two options unless someone here
can help me. Neither of them involves 1.2. I can either go back to 1.1 and
remove the features that collections added to my service, or I find another
data backend that has similar performance characteristics to Cassandra but
allows collections-type behavior in a scalable manner. Because as far as I
can tell, 1.2 doesn't scale. Which makes me sad; I was proud of what I
accomplished with 1.1…

Does anyone know why there are so many open file descriptors? Any ideas on
why a large row won't compact?

Paul

--
Fabien Rousseau
www.yakaz.com

Concerning large rows which ar= e not compacting, I've probably managed to reproduce your problem.
I suppose you're using collections, but also TTLs ?
Anyway, I opened an issue here :=A0https://issues.apache.org/jira/browse/C= ASSANDRA-5799=A0

Hope this helps


2013/7/24 Christopher Wirt <chris.wi= rt@struq.com>

Hi Paul,

=A0

Sorry to hear you=92re having a low point.=

=A0<= /p>

We ended up not using = the collection features of 1.2.

Instead storing a compres= sed string containing the map and handling client side.

=A0<= /p>

We only have fixed sch= ema short rows so no experience with large row compaction.

=A0<= /p>

File descriptors have = never got that high for us. But, if you only have a couple physical nodes w= ith loads of data and small ss-tables maybe they could get that high?

=A0<= /p>

Only time I=92ve had f= ile descriptors get out of hand was then compaction got slightly confused w= ith a new schema when I dropped and recreated instead of truncating. https://issues.apache.org/jira/browse/CASSANDRA-4857 restarting the n= ode fixed the issue.

=A0<= /p>

=A0

From my limited experienc= e I think Cassandra is a dangerous choice for an young limited funding/expe= rience start-up expecting to scale fast. We are a fairly mature start-up wi= th funding. We=92ve just spent 3-5 months moving from Mongo to Cassandra. I= t=92s been expensive and painful getting Cassandra to read like Mongo, but = we=92ve made it J

=A0<= /p>

=A0

=A0<= /p>

=A0

From: Paul Ingalls [mailto:paulingalls@gmail.com]
Sent: 24 July 2013 06:01
To: user@cassandra.apache.org
Subje= ct: disappointed

=A0

I want t= o check in. =A0I'm sad, mad and afraid. =A0I've been trying to get = a 1.2 cluster up and working with my data set for three weeks with no succe= ss. =A0I've been running a 1.1 cluster for 8 months now with no hiccups= , but for me at least 1.2 has been a disaster. =A0I had high hopes for leve= raging the new features of 1.2, specifically vnodes and collections. =A0 Bu= t at this point I can't release my system into production, and will pro= bably need to find a new back end. =A0As a small startup, this could be cat= astrophic. =A0I'm mostly mad at myself. =A0I took a risk moving to the = new tech. =A0I forgot sometimes when you gamble, you lose.

=A0

First, the performance of 1.2.6 was horrible when using collections= . =A0I wasn't able to push through 500k rows before the cluster became = unusable. =A0With a lot of digging, and way too much time, I discovered I w= as hitting a bug that had just been fixed, but was unreleased. =A0This scar= ed me, because the release was already at 1.2.6 and I would have expected s= omething as=A0https://issues.apache.org/jira/browse/CASSANDRA-5677<= /a>=A0would have been addressed long before. =A0But gamely I grabbed the la= test code from the 1.2 branch, built it and I was finally able to get past = half a million rows. =A0

<= br>

--
--047d7b3437e40de0ae04e2421599--