From: Paul Ingalls <paulingalls@gmail.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_A6894B5B-80BB-42EC-A7AD-93EE8BDF0E9A"
Message-Id: <6C2FA25D-739A-4F41-B986-049D5EF0404C@gmail.com>
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
Subject: Re: disappointed
Date: Wed, 24 Jul 2013 09:43:03 -0700
References: <41C4F926-156E-4245-97A1-3D6CE35760C9@gmail.com>
 <00f801ce885e$b3864e70$1a92eb50$@struq.com>
To: user@cassandra.apache.org
In-Reply-To: <00f801ce885e$b3864e70$1a92eb50$@struq.com>


--Apple-Mail=_A6894B5B-80BB-42EC-A7AD-93EE8BDF0E9A
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=windows-1252

Hi Chris,

Thanks for the response!

What kind of challenges did you run into that kept you from using collections?

I am currently running 4 physical nodes, the same as I was with Cassandra 1.1.6.  I'm using size-tiered compaction.  Would changing to leveled compaction with a large minimum make a big difference, or would it just push the problem off till later?

Yeah, I have run into problems dropping schemas before as well.  I was careful this time to start with an empty db folder…

Glad you were successful in your transition… :)

Paul

On Jul 24, 2013, at 4:12 AM, "Christopher Wirt" <chris.wirt@struq.com> wrote:

> Hi Paul,
>
> Sorry to hear you’re having a low point.
>
> We ended up not using the collection features of 1.2.
> Instead, we store a compressed string containing the map and handle it client side.
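
For readers wondering what storing the map as a compressed string might look like on the client, here is a minimal sketch. The message does not say which serialization or compression was used, so JSON and zlib are assumptions, and the names encode_map and decode_map are hypothetical.

```python
import json
import zlib


def encode_map(mapping):
    """Serialize a dict to compact JSON and compress it to bytes.

    The result is meant to be written to a single blob/text column
    instead of a CQL map collection (assumed schema, not from the thread).
    """
    raw = json.dumps(mapping, sort_keys=True, separators=(",", ":"))
    return zlib.compress(raw.encode("utf-8"))


def decode_map(blob):
    """Inverse of encode_map: decompress the bytes and parse the JSON."""
    return json.loads(zlib.decompress(blob).decode("utf-8"))


settings = {"theme": "dark", "lang": "en"}
blob = encode_map(settings)
restored = decode_map(blob)
```

The trade-off with this approach is that the whole map has to be read and rewritten to change a single entry, since the database only sees an opaque value.
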
>
> We only have fixed-schema short rows, so no experience with large row compaction.
>
> File descriptors have never got that high for us. But if you only have a couple of physical nodes with loads of data and small SSTables, maybe they could get that high?
>
> The only time I’ve had file descriptors get out of hand was when compaction got slightly confused with a new schema after I dropped and recreated instead of truncating (https://issues.apache.org/jira/browse/CASSANDRA-4857). Restarting the node fixed the issue.
>
> From my limited experience I think Cassandra is a dangerous choice for a young start-up with limited funding and experience that expects to scale fast. We are a fairly mature start-up with funding. We’ve just spent 3-5 months moving from Mongo to Cassandra. It’s been expensive and painful getting Cassandra to read like Mongo, but we’ve made it :)
>
> From: Paul Ingalls [mailto:paulingalls@gmail.com]
> Sent: 24 July 2013 06:01
> To: user@cassandra.apache.org
> Subject: disappointed
>
> I want to check in.  I'm sad, mad and afraid.  I've been trying to get a 1.2 cluster up and working with my data set for three weeks with no success.  I've been running a 1.1 cluster for 8 months now with no hiccups, but for me at least 1.2 has been a disaster.  I had high hopes for leveraging the new features of 1.2, specifically vnodes and collections.  But at this point I can't release my system into production, and will probably need to find a new back end.  As a small startup, this could be catastrophic.  I'm mostly mad at myself.  I took a risk moving to the new tech.  I forgot sometimes when you gamble, you lose.
>
> First, the performance of 1.2.6 was horrible when using collections.  I wasn't able to push through 500k rows before the cluster became unusable.  With a lot of digging, and way too much time, I discovered I was hitting a bug that had just been fixed, but was unreleased.  This scared me, because the release was already at 1.2.6 and I would have expected something like https://issues.apache.org/jira/browse/CASSANDRA-5677 to have been addressed long before.  But gamely I grabbed the latest code from the 1.2 branch, built it and I was finally able to get past half a million rows.
>
> But, then I hit ~4 million rows, and a multitude of problems.  Even with the fix above, I was still seeing a ton of compactions failing, specifically the ones for large rows.  Not a single large row will compact, they all assert with the wrong size.  Worse, and this is what kills the whole thing, I keep hitting a wall with open files, even after dumping the whole DB, dropping vnodes and trying again.  Seriously, 650k open file descriptors?  When it hits this limit, the whole DB craps out and is basically unusable.  This isn't that many rows.  I have close to a half a billion in 1.1…
>
> I'm now at a standstill.  I figure I have two options unless someone here can help me.  Neither of them involves 1.2.  I can either go back to 1.1 and remove the features that collections added to my service, or I find another data backend that has similar performance characteristics to Cassandra but allows collections-type behavior in a scalable manner.  Because as far as I can tell, 1.2 doesn't scale.  Which makes me sad, I was proud of what I accomplished with 1.1…
>
> Does anyone know why there are so many open file descriptors?  Any ideas on why a large row won't compact?
>
> Paul
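
On the open file descriptor question above, one way to narrow things down is to see what kinds of files a process is actually holding open. The sketch below is an assumption about how one might inspect that on Linux, where /proc/<pid>/fd contains one symbolic link per open descriptor. It is not a Cassandra tool, and the function name summarize_open_files is hypothetical.

```python
import os
from collections import Counter


def summarize_open_files(fd_dir):
    """Count the symlink targets in fd_dir, grouped by file suffix.

    fd_dir is expected to look like /proc/<pid>/fd on Linux: a directory
    holding one symbolic link per open descriptor.
    """
    counts = Counter()
    for name in os.listdir(fd_dir):
        try:
            target = os.readlink(os.path.join(fd_dir, name))
        except OSError:
            # Entry vanished or is not a symlink; skip it.
            continue
        if target.startswith("/"):
            # Regular path: bucket by extension, e.g. ".db" or ".log".
            kind = os.path.splitext(target)[1] or "(no suffix)"
        else:
            # Non-file descriptors look like "socket:[1234]" or "pipe:[5678]".
            kind = target.split(":", 1)[0]
        counts[kind] += 1
    return counts
```

Calling summarize_open_files("/proc/<pid>/fd") with the database's process id, run as a user allowed to read that directory, and printing the result of .most_common() shows whether data files, sockets, or something else dominate the total.
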


--Apple-Mail=_A6894B5B-80BB-42EC-A7AD-93EE8BDF0E9A--