From: aaron morton <aaron@thelastpickle.com>
Subject: Re: RF update
Date: Fri, 19 Oct 2012 00:32:02 +1300
To: user@cassandra.apache.org

> Follow up question: Is it safe to abort the compactions happening after node repair?

It is always safe to abort a compaction. The purpose of compaction is to rewrite the current truth in a more compact format: it does not modify existing data, it just creates new files. The worst case would be killing it between the time the new files are marked as non-temporary and the time the old files are deleted. That would result in wasted disk space, but the truth in the system would not change.
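If you do want to kill one mid-flight, nodetool can do it directly on versions that ship the stop command (a rough sketch only; the host name is a placeholder):

    # Ask the node to abort any in-progress compactions, then confirm
    # nothing is still running. Requires a nodetool recent enough to
    # have the "stop" command.
    nodetool -h node1.example.com stop COMPACTION
    nodetool -h node1.example.com compactionstats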


> Question: These additional compactions seem redundant since there are no reads or writes on the cluster after the first major compaction (immediately after the data load), is that right?
Repair transfers a portion of the -Data.db component from potentially multiple SSTables. This may result in multiple new SSTables being created on the receiving node. Once the files are created they are processed much like a freshly flushed memtable, so compaction kicks in.
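If you want to see where the time is going on a node, streaming and the follow-on compactions can be watched separately (host name is a placeholder):

    # Streaming sessions opened by repair:
    nodetool -h node1.example.com netstats
    # Active and pending compactions triggered by the newly received SSTables:
    nodetool -h node1.example.com compactionstats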

> And if so, what can we do to avoid them? We are currently waiting multiple days.
The fact that compaction is taking so long is odd. Have you checked the logs for GC problems? If you are running an SSD-backed instance and have turned off compaction throttling, the high IO throughput can result in a lot of GC garbage. Faster is not always better.
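A rough sketch of both checks (the log path and the throughput value are only examples):

    # Long GC pauses show up as GCInspector entries in the system log
    # (the path depends on how Cassandra was installed):
    grep -i GCInspector /var/log/cassandra/system.log | tail -20

    # Re-enable compaction throttling; the value is MB/s, 0 means unthrottled,
    # and 16 was the shipped default:
    nodetool -h node1.example.com setcompactionthroughput 16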

To improve your situation, consider:

* disabling compaction by setting min_compaction_threshold and max_compaction_threshold to 0, via the schema or nodetool (commands sketched below)
* disabling durable_writes on the keyspace to bypass the commit log during the bulk load
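A sketch of both, assuming 1.1-era nodetool and cassandra-cli syntax (keyspace and column family names are placeholders):

    # Per node, via nodetool (does not change the schema):
    nodetool -h node1.example.com setcompactionthreshold MyKeyspace MyCF 0 0

    # Or cluster-wide via the schema, from cassandra-cli:
    use MyKeyspace;
    update column family MyCF with min_compaction_threshold = 0 and max_compaction_threshold = 0;
    update keyspace MyKeyspace with durable_writes = false;

Remember to set the thresholds and durable_writes back to their normal values once the load and repair have finished.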

Cheers


-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 17/10/2012, at 11:55 PM, Matthias Broecheler <me@matthiasb.com> wrote:

> Follow up question: Is it safe to abort the compactions happening after node repair?

> On Mon, Oct 15, 2012 at 6:32 PM, Will Martin <will@voodoolunchbox.com> wrote:
> +1 It doesn't make sense that the xfr compactions are heavy unless they are translating the file. This could be a protocol mismatch; however, I would expect the requirements for node-level compaction and wire compaction to be pretty different.
> On Oct 15, 2012, at 4:42 PM, Matthias Broecheler wrote:

> > Hey,
> >
> > We are writing a lot of data into a Cassandra cluster for a batch loading use case. We cannot use the sstable batch loader, so in order to speed up the loading process we are using RF=1 while the data is loading. After the load is complete, we want to increase the RF. For that, we update the RF in the schema and then run the node repair tool on each Cassandra instance to stream the data over. However, we are noticing that this process is slowed down by a lot of compactions (the actual streaming of data only takes a couple of minutes).
> >
> > Cassandra already runs a major compaction after the data loading process has completed. But then there appear to be two more compactions happening (one on the sender and one on the receiver), and those take a very long time even on the AWS high-I/O instance with no compaction throttling.
> >
> > Question: These additional compactions seem redundant since there are no reads or writes on the cluster after the first major compaction (immediately after the data load), is that right? And if so, what can we do to avoid them? We are currently waiting multiple days.
> >
> > Thank you very much for your help,
> > Matthias
> >
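The RF bump plus repair sequence described above looks roughly like this with the tools of that era (a sketch only; keyspace name, replication factor and host are placeholders, and the exact strategy_options syntax varies slightly between cli versions):

    # cassandra-cli: raise the replication factor in the schema.
    update keyspace MyKeyspace with strategy_options = {replication_factor: 3};

    # Then, on each node in turn, stream the now-missing replicas over:
    nodetool -h node1.example.com repair MyKeyspace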




> --
> Matthias Broecheler, PhD
> http://www.matthiasb.com
> E-Mail: me@matthiasb.com
