From: Alain RODRIGUEZ
Date: Thu, 8 Nov 2012 13:53:01 +0100
Subject: Re: Compact and Repair
To: user@cassandra.apache.org

Did you change the RF or have a node down since you last repaired?
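(If it helps, both are quick to check; a rough sketch only, with the host and keyspace name as placeholders:)

    # Up/Down status, token ownership and reported load per node
    nodetool -h 127.0.0.1 ring

    # Replication factor of the keyspace, e.g. from cqlsh or cassandra-cli:
    #   DESCRIBE KEYSPACE my_keyspace;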

2012/11/8 Henrik Schröder <skrolle@gmail.com>:
No, we're not using columns with TTL, and I performed a major compaction before the repair, so there shouldn't be vast amounts of tombstones moving around.

And the increase happened during the repair, the nodes gained ~20-30GB each.


/Henrik



On Thu, Nov 8, 2012 at 12:40 PM, horschi <horschi@gmail.com> wrote:
Hi,

is it possible that your repair is overrepairing due to any of the issues discussed here: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/repair-compaction-and-tombstone-rows-td7583481.html ?


I've seen repair increasing the load on my cluster, but what you are describing sounds like a lot to me.

Does this increase happen entirely due to the repair? Or was the load maybe increasing gradually over the week and you just checked it for the first time?

cheers,
Christian
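(One way to tell the two apart is to record each node's reported load and on-disk footprint right before and right after the repair; a sketch only, assuming the default data directory:)

    # Live load as the node reports it
    nodetool -h 127.0.0.1 info | grep -i load

    # Actual on-disk footprint, including snapshots/SSTables not yet cleaned up
    # (default data directory assumed; adjust for your install)
    du -sh /var/lib/cassandra/data/*

    # Per-column-family space used
    nodetool -h 127.0.0.1 cfstats | grep -iE 'column family|space used'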



On Thu, Nov 8, 2012 at 11:55 AM, Henrik Schröder <skrolle@gmail.com> wrote:
Hi,

We recently ran a major compaction across our cluster, which reduced the storage used by about 50%. This is fine, since we do a lot of updates to existing data, so that's the expected result.

The day after, we ran a full repair -pr across the cluster, and when that finished, each storage node was at about the same size as before the major compaction. Why does that happen? What gets transferred to other nodes, and why does it suddenly take up a lot of space again?

We haven't run repair -pr regularly, so is this just something that happens on the first weekly run, and can we expect a different result next week? Or does repair always cause the data to grow on each node? To me it just doesn't seem proportional.


/Henrik
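(For reference, the sequence described above is roughly the following, with the keyspace name as a placeholder; since -pr only covers each node's primary range, the repair has to be run on every node in turn:)

    # Major compaction: merges all SSTables per column family into one,
    # dropping tombstones older than gc_grace_seconds
    nodetool -h 127.0.0.1 compact my_keyspace

    # Primary-range repair: repairs only the range this node is primary for,
    # so it must be run on every node to cover the whole ring
    nodetool -h 127.0.0.1 repair -pr my_keyspace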


