Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of berrytemk@gmail.com designates
 209.85.215.44 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <399A1DFA-D0AC-4355-B2AC-3A0DFCFB6ADE@thelastpickle.com>
References: 
 <CAHfvOyNU39EfDVOJi7xoYKCG2FNA2xOoupoo6otvqEQotUfc8g@mail.gmail.com>
	<4F8446F0.7070906@4friends.od.ua>
	<CAHfvOyPbY=hQ8wJe--X57oHLAcWkCCDHideewJEaiyJkvrcqWQ@mail.gmail.com>
	<4F84639D.7050304@4friends.od.ua>
	<CAMQDwDZdezzUR2o0Y4rZqa=mZmHzeCiVyagV25dJ=6cnyrqHqA@mail.gmail.com>
	<CAHfvOyNUSK5VCt2X8Gjm-TVY=kuV+HQds8o_E31guMs=1-3wPg@mail.gmail.com>
	<CAJJ04x74OtKrQjDO3NBSBpzBEwUN7LbNKvyuYQWZ+mJ9OzW4ww@mail.gmail.com>
	<CAHfvOyMPbj-pE9=bRqD5pkMA6raE4+qVFyu4brNc+vs5pQn6dA@mail.gmail.com>
	<399A1DFA-D0AC-4355-B2AC-3A0DFCFB6ADE@thelastpickle.com>
Date: Thu, 12 Apr 2012 00:06:06 -0400
Message-ID: 
 <CAHfvOyOLu8Q-LGdkzK1TnBmV=LzNGj8xsxcepB4Eh5nkV9pwTA@mail.gmail.com>
Subject: Re: Repair Process Taking too long
From: Frank Ng <berrytemk@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=f46d04088d8ba5253804bd73793c

--f46d04088d8ba5253804bd73793c
Content-Type: text/plain; charset=ISO-8859-1

Thank you for confirming that the per node data size is most likely causing
the long repair process.  I have tried a repair on smaller column families
and it was significantly faster.

On Wed, Apr 11, 2012 at 9:55 PM, aaron morton <aaron@thelastpickle.com>wrote:

> If you have 1TB of data it will take a long time to repair. Every bit of
> data has to be read and a hash generated. This is one of the reasons we
> often suggest that around 300 to 400Gb per node is a good load in the
> general case.
>
> Look at nodetool compactionstats .Is there a validation compaction running
> ? If so it is still building the merkle  hash tree.
>
> Look at nodetool netstats . Is it streaming data ? If so all hash trees
> have been calculated.
>
> Cheers
>
>
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 12/04/2012, at 2:16 AM, Frank Ng wrote:
>
> Can you expand further on your issue? Were you using Random Patitioner?
>
> thanks
>
> On Tue, Apr 10, 2012 at 5:35 PM, David Leimbach <leimy2k@gmail.com> wrote:
>
>> I had this happen when I had really poorly generated tokens for the ring.
>>  Cassandra seems to accept numbers that are too big.  You get hot spots
>> when you think you should be balanced and repair never ends (I think there
>> is a 48 hour timeout).
>>
>>
>> On Tuesday, April 10, 2012, Frank Ng wrote:
>>
>>> I am not using tier-sized compaction.
>>>
>>>
>>> On Tue, Apr 10, 2012 at 12:56 PM, Jonathan Rhone <rhone@tinyco.com>wrote:
>>>
>>>> Data size, number of nodes, RF?
>>>>
>>>> Are you using size-tiered compaction on any of the column families that
>>>> hold a lot of your data?
>>>>
>>>> Do your cassandra logs say you are streaming a lot of ranges?
>>>> zgrep -E "(Performing streaming repair|out of sync)"
>>>>
>>>>
>>>> On Tue, Apr 10, 2012 at 9:45 AM, Igor <igor@4friends.od.ua> wrote:
>>>>
>>>>>  On 04/10/2012 07:16 PM, Frank Ng wrote:
>>>>>
>>>>> Short answer - yes.
>>>>> But you are asking wrong question.
>>>>>
>>>>>
>>>>> I think both processes are taking a while.  When it starts up,
>>>>> netstats and compactionstats show nothing.  Anyone out there successfully
>>>>> using ext3 and their repair processes are faster than this?
>>>>>
>>>>>  On Tue, Apr 10, 2012 at 10:42 AM, Igor <igor@4friends.od.ua> wrote:
>>>>>
>>>>>> Hi
>>>>>>
>>>>>> You can check with nodetool  which part of repair process is slow -
>>>>>> network streams or verify compactions. use nodetool netstats or
>>>>>> compactionstats.
>>>>>>
>>>>>>
>>>>>> On 04/10/2012 05:16 PM, Frank Ng wrote:
>>>>>>
>>>>>>> Hello,
>>>>>>>
>>>>>>> I am on Cassandra 1.0.7.  My repair processes are taking over 30
>>>>>>> hours to complete.  Is it normal for the repair process to take this long?
>>>>>>>  I wonder if it's because I am using the ext3 file system.
>>>>>>>
>>>>>>> thanks
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Jonathan Rhone
>>>> Software Engineer
>>>>
>>>> *TinyCo*
>>>> 800 Market St., Fl 6
>>>> San Francisco, CA 94102
>>>> www.tinyco.com
>>>>
>>>>
>>>
>
>

--f46d04088d8ba5253804bd73793c
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Thank you for confirming that the per node data size is most likely causing=
 the long repair process. =A0I have tried a repair on smaller column famili=
es and it was significantly faster.<br><br><div class=3D"gmail_quote">On We=
d, Apr 11, 2012 at 9:55 PM, aaron morton <span dir=3D"ltr">&lt;<a href=3D"m=
ailto:aaron@thelastpickle.com">aaron@thelastpickle.com</a>&gt;</span> wrote=
:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div style=3D"word-wrap:break-word">If you h=
ave 1TB of data it will take a long time to repair. Every bit of data has t=
o be read and a hash generated. This is one of the reasons we often suggest=
 that around 300 to 400Gb per node is a good load in the general case.=A0<d=
iv>
<br></div><div>Look at nodetool compactionstats .Is there a validation comp=
action running ? If so it is still building the merkle =A0hash tree.=A0</di=
v><div><br></div><div>Look at nodetool netstats . Is it streaming data ? If=
 so all hash trees have been calculated.=A0</div>
<div><br></div><div>Cheers</div><div><br></div><div><br><div><div>
<span style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;te=
xt-align:-webkit-auto;font-style:normal;font-weight:normal;line-height:norm=
al;border-collapse:separate;text-transform:none;font-size:medium;white-spac=
e:normal;font-family:Helvetica;word-spacing:0px"><span style=3D"text-indent=
:0px;letter-spacing:normal;font-variant:normal;font-style:normal;font-weigh=
t:normal;line-height:normal;border-collapse:separate;text-transform:none;fo=
nt-size:medium;white-space:normal;font-family:Helvetica;word-spacing:0px"><=
div style=3D"word-wrap:break-word">
<span style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;fo=
nt-style:normal;font-weight:normal;line-height:normal;border-collapse:separ=
ate;text-transform:none;font-size:medium;white-space:normal;font-family:Hel=
vetica;word-spacing:0px"><div style=3D"word-wrap:break-word">
<span style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;fo=
nt-style:normal;font-weight:normal;line-height:normal;border-collapse:separ=
ate;text-transform:none;font-size:medium;white-space:normal;font-family:Hel=
vetica;word-spacing:0px"><div style=3D"word-wrap:break-word">
<div><div>-----------------</div><div>Aaron Morton</div><div>Freelance Deve=
loper</div><div>@aaronmorton</div><div><a href=3D"http://www.thelastpickle.=
com" target=3D"_blank">http://www.thelastpickle.com</a></div></div></div></=
span></div>
</span></div></span></span>
</div><div><div class=3D"h5">

<br><div><div>On 12/04/2012, at 2:16 AM, Frank Ng wrote:</div><br><blockquo=
te type=3D"cite">Can you expand further on your issue? Were you using Rando=
m Patitioner?<br><br>thanks<br><br><div class=3D"gmail_quote">On Tue, Apr 1=
0, 2012 at 5:35 PM, David Leimbach <span dir=3D"ltr">&lt;<a href=3D"mailto:=
leimy2k@gmail.com" target=3D"_blank">leimy2k@gmail.com</a>&gt;</span> wrote=
:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">I had this happen when I had really poorly g=
enerated tokens for the ring. =A0Cassandra seems to accept numbers that are=
 too big. =A0You get hot spots when you think you should be balanced and re=
pair never ends (I think there is a 48 hour timeout).<div>

<div><span></span><br>
<br>On Tuesday, April 10, 2012, Frank Ng  wrote:<br><blockquote class=3D"gm=
ail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-le=
ft:1ex">I am not using tier-sized compaction.<br><br><br><div class=3D"gmai=
l_quote">


On Tue, Apr 10, 2012 at 12:56 PM, Jonathan Rhone <span dir=3D"ltr">&lt;<a>r=
hone@tinyco.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" =
style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<span style=3D"font-family:Consolas,Menlo,Monaco,&#39;Lucida Console&#39;,&=
#39;Liberation Mono&#39;,&#39;DejaVu Sans Mono&#39;,&#39;Bitstream Vera San=
s Mono&#39;,monospace,serif;font-size:12px;line-height:21px">Data size, num=
ber of nodes, RF?</span><div>


<br></div><div>Are you using size-tiered compaction on any of the column fa=
milies that hold a lot of your data?<br><div><br></div><div>Do your cassand=
ra logs say you are streaming a lot of ranges?</div><div><span style=3D"lin=
e-height:21px;font-size:12px;font-family:Consolas,Menlo,Monaco,&#39;Lucida =
Console&#39;,&#39;Liberation Mono&#39;,&#39;DejaVu Sans Mono&#39;,&#39;Bits=
tream Vera Sans Mono&#39;,monospace,serif">zgrep -E &quot;(Performing strea=
ming repair|out of sync)&quot;</span><span style=3D"line-height:21px;font-s=
ize:12px;font-family:Consolas,Menlo,Monaco,&#39;Lucida Console&#39;,&#39;Li=
beration Mono&#39;,&#39;DejaVu Sans Mono&#39;,&#39;Bitstream Vera Sans Mono=
&#39;,monospace,serif">=A0</span></div>


<div><span style=3D"line-height:21px;font-size:12px;font-family:Consolas,Me=
nlo,Monaco,&#39;Lucida Console&#39;,&#39;Liberation Mono&#39;,&#39;DejaVu S=
ans Mono&#39;,&#39;Bitstream Vera Sans Mono&#39;,monospace,serif"><br>
</span></div><div><font face=3D"Consolas, Menlo, Monaco, &#39;Lucida Consol=
e&#39;, &#39;Liberation Mono&#39;, &#39;DejaVu Sans Mono&#39;, &#39;Bitstre=
am Vera Sans Mono&#39;, monospace, serif"><span style=3D"font-size:12px;lin=
e-height:21px"><br>


</span></font></div><div><div><div><div class=3D"gmail_quote">On Tue, Apr 1=
0, 2012 at 9:45 AM, Igor <span dir=3D"ltr">&lt;<a>igor@4friends.od.ua</a>&g=
t;</span> wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">

 =20
   =20
 =20
  <div bgcolor=3D"#FFFFFF" text=3D"#000000">
    On 04/10/2012 07:16 PM, Frank Ng wrote:<br>
    <br>
    Short answer - yes.<br>
    But you are asking wrong question.<div><div></div><div><br>
    <br>
    <blockquote type=3D"cite">I think both processes are taking a while.=A0=
 When it
      starts up, netstats and compactionstats show nothing.=A0 Anyone out
      there successfully using ext3 and their repair processes are
      faster than this?<br>
      <br>
      <div class=3D"gmail_quote">
        On Tue, Apr 10, 2012 at 10:42 AM, Igor <span dir=3D"ltr">&lt;<a>igo=
r@4friends.od.ua</a>&gt;</span>
        wrote:<br>
        <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border=
-left:1px #ccc solid;padding-left:1ex">
          Hi<br>
          <br>
          You can check with nodetool =A0which part of repair process is
          slow - network streams or verify compactions. use nodetool
          netstats or compactionstats.
          <div>
            <div><br>
              <br>
              On 04/10/2012 05:16 PM, Frank Ng wrote:<br>
              <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;=
border-left:1px #ccc solid;padding-left:1ex">
                Hello,<br>
                <br>
                I am on Cassandra 1.0.7. =A0My repair processes are taking
                over 30 hours to complete. =A0Is it normal for the repair
                process to take this long? =A0I wonder if it&#39;s because =
I
                am using the ext3 file system.<br>
                <br>
                thanks<br>
              </blockquote>
              <br>
            </div>
          </div>
        </blockquote>
      </div>
      <br>
    </blockquote>
    <br>
  </div></div></div>

</blockquote></div><br><br clear=3D"all"><div><br></div></div></div><span><=
font color=3D"#888888">-- <br><span style=3D"border-collapse:collapse"><spa=
n style=3D"font-size:13px;border-collapse:collapse;font-family:arial,sans-s=
erif"><font size=3D"1"><font face=3D"tahoma, sans-serif">Jonathan Rhone</fo=
nt></font></span><div>


<font size=3D"1" face=3D"tahoma, sans-serif">Software Engineer</font></div>=
<div><font size=3D"1" face=3D"tahoma, sans-serif"><br></font></div><div sty=
le=3D"font-family:arial,sans-serif;font-size:13px"><span style=3D"font-fami=
ly:arial,sans-serif;font-size:13px;border-collapse:collapse"><span style=3D=
"font-size:x-small"><font color=3D"#3333FF" face=3D"tahoma, sans-serif"><b>=
TinyCo</b></font></span><div>


<div><font size=3D"1"><font face=3D"tahoma, sans-serif">800 Market St., Fl =
6</font></font></div><div><font size=3D"1"><font face=3D"tahoma, sans-serif=
">San Francisco, CA 94102</font></font></div></div><div><font size=3D"1"><f=
ont face=3D"tahoma, sans-serif"><a href=3D"http://www.tinyco.com/" style=3D=
"color:rgb(0,0,204)" target=3D"_blank">www.tinyco.com</a></font></font></di=
v>


</span></div></span><br>
</font></span></div></div>
</blockquote></div><br>
</blockquote>
</div></div></blockquote></div><br>
</blockquote></div><br></div></div></div></div></div></blockquote></div><br=
>

--f46d04088d8ba5253804bd73793c--