Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of shimi.k@gmail.com designates
 209.85.161.44 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <20100717212113.GC79210@alumni.caltech.edu>
References: <20100714225847.GA64220@alumni.caltech.edu>
	<AANLkTilt07GBogJ182y-iDz3UcYtT08IXSWUN73Tp6cG@mail.gmail.com>
	<20100715202806.GB71234@alumni.caltech.edu>
	<AANLkTikG6mIT6RVCQWa8LeqPjcQjEvbLtDcOrDNAlXFw@mail.gmail.com>
	<20100716054508.GA73522@alumni.caltech.edu>
	<20100717212113.GC79210@alumni.caltech.edu>
Date: Sun, 18 Jul 2010 20:09:45 +0300
Message-ID: <AANLkTimmYDae7tiI3sK5t-SSS3FxkRG5zGsmV-hgrnU6@mail.gmail.com>
Subject: Re: Bootstrap question
From: shimi <shimi.k@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001636c5c241ce1d0f048bac84cb

--001636c5c241ce1d0f048bac84cb
Content-Type: text/plain; charset=ISO-8859-1

When a bootstrap never finishes, I do the following. I try each step in
order; if one doesn't help, I move on to the next. It might not be the
right thing to do, but it worked for me.

1. Restart the bootstrapping node.
2. If I see streaming 0/xxxx, I restart the node and all the streaming nodes.
3. Restart all the nodes.
4. If there is data on the bootstrapping node, I delete it before I restart.
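
The "try each one, then the next" routine above can be sketched in Python.
This is only an illustration: `try_in_order`, the remedy callables, and the
`bootstrap_finished` check are hypothetical placeholders that you would
have to write yourself, not Cassandra APIs.

```python
from typing import Callable, Optional, Sequence, Tuple

# A remedy is a (name, action) pair. The actions are hypothetical:
# in practice each would restart nodes by whatever means you use.
Remedy = Tuple[str, Callable[[], None]]


def try_in_order(
    remedies: Sequence[Remedy],
    bootstrap_finished: Callable[[], bool],
) -> Optional[str]:
    """Apply each remedy in turn and stop at the first one that works.

    Returns the name of the remedy after which `bootstrap_finished()`
    reported success, or None if every remedy was tried without success.
    """
    for name, action in remedies:
        action()
        # `bootstrap_finished` is assumed to wait long enough (for example
        # by polling logs) to decide whether the bootstrap completed.
        if bootstrap_finished():
            return name
    return None
```

You would pass the steps in the same order as the list: restart the new
node, then the new node plus the streaming nodes, then every node, with
the data cleanup done inside each restart action where it applies.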

Good luck
Shimi

On Sun, Jul 18, 2010 at 12:21 AM, Anthony Molinaro <
anthonym@alumni.caltech.edu> wrote:

> So still waiting for any sort of answer on this one.  The cluster still
> refuses to do anything when I bring up new nodes.  I shut down all the
> new nodes and am waiting.  I'm guessing that maybe the old nodes have
> some state which needs to get cleared out?  Is there anything I can do
> at this point?  Are there alternate strategies for bootstrapping I can
> try?  (For instance can I just scp all the sstables to all the new
> nodes and do a repair, would that actually work?).
>
> Anyone seen this sort of issue?  All this is with 0.6.3 so I assume
> eventually others will see this issue.
>
> -Anthony
>
> On Thu, Jul 15, 2010 at 10:45:08PM -0700, Anthony Molinaro wrote:
> > Okay, so things were pretty messed up.  I shut down all the new nodes,
> > then the old nodes started doing the half the ring is down garbage which
> > pretty much requires a full restart of everything.  So I had to shut
> > everything down, then bring the seed back, then the rest of the nodes,
> > so they finally all agreed on the ring again.
> >
> > Then I started one of the new nodes, and have been watching the logs, so
> > far 2 hours since the "Bootstrapping" message appeared in the new
> > log and nothing has happened.  No anticompaction messages anywhere,
> > there's one node compacting, but it's on the other end of the ring, so
> > nowhere near that new node.  I'm wondering if it will ever get data at
> > this point.
> >
> > Is there something else I should try?  The only thing I can think of
> > is deleting the system directory on the new node, and restarting, so
> > I'll try that and see if it does anything.
> >
> > -Anthony
> >
> > On Thu, Jul 15, 2010 at 03:43:49PM -0500, Jonathan Ellis wrote:
> > > On Thu, Jul 15, 2010 at 3:28 PM, Anthony Molinaro
> > > <anthonym@alumni.caltech.edu> wrote:
> > > > Is the fact that 2 new nodes are in the range messing it up?
> > >
> > > Probably.
> > >
> > > >  And if so
> > > > how do I recover (I'm thinking, shutdown new nodes 2,3,4,5, then
> > > > bringing up nodes 2,4, waiting for them to finish, then bringing
> > > > up 3,5?).
> > >
> > > Yes.
> > >
> > > You might have to restart the old nodes too to clear out the confusion.
> > >
> > > --
> > > Jonathan Ellis
> > > Project Chair, Apache Cassandra
> > > co-founder of Riptano, the source for professional Cassandra support
> > > http://riptano.com
> >
> > --
> > ------------------------------------------------------------------------
> > Anthony Molinaro                           <anthonym@alumni.caltech.edu>
>
> --
> ------------------------------------------------------------------------
> Anthony Molinaro                           <anthonym@alumni.caltech.edu>
>

--001636c5c241ce1d0f048bac84cb--