incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Héctor Izquierdo Seliva <izquie...@strands.com>
Subject Re: Problems recovering a dead node
Date Wed, 04 May 2011 05:54:30 GMT

Hi Aaron

It has no data files whatsoever. The upgrade path is 0.7.4 -> 0.7.5. It
turns out the initial problem was the sw raid failing silently because
of another faulty disk.

Now that the storage is working, I brought up the node again, same IP,
same token and tried doing nodetool repair. 

All adjacent nodes have finished the streaming session, and now the node
has a total of 248 GB of data. Is this normal when the load per node is
about 18GB? 

Also there are 1245 pending tasks. It's been compacting or rebuilding
sstables for the last 8 hours non stop. There are 2057 sstables in the
data folder.

Should I have done thing differently or is this the normal behaviour?

Thanks!

El mié, 04-05-2011 a las 07:54 +1200, aaron morton escribió:
> When you say "it's clean" does that mean the node has no data files ?
> 
> After you replaced the disk what process did you use to recover  ?
> 
> Also what version are you running and what's the recent upgrade history ?
> 
> Cheers
> Aaron
> 
> On 3 May 2011, at 23:09, Héctor Izquierdo Seliva wrote:
> 
> > Hi everyone. One of the nodes in my 6 node cluster died with disk
> > failures. I have replaced the disks, and it's clean. It has the same
> > configuration (same ip, same token).
> > 
> > When I try to restart the node it starts to throw mmap underflow
> > exceptions till it closes again.
> > 
> > I tried setting io to standard, but it still fails. It gives errors
> > about two decorated keys being different, and the EOFException.
> > 
> > Here is an excerpt of the log
> > 
> > http://pastebin.com/ZXW1wY6T
> > 
> > I can provide more info if needed. I'm at a loss here so any help is
> > appreciated.
> > 
> > Thanks all for your time
> > 
> > Héctor Izquierdo
> > 
> 



Mime
View raw message