cassandra-user mailing list archives

From aaron morton <aa...@thelastpickle.com>
Subject Re: CAn't bootstrap a new node to my cluster
Date Mon, 12 Mar 2012 08:50:53 GMT
It may be the case that the joining node does not have enough information. But there is a default
30 second delay while the node waits for the ring information to stabilise. 

What version are you using ? 

Next time you add a new node, can you try it with logging set to DEBUG? If you get the error,
please add it to https://issues.apache.org/jira/browse/CASSANDRA with the relevant logs. 

Cheers

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 11/03/2012, at 9:11 PM, Cyril Scetbon wrote:

> After more than 9 hours, I restarted the node and reran the join command (data, cache and
> commitlog were not erased), and now the node reaches the Normal state in less than a second:
> 
> nodetool -h localhost ring
> Address         DC          Rack        Status State   Load            Owns    Token
>                                                                               155962751505430129087380028406227096917
> 10.0.0.61       DC1         RAC1        Up     Normal  72.28 GB        8.33%   0
> 10.0.0.62       DC1         RAC1        Up     Normal  72.18 GB        8.33%   14178431955039102644307275309657008810
> 10.0.1.61       DC2         RAC1        Up     Normal  71.88 GB        8.33%   28356863910078205288614550619314017621
> 10.0.1.62       DC2         RAC1        Up     Normal  107.41 GB       8.33%   42535295865117307932921825928971026432
> 10.0.0.63       DC1         RAC1        Up     Normal  72.21 GB        8.33%   56713727820156410577229101238628035242
> 10.0.0.64       DC1         RAC1        Up     Normal  71.91 GB        8.33%   70892159775195513221536376548285044053
> 10.0.1.63       DC2         RAC1        Up     Normal  72.81 GB        8.33%   85070591730234615865843651857942052864
> 10.0.1.70       DC2         RAC1        Up     Normal  104.76 KB       7.05%   97059533058214537253124691979661801455
> 10.0.1.64       DC2         RAC1        Up     Normal  71.23 GB        1.29%   99249023685273718510150927167599061674
> 10.0.0.65       DC1         RAC1        Up     Normal  71.68 GB        8.33%   113427455640312821154458202477256070485
> 10.0.0.66       DC1         RAC1        Up     Normal  72.12 GB        8.33%   127605887595351923798765477786913079296
> 10.0.1.65       DC2         RAC1        Up     Normal  71.37 GB        8.33%   141784319550391026443072753096570088106
> 10.0.1.66       DC2         RAC1        Up     Normal  70.94 GB        8.33%   155962751505430129087380028406227096917
> 
> Do you see the distribution? Isn't it weird? I have done this many times, and each time the
> first join raises an error about not being able to join seed nodes. It seems that the longer
> I wait, the higher the percentage of data it owns ...
> 
> 
> 
> On 3/10/12 9:49 PM, Cyril Scetbon wrote:
>> Hi all,
>> 
>> When I try the join command I have the following error :
>> Exception in thread "main" java.lang.RuntimeException: No other nodes seen!  Unable to bootstrap.If you intended to start a single-node cluster, you should make sure your broadcast_address (or listen_address) is listed as a seed.  Otherwise, you need to determine why the seed being contacted has no knowledge of the rest of the cluster.  Usually, this can be solved by giving all nodes the same seed list.
>>       at org.apache.cassandra.dht.BootStrapper.getBootstrapSource(BootStrapper.java:168)
>>       at org.apache.cassandra.dht.BootStrapper.getBalancedToken(BootStrapper.java:150)
>> 
>> and see the messages below in the seed log :
>> ==> /var/log/cassandra/output.log <==
>> INFO 19:55:39,554 InetAddress /10.0.1.70 is now dead.
>> 
>> ==> /var/log/cassandra/system.log <==
>> INFO [GossipStage:1] 2012-03-10 19:55:39,554 Gossiper.java (line 818) InetAddress /10.0.1.70 is now dead.
>> 
>> however the node I try to add finds the topology of the cluster :
>> loaded network topology /10.0.1.62:[Ljava.lang.String;@53533ff4, /10.0.1.63:[Ljava.lang.String;@7e8bd6e8,
>> /10.0.1.61:[Ljava.lang.String;@eca5a40, /10.0.0.66:[Ljava.lang.String;@6eb2756, /10.0.0.65:[Ljava.lang.String;@386382a6,
>> /10.0.0.64:[Ljava.lang.String;@d167d36, /10.0.0.61:[Ljava.lang.String;@3f766af0, /10.0.0.62:[Ljava.lang.String;@7766d31c,
>> /10.0.0.63:[Ljava.lang.String;@7fae1081, /10.0.1.70:[Ljava.lang.String;@7cbdc6a3, /10.0.1.65:[Ljava.lang.String;@f3c3750,
>> /10.0.1.64:[Ljava.lang.String;@3a536d00, /10.0.1.66:[Ljava.lang.String;@3b1938ea
>> 
>> The node says it's in joining mode but does not receive any token :
>> 
>> Mode: JOINING
>> Not sending any streams.
>> Not receiving any streams.
>> Pool Name                    Active   Pending      Completed
>> Commands                        n/a         0             12
>> Responses                       n/a         0           5490
>> 
>> any idea ?
>> 
> 
> 
> -- 
> Cyril SCETBON
> 
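
The "Owns" column in the quoted ring output can be reproduced from the tokens alone. The sketch below assumes the cluster uses RandomPartitioner (token space of 2**127), which the thread does not state but which is consistent with the evenly spaced tokens shown. It also assumes each node owns the range from the previous token (exclusive) up to its own token (inclusive). The function name `ownership` is hypothetical and is not part of Cassandra or nodetool.

```python
from fractions import Fraction

# Assumption: RandomPartitioner, whose tokens live on a ring of size 2**127.
RING_SIZE = 2 ** 127


def ownership(tokens):
    """Map each token to the fraction of the ring its node owns.

    A node owns the range (previous token, own token], wrapping around
    the ring for the node with the smallest token.
    """
    ordered = sorted(set(tokens))
    owned = {}
    for i, token in enumerate(ordered):
        prev = ordered[i - 1]  # for i == 0 this is the largest token (wrap-around)
        # A zero span can only happen with a single node, which owns the whole ring.
        span = (token - prev) % RING_SIZE or RING_SIZE
        owned[token] = Fraction(span, RING_SIZE)
    return owned


# The 12 evenly spaced tokens from the ring output, plus the token that the
# bootstrapping node 10.0.1.70 ended up with.
balanced = [i * RING_SIZE // 12 for i in range(12)]
new_node = 97059533058214537253124691979661801455
shares = ownership(balanced + [new_node])

print(f"{float(shares[new_node]) * 100:.2f}%")     # share of 10.0.1.70
print(f"{float(shares[balanced[7]]) * 100:.2f}%")  # share of 10.0.1.64
```

Under these assumptions the two printed values correspond to the 7.05% and 1.29% rows in the ring output: the new node's token landed just below that of 10.0.1.64, so it took over most of that node's range instead of splitting a range evenly.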

