hbase-user mailing list archives

From Guang Gao <birdeey...@gmail.com>
Subject Re: RegionServer unable to connect to master
Date Wed, 29 Jan 2014 19:17:00 GMT
You mean the SSH key? Yes, any two nodes can SSH to each other without a password.

On Wed, Jan 29, 2014 at 2:10 PM, Fernando Iwamoto - Plannej
<fernando.iwamoto@plannej.com.br> wrote:
> Did you try to pass the key to the machines?
>
>
> 2014-01-29 birdeeyore <birdeeyore@gmail.com>
>
>> Thanks for your reply. Here's some additional info. Thanks.
>>
>> $ cat hbase-site.xml
>> <configuration>
>>   <property>
>>     <name>hbase.cluster.distributed</name>
>>     <value>true</value>
>>   </property>
>>   <property>
>>     <name>hbase.rootdir</name>
>>     <value>hdfs://obelix8.local:9001/hbase</value>
>>   </property>
>>   <property>
>>     <name>hbase.zookeeper.quorum</name>
>>     <value>obelix105.local,obelix106.local,obelix107.local</value>
>>   </property>
>>   <property>
>>     <name>hbase.zookeeper.property.clientPort</name>
>>     <value>2183</value>
>>   </property>
>>   <property>
>>     <name>hbase.zookeeper.peerport</name>
>>     <value>2890</value>
>>   </property>
>>   <property>
>>     <name>hbase.zookeeper.leaderport</name>
>>     <value>3890</value>
>>   </property>
>>   <property>
>>     <name>hbase.zookeeper.property.dataDir</name>
>>     <value>/ssd/hbase/hbase-0.94.16/zookeeper</value>
>>   </property>
>>   <property>
>>     <name>hbase.master</name>
>>     <value>obelix8.local:60000</value>
>>   </property>
>>   <property>
>>     <name>hbase.master.info.port</name>
>>     <value>50070</value>
>>   </property>
>>   <property>
>>     <name>hbase.client.scanner.caching</name>
>>     <value>200</value>
>>   </property>
>> </configuration>
>>
>> ==========================
>>
>> $ cat regionservers
>> obelix105.local
>> obelix106.local
>> obelix107.local
>> obelix108.local
>> obelix109.local
>> obelix110.local
>> obelix111.local
>> obelix112.local
>> obelix113.local
>> obelix114.local
>>
>> =========================
>> On my master node:
>>
>> $ cat /etc/hosts
>> 127.0.0.1       localhost
>> 192.168.245.8      obelix8.local xx.yy.net      obelix8
>>
>> # The following lines are desirable for IPv6 capable hosts
>> ::1     ip6-localhost ip6-loopback
>> fe00::0 ip6-localnet
>> ff00::0 ip6-mcastprefix
>> ff02::1 ip6-allnodes
>> ff02::2 ip6-allrouters
>> 192.168.245.1   obelix.local
>>
>> ===============
>>
>> On one of my slave nodes:
>>
>> $ cat /etc/hosts
>> 127.0.0.1       localhost
>> 127.0.1.1       obelix105.local xx.yy.net    obelix105
>>
>> # The following lines are desirable for IPv6 capable hosts
>> ::1     ip6-localhost ip6-loopback
>> fe00::0 ip6-localnet
>> ff00::0 ip6-mcastprefix
>> ff02::1 ip6-allnodes
>> ff02::2 ip6-allrouters
>> 192.168.245.1   obelix.local
>>
>> ==========================
>>
>> The error of HBase 0.94.16+Hadoop 1.2.1:
>>
>> 2014-01-29 12:58:30,922 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Attempting connect to Master server at obelix8.local,60000,1391018303918
>> 2014-01-29 12:58:40,960 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to connect to master. Retrying. Error was:
>> java.net.SocketException: Invalid argument
>>         at sun.nio.ch.Net.connect(Native Method)
>>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:532)
>>         at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:192)
>>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:511)
>>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:481)
>>         at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupConnection(HBaseClient.java:392)
>>         at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:438)
>>         at org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:1141)
>>         at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:988)
>>         at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:87)
>>         at $Proxy9.getProtocolVersion(Unknown Source)
>>         at org.apache.hadoop.hbase.ipc.WritableRpcEngine.getProxy(WritableRpcEngine.java:141)
>>         at org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:208)
>>         at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:2043)
>>         at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:2089)
>>         at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:747)
>>         at java.lang.Thread.run(Thread.java:662)
>>
>> Best,
>>
>> Boduo
>>
>>
>> On Wed, Jan 29, 2014 at 8:21 AM, Jean-Marc Spaggiari
>> <jean-marc@spaggiari.org> wrote:
>> > Hi,
>> >
>> > can you please share your config files and your host file?
>> >
>> > Thanks,
>> >
>> > JM
>> >
>> >
>> > 2014-01-29 Guang Gao <birdeeyore@gmail.com>
>> >
>> >> Hi all,
>> >>
>> >> This is my first time setting up HBase on a 10-node cluster. I tried
>> >> two settings: HBase 0.94.16+Hadoop 1.2.1, and HBase 0.96.1.1+Hadoop 2.2.0.
>> >> In both cases, the region servers can't connect to the master properly.
>> >>
>> >> [HBase 0.94.16+Hadoop 1.2.1]:
>> >>
>> >> 2014-01-29 00:55:06,875 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to connect to master. Retrying. Error was:
>> >> java.net.SocketException: Invalid argument
>> >>         at sun.nio.ch.Net.connect(Native Method)
>> >>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:532)
>> >>         at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:192)
>> >>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:511)
>> >>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:481)
>> >>         at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupConnection(HBaseClient.java:392)
>> >>         at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:438)
>> >>         at org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:1141)
>> >>         at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:988)
>> >>         at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:87)
>> >>         at $Proxy9.getProtocolVersion(Unknown Source)
>> >>         at org.apache.hadoop.hbase.ipc.WritableRpcEngine.getProxy(WritableRpcEngine.java:141)
>> >>         at org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:208)
>> >>         at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:2043)
>> >>         at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:2089)
>> >>         at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:747)
>> >>         at java.lang.Thread.run(Thread.java:662)
>> >>
>> >>
>> >>
>> >> [HBase 0.96.1.1+Hadoop 2.2.0]:
>> >>
>> >> 2014-01-28 18:50:54,587 WARN  [regionserver60020] regionserver.HRegionServer: error telling master we are up
>> >> com.google.protobuf.ServiceException: java.net.SocketException: Invalid argument
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1670)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1711)
>> >>         at org.apache.hadoop.hbase.protobuf.generated.RegionServerStatusProtos$RegionServerStatusService$BlockingStub.regionServerStartup(RegionServerStatusProtos.java:5402)
>> >>         at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1926)
>> >>         at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:784)
>> >>         at java.lang.Thread.run(Thread.java:662)
>> >> Caused by: java.net.SocketException: Invalid argument
>> >>         at sun.nio.ch.Net.connect(Native Method)
>> >>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:532)
>> >>         at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:192)
>> >>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:529)
>> >>         at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:493)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupConnection(RpcClient.java:575)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:860)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient.getConnection(RpcClient.java:1535)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1424)
>> >>         at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1653)
>> >>         ... 5 more
>> >>
>> >>
>> >>
>> >> I tried replacing lib/hadoop-core-xxx.jar in HBase 0.94.16 with the jar from
>> >> Hadoop 1.2.1, and replacing lib/hadoop-*.jar with those from Hadoop 2.2.0, but
>> >> had no luck. Does anyone have any idea?
>> >>
>> >> Thanks.
>> >>
>>
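For anyone comparing the two /etc/hosts files quoted above, here is a minimal sketch that lists which names a hosts file maps to a loopback address. The function name `loopback_names` and the `SLAVE_HOSTS` constant are hypothetical and used only for illustration; the sample text is copied from the slave-node file in this thread. The thread does not establish whether such a mapping is related to the SocketException, so treat this as one possible check rather than a diagnosis.

```python
import ipaddress


def loopback_names(hosts_text):
    """Return {hostname: address} for every name mapped to a loopback address."""
    result = {}
    for line in hosts_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        try:
            addr = ipaddress.ip_address(fields[0])
        except ValueError:
            # Not a valid IP address in the first column; skip the line.
            continue
        if addr.is_loopback:
            for name in fields[1:]:
                # Keep the first mapping seen for a name.
                result.setdefault(name, fields[0])
    return result


# Slave-node /etc/hosts as quoted in the thread (hypothetical constant name).
SLAVE_HOSTS = """\
127.0.0.1       localhost
127.0.1.1       obelix105.local xx.yy.net    obelix105

# The following lines are desirable for IPv6 capable hosts
::1     ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
192.168.245.1   obelix.local
"""

if __name__ == "__main__":
    for name, addr in loopback_names(SLAVE_HOSTS).items():
        print(f"{name} -> {addr}")
```

Running the same function over the master-node file gives a quick side-by-side view of how each machine resolves its own hostname locally.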
